Maintaining and automating data workloads Questions
Practice questions for the "Maintaining and automating data workloads" topic of the Google Cloud Professional Data Engineer certification. 35 questions cover this domain.
A serverless orchestration workflow is failing intermittently. Which built-in observability feature can help identify the exact failing step and revie...
A governance team wants alerts when scheduled data quality scans fail or when scores fall below a target. Which capability supports that requirement?
A company wants data quality scans to run with least privilege, respect row-level and column-level access policies, provide clearer audit attribution,...
An engineer needs detailed metadata about recent jobs, streaming errors, and reservation utilization to troubleshoot workload behavior in near real ti...
A platform team wants test queries to stop competing with production queries for slot capacity. Which design best supports that goal?
A team wants to analyze data quality results across many scans in a dashboarding tool. Which supported approach should they use?
When do BigQuery query metrics become available in Cloud Monitoring?
A data steward wants to validate data quality for one BigQuery table on a schedule using built-in rules and custom SQL checks managed by the governanc...
In BigQuery capacity-based pricing, what are the pools of slots called?
A BigQuery workload has bursty demand, and the team wants to use capacity-based pricing while having slots scale up automatically when needed and down...
Which type of audit log records read, list, and other access operations on user data and is disabled by default for many services in Google Cloud?
A team needs to stop a streaming Dataflow job cleanly so that all in-flight data is processed and committed before workers shut down. Which Dataflow o...
Which Google Cloud service collects metrics, dashboards, alerts, and uptime checks for resources, including BigQuery and Dataflow?
A team using Managed Service for Apache Airflow wants failed tasks to retry automatically a few times before being marked as failed. Which Airflow capabi...
A finance team wants to minimize storage costs for objects that are rarely accessed after 30 days but must remain immediately accessible. Which Cloud ...
A streaming Dataflow pipeline accumulates a growing system lag and Pub/Sub backlog over several hours. Which combination of actions is most likely to ...
A BigQuery workload has high cost from a single recurring report query. The query joins a large fact table with several small dimension tables. Which ...
An engineer notices that BigQuery queries scan many more bytes than expected. Which query optimization is generally recommended to reduce scanned byte...
A BigQuery slot utilization alert fires when average slot utilization exceeds 90% for a sustained period, indicating a need to increase capacity. Whic...
A streaming Dataflow pipeline must be upgraded to a new version of the pipeline code with new transforms while keeping it running without losing in-fl...