Skip to content

Maintaining and automating data workloads Questions

Practice questions for Maintaining and automating data workloads topic in Google Cloud Professional Data Engineer. 35 questions covering this domain.

35 questions9 easy18 medium8 hard
Q1
medium

A serverless orchestration workflow is failing intermittently. Which built-in observability feature can help identify the exact failing step and revie...

Q2
medium

A governance team wants alerts when scheduled data quality scans fail or when scores fall below a target. Which capability supports that requirement?

Q3
hard

A company wants data quality scans to run with least privilege, respect row-level and column-level access policies, provide clearer audit attribution,...

Q4
medium

An engineer needs detailed metadata about recent jobs, streaming errors, and reservation utilization to troubleshoot workload behavior in near real ti...

Q5
medium

A platform team wants test queries to stop competing with production queries for slot capacity. Which design best supports that goal?

Q6
medium

A team wants to analyze data quality results across many scans in a dashboarding tool. Which supported approach should they use?

Q7
easy

When do BigQuery query metrics become available in Cloud Monitoring?

Q8
medium

A data steward wants to validate data quality for one BigQuery table on a schedule using built-in rules and custom SQL checks managed by the governanc...

Q9
easy

In BigQuery capacity-based pricing, what are the pools of slots called?

Q10
medium

A BigQuery workload has bursty demand, and the team wants to use capacity-based pricing while having slots scale up automatically when needed and down...

Q11
easy

Which type of audit log records read, list, and other access operations on user data and is disabled by default for many services in Google Cloud?

Q12
medium

A team needs to stop a streaming Dataflow job cleanly so that all in-flight data is processed and committed before workers shut down. Which Dataflow o...

Q13
easy

Which Google Cloud service collects metrics, dashboards, alerts, and uptime checks for resources, including BigQuery and Dataflow?

Q14
medium

A Managed Service for Apache Airflow team wants tasks that fail to retry automatically a few times before being marked as failed. Which Airflow capabi...

Q15
hard

A finance team wants to minimize storage costs for objects that are rarely accessed after 30 days but must remain immediately accessible. Which Cloud ...

Q16
hard

A streaming Dataflow pipeline accumulates a growing system lag and Pub/Sub backlog over several hours. Which combination of actions is most likely to ...

Q17
hard

A BigQuery workload has high cost from a single recurring report query. The query joins a large fact table with several small dimension tables. Which ...

Q18
medium

An engineer notices that BigQuery queries scan many more bytes than expected. Which query optimization is generally recommended to reduce scanned byte...

Q19
medium

A BigQuery slot utilization alert fires when average slot utilization exceeds 90% for a sustained period, indicating a need to increase capacity. Whic...

Q20
medium

A streaming Dataflow pipeline must be upgraded to a new version of the pipeline code with new transforms while keeping it running without losing in-fl...

Sign in to see all 35 questions

Create a free account to browse all questions — completely free during our launch phase.