Data Pipeline Orchestration Questions
Practice questions for the Data Pipeline Orchestration topic in the Google Cloud Associate Data Practitioner certification. 38 questions cover this domain.
Which statement about BigQuery scheduled queries is correct?
A data engineer wants to visually build and manage integration pipelines by connecting sources, transforms, and sinks through plugins in a web interfa...
Which service is designed for unified batch and streaming data processing at scale and provides exactly-once processing by default?
A team wants to build ELT workflows in BigQuery by using SQLX, Git integration, dependency management, and assertions for data quality tests. Which se...
A workflow must coordinate several steps across cloud and on-premises systems, run on a schedule, and expose DAG-level monitoring and retries. Which o...
An organization wants a central event bus with many-to-many routing, event transformation, and cross-project delivery for a growing event-driven archi...
A user configures a BigQuery scheduled query to read source data in one region and write results to a destination table in another region. What happen...
Which Google Cloud service provides a fully managed Apache Airflow environment for creating Python DAGs that schedule and monitor workflows?
A pipeline must read Pub/Sub events, aggregate them across time windows, and then write summarized results to BigQuery. Which design is the best fit?
A daily Dataflow job sometimes fails because BigQuery quotas are exhausted. Which Cloud Composer pattern most directly improves resilience for this de...
A data team wants to manage SQL transformation pipelines in BigQuery using version control, dependency graphs, and tests. Which Google Cloud product i...
Which Google Cloud service provides managed Apache Airflow for orchestrating data pipelines?
A team must orchestrate a sequence of BigQuery SQL transformations with dependencies and run them daily. Which combination is the most idiomatic Googl...
A Cloud Composer DAG must be scheduled to run hourly with retries on failure. Which Airflow construct controls how often the DAG runs?
A pipeline must continue running historical backfills while keeping current daily runs on schedule, without manually triggering each historical date. ...
A team wants different DAGs in Cloud Composer to share Python utility code without duplicating files in each DAG. Which Airflow feature should they us...
A pipeline must trigger downstream processing as soon as a new object is created in a Cloud Storage bucket. Which event-driven approach is most approp...
Which Google Cloud service runs cron-style scheduled jobs that can invoke HTTP endpoints, App Engine targets, or Pub/Sub topics?
A Cloud Composer DAG reads a file from Cloud Storage and should wait until the file exists before proceeding. Which Airflow operator is the most appro...
Which BigQuery scheduling capability allows a SQL query to run automatically on a recurring basis and write results to a destination table?