MLS-C01
Data Engineering
medium
Question 5 of 40

A team needs to run distributed Apache Spark jobs to preprocess petabyte-scale datasets stored in Amazon S3 before ML training. They need to be able to choose specific instance types and use Spot Instances. Which AWS service best fits these requirements?

A. AWS Glue
B. Amazon EMR
C. AWS Batch
D. Amazon Athena

