Skip to content
DEP
Debugging and Deploying
hard
Question 5 of 20

A data engineer investigates a slow Spark stage in the Spark UI. Most tasks complete in 2–5 seconds, but one task takes over 10 minutes. Which observation in the Spark UI most strongly indicates that data skew is the root cause?

AUniformly high GC time across all executor summaries
BA large variance in input data size across tasks, where one task reads significantly more data than the others
CHigh aggregate shuffle write bytes in the stage summary metrics
DMultiple failed task attempts shown in the task list

Educational Content — CertQnA practice questions are written against official exam objectives, covering the same domains tested on the real exam. All content is original and independent — not actual exam questions, not affiliated with any certification vendor. Learn more about our content policy

Discussion

Be the first to share your understanding of this concept

⚠️ Discussion is for concept clarification only. Do not share or request actual exam questions or answers.

Sign in to join the discussion