Applying site reliability engineering practices Questions

Practice questions for the "Applying site reliability engineering practices" topic of the Google Professional Cloud DevOps Engineer certification. 37 questions cover this domain.

37 questions: 9 easy, 18 medium, 10 hard

medium

Why is a 100% SLO generally considered a poor target in SRE guidance?

medium

A service is close to exhausting its error budget. According to SRE guidance, which action is most appropriate?

easy

A service has a 99.9% SLO over its compliance period. What error budget does that imply?
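As a rough aid for the arithmetic behind this kind of question, the sketch below converts an SLO percentage into the fraction of events allowed to be bad, and then into minutes of full downtime. The function names and the 30-day window are illustrative assumptions; the question itself does not state the length of the compliance period.

```python
def error_budget_fraction(slo_percent: float) -> float:
    """Return the fraction of events allowed to be bad for an SLO given in percent."""
    return (100.0 - slo_percent) / 100.0


def allowed_bad_minutes(slo_percent: float, window_days: int) -> float:
    """Express the budget as minutes of complete unavailability over a window."""
    minutes_in_window = window_days * 24 * 60
    return error_budget_fraction(slo_percent) * minutes_in_window


# Assumption: a 30-day compliance period, chosen only for illustration.
budget = error_budget_fraction(99.9)
minutes = allowed_bad_minutes(99.9, 30)
print(f"budget: {budget:.4%} of events, about {minutes:.1f} minutes of full downtime")
```

The downtime figure only applies when the SLO is measured in time; for a request-based SLO the budget is a share of requests rather than minutes.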

hard

A team argues that deploying a service in two zones guarantees near-perfect availability because the two instances are independent. What is the main f...

hard

A dependency owned by another team causes your service to miss its SLO. Which response is presented as the more user-centered approach in the SRE Work...

medium

You only have a latency percentile metric per 10-minute interval. Which SLO type is the appropriate fit?

medium

Which activity is the clearest example of toil?

medium

Which compliance period is more closely aligned with recent user experience because it continuously evaluates the latest interval, such as the last 30...

easy

Which definition best describes a service-level indicator (SLI)?

Q10

easy

Which SRE concept describes the explicit maximum proportion of bad events a service is allowed to experience within the SLO compliance period?

Q11

medium

A service's error budget is being consumed faster than expected due to dependency failures outside the team's control. The SRE Workbook recommends a p...

Q12

easy

According to SRE guidance, what type of work is toil most likely to crowd out if left uncontrolled?

Q13

easy

What is the key difference between a calendar-window SLO and a rolling-window SLO?
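One way to picture the two window types is to compute where each window starts for the same moment in time. This is a minimal sketch with hypothetical helper names, assuming a calendar window that resets at the start of each month and a rolling window of 30 days.

```python
from datetime import datetime, timedelta


def calendar_window(now: datetime) -> tuple[datetime, datetime]:
    """Window from the first instant of the current calendar month up to now."""
    start = now.replace(day=1, hour=0, minute=0, second=0, microsecond=0)
    return start, now


def rolling_window(now: datetime, days: int = 30) -> tuple[datetime, datetime]:
    """Window covering the most recent `days` days, sliding forward with now."""
    return now - timedelta(days=days), now


# Assumption: an arbitrary evaluation time early in a month.
now = datetime(2024, 3, 5, 12, 0)
cal_start, _ = calendar_window(now)
roll_start, _ = rolling_window(now)
print("calendar window starts:", cal_start)
print("rolling window starts: ", roll_start)
```

Early in a month the calendar window holds only a few days of data, while the rolling window still covers the full trailing 30 days.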

Q14

medium

A service team reviews their SLO monthly and consistently achieves far above the target, maintaining a large unused error budget. According to SRE gu...

Q15

medium

A team is designing SLIs for a batch data-processing pipeline. Requests are not user-facing; instead, correctness of processed records matters. Which ...

Q16

hard

A team's SLO is based on request availability. Over the past compliance period, they experienced a 2-hour full outage and several shorter degraded-ava...

Q17

medium

A new microservice is being instrumented for its first SLI. The team debates using availability vs. latency as the primary SLI. According to SRE guida...

Q18

hard

A team experiences alert fatigue because their CPU-based alert fires every night due to a scheduled batch job, but the batch job never causes a user-f...

Q19

medium

A team wants to reduce on-call burden. Their first step is to identify which alerts are consuming the most engineer time. Which type of analysis direc...

Q20

medium

A team designs an SLO for a data pipeline. The pipeline must process all records without dropping any. Which SLI type from the SRE Workbook is most ap...
