Intermediate Documentation Types #runbook #sre #devops #operations

Runbook & Operational Procedures

5 exercises — trigger conditions, exact commands, verification steps, rollback procedures, and runbook maintenance. Write runbooks people can follow at 3am.

0 / 5 completed

Runbook writing checklist

Trigger: Exactly when to use this runbook. Symptoms, alert names, conditions.
Prerequisites: Access, credentials, tools needed before starting
Steps: Exact commands, expected outcomes, verification for each step
Escalation: Who to call if a step fails, and how to reach them
Rollback: How to undo each action safely
Last tested: Date and name — outdated runbooks cause incidents

1 / 5

An SRE is writing a runbook step for restarting the payments service. Which version is best for use at 3am during an incident?

2 / 5

A runbook begins: "This runbook covers restarting the auth service." A senior SRE says it's incomplete. The critical missing element in the runbook's opening is _____.

3 / 5

A runbook step says: "Scale up the consumer service." A new engineer is confused. Which rewritten step is clearest?

4 / 5

A runbook includes a section titled "Rollback Procedure." When should this section be used — and why is it essential?

5 / 5

After an engineer successfully follows a runbook to resolve an incident, they notice one of the steps was outdated and no longer matched the current Kubernetes namespace. What is the correct action regarding the runbook?