Workshop Kits
Incident Readiness Workshop
An incident readiness workshop checks whether teams can detect, escalate, mitigate, communicate, and learn from production failure.
Inputs
Bring:
- Service catalog entries.
- Current alert list.
- Dashboards and runbooks.
- On-call and escalation paths.
- Recent incident examples.
- SLOs or critical workflow definitions.
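Before the session, it can help to check which services are missing any of these inputs. The sketch below is one way to do that, assuming the service catalog can be exported as a list of dictionaries. The field names (`owner`, `runbook_url`, `dashboard_url`, `escalation_path`, `slo`) and the sample entries are hypothetical, not a real catalog schema.

```python
# Hypothetical field names; adapt them to whatever your service catalog stores.
REQUIRED_FIELDS = ("owner", "runbook_url", "dashboard_url", "escalation_path", "slo")


def missing_inputs(entry: dict) -> list[str]:
    """Return the required fields that are absent or empty in one catalog entry."""
    return [name for name in REQUIRED_FIELDS if not entry.get(name)]


def readiness_gaps(catalog: list[dict]) -> dict[str, list[str]]:
    """Map each service name to its missing inputs, skipping complete entries."""
    gaps = {}
    for entry in catalog:
        missing = missing_inputs(entry)
        if missing:
            gaps[entry.get("name", "<unnamed>")] = missing
    return gaps


# Illustrative data only.
catalog = [
    {
        "name": "checkout",
        "owner": "payments",
        "runbook_url": "https://example.com/runbooks/checkout",
        "dashboard_url": "https://example.com/dashboards/checkout",
        "escalation_path": "payments-oncall",
        "slo": "99.9% successful checkouts",
    },
    {
        "name": "search",
        "owner": "",
        "runbook_url": None,
        "dashboard_url": "https://example.com/dashboards/search",
        "escalation_path": "search-oncall",
        "slo": None,
    },
]

gaps = readiness_gaps(catalog)
for service, fields in gaps.items():
    print(f"{service}: missing {', '.join(fields)}")
```

Services that appear in the result are good candidates for the scope discussion, since a missing owner or runbook is itself a finding.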
Agenda
| Topic |
|---|
| Scope and critical services |
| Alert and escalation review |
| Runbook walkthrough |
| Tabletop failure scenario |
| Communications and stakeholder updates |
| Gaps, owners, and next steps |
Related pages
- Incident Management
- Incident Review Checklist
- Postmortem Template
- Runbook Template
- On-Call and Alerting
- SLO Implementation
Tabletop prompts
- Primary database is unavailable.
- Latest deployment caused elevated errors.
- Third-party dependency is timing out.
- Queue backlog is growing rapidly.
- Production credential was exposed.
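One possible way to record what each prompt reveals is to note gaps against the five capabilities the workshop checks (detect, escalate, mitigate, communicate, learn). The sketch below assumes that structure. The `TabletopResult` class and the example gap notes are hypothetical and only illustrate the shape of the record.

```python
from dataclasses import dataclass, field

# The five stages come from the workshop's stated goal; the rest is hypothetical.
STAGES = ("detect", "escalate", "mitigate", "communicate", "learn")


@dataclass
class TabletopResult:
    prompt: str
    # Stage name -> note describing the gap; stages without a gap are omitted.
    gaps: dict[str, str] = field(default_factory=dict)

    def action_items(self) -> list[str]:
        """Turn each recorded gap into a one-line action item, in stage order."""
        unknown = set(self.gaps) - set(STAGES)
        if unknown:
            raise ValueError(f"unknown stages: {sorted(unknown)}")
        return [
            f"[{stage}] {self.prompt}: {self.gaps[stage]}"
            for stage in STAGES
            if stage in self.gaps
        ]


# Illustrative notes only, not findings from a real exercise.
result = TabletopResult(
    prompt="Primary database is unavailable",
    gaps={
        "mitigate": "failover runbook missing",
        "detect": "no alert on replication lag",
    },
)
items = result.action_items()
for item in items:
    print(item)
```

Keeping gaps keyed by stage makes it easier to see whether a scenario failed at detection or later, which in turn suggests who should own the follow-up.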
Outputs
- Alert cleanup backlog.
- Runbook gaps.
- Escalation path fixes.
- SLO or dashboard improvements.
- Incident process action items.
Watchouts
- If responders cannot find owners quickly, fix ownership first.
- If alerts are noisy, readiness is already degraded: responders who routinely dismiss pages are slower to recognize a real one.
- Tabletop exercises should produce action items, not just confidence.
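To make "noisy" concrete during the alert review, one rough measure is the share of an alert's firings that led to any responder action. The sketch below assumes the alert history can be reduced to `(alert_name, was_actionable)` pairs. The function name, the thresholds (`min_firings`, `max_actionable_ratio`), and the sample history are assumptions for illustration, not a standard.

```python
from collections import Counter


def noisy_alerts(firings, min_firings=5, max_actionable_ratio=0.5):
    """Return alert names that fire often but are rarely actionable.

    firings: iterable of (alert_name, was_actionable) tuples.
    Both thresholds are arbitrary defaults; tune them to your own history.
    """
    total = Counter()
    actionable = Counter()
    for name, was_actionable in firings:
        total[name] += 1
        if was_actionable:
            actionable[name] += 1
    return sorted(
        name
        for name, count in total.items()
        if count >= min_firings and actionable[name] / count < max_actionable_ratio
    )


# Illustrative history only.
history = (
    [("disk_usage_warning", False)] * 8
    + [("disk_usage_warning", True)] * 2
    + [("checkout_error_rate", True)] * 6
    + [("cert_expiry", False)] * 3
)
flagged = noisy_alerts(history)
print(flagged)
```

Alerts flagged this way are candidates for the alert cleanup backlog. Alerts with too few firings are skipped because a ratio over a handful of events says little.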