On-call fatigue and the case for rest
Pagers do not care about duty time limits — what aviation rest rules taught me about sustainable on-call rotations and tired humans making tired mistakes.
13 posts on this topic.
Pagers do not care about duty time limits — what aviation rest rules taught me about sustainable on-call rotations and tired humans making tired mistakes.
Kubernetes schedules on promises and kills on limits — why copying YAML from a tutorial is not a sizing strategy, and what I do instead.
A personal note on habits from the cockpit that still show up when a cluster misbehaves.
You don't need every system green to operate safely — you need to know what's deferred, what's compensating, and when to stop.
When every ping is an emergency, none of them are — what aviation radio habits taught me about paging on-call engineers.
Why a staging cluster earns its keep when you treat it like recurrent training — not a cheaper copy of production you never fly.
Rolling back or stopping a release mid-flight is a trained maneuver — not an admission that the team is bad at their jobs.
Two-person review and a preflight checklist mindset for kubectl, Helm, and GitOps — borrowed from aviation, adapted for YAML.
Running a cluster at 100% utilisation is like planning a flight with zero alternate fuel — workable on paper until the weather shifts.
Notes from a Lufthansa pilot who started over in DevOps — no shortcut story, just what actually helped.
Who speaks, who listens, who decides — lessons from the cockpit for noisy war rooms.
Below ten thousand feet, pilots limit conversation to essentials — production deserves the same focus when the blast radius is real.
Procedures don't replace thinking — but they buy you time when the adrenaline kicks in.