Kill the flies before fighting fires

Kill the flies. Then you’ll finally have time for the fires.

Hot take: most teams don’t have an “incident response” problem. They have a noise economy problem. We celebrate the Friday-night P0 save and ignore the 300 pages that ate someone’s entire week. Half of those “real” alerts auto-resolve. That’s not resilience, it’s Stockholm syndrome.

If your on-call spends the week acknowledging PagerDuty, you’re not improving MTTR - you’re burning the team’s mean thinking time.

What I keep seeing:

The counterintuitive part: the P0s are not the real enemy. They’re where engineers actually learn the system (albeit a bit stressfully!).

The enemy is the swarm that keeps you from ever getting there.

Seriously, take the time to kill the flies.

More posts

AI agent learning beats demo flashiness

The coding interview needs to die

Hallucination rate is the wrong question