**Scott Leggett** @smlx@fosstodon.org · Feb 04, 2025, 01:44

**Scott Leggett** @smlx@fosstodon.org · Feb 04, 2025, 01:44

Scott Leggett @smlx@fosstodon.org

Feb 04, 2025, 01:44

Scott Leggett @smlx@fosstodon.org

I see some common ingrained misunderstandings around "Incident Reviews" / "Post Mortems" in technical orgs.

🧵

**Scott Leggett** @smlx@fosstodon.org · Feb 04, 2025, 01:44

**Scott Leggett** @smlx@fosstodon.org · Feb 04, 2025, 01:44

Feb 04, 2025, 01:44

Scott Leggett @smlx@fosstodon.org

1. There is no single "root cause". IMO this term is harmful because, while it makes for an easily graspable concept, the metaphor encourages identifying a _single_ cause. There is _never_ a single reason behind an incident. Instead there are always several "contributing factors".

Show thread

**Scott Leggett** @smlx@fosstodon.org · Feb 04, 2025, 01:45

**Scott Leggett** @smlx@fosstodon.org · Feb 04, 2025, 01:45

Feb 04, 2025, 01:45

Scott Leggett @smlx@fosstodon.org

2. "Human error" is _never_ a contributing factor (or "root cause" 🤬). The problem is that until Human 2.0 comes out it is completely unfixable. Humans don't make decisions or take actions in a vacuum. There is _always_ an outdated procedure, bad policy, false belief, missing documentation, poor tooling, or lack of training behind a mistake made by a human. That is something you can fix!

#DevOps #platform #sre #infosec

**Dean** @dean@librem.one · 2025-02-04T03:20:32Z

Dean @dean@librem.one

@smlx yeah, the airline industry has had this mindset for decades and makes it the most (?) safe form of travel. We can learn and become a true engineering discipline.

Feb 04, 2025, 03:20 · Mastodon for Android · · ·