#post-mortem

Career ✍️ Hand-written Jun 16, 2026

One Night a Storage System Died and Changed How I Think About Software

One night a storage system died and I realized the problem was never the disks — it was assuming nothing would fail. On assumptions, trust, and safety.

#incident #reliability #post-mortem

5 min

Technology May 7, 2026

I Trusted a 1 GB RAM VPS Too Much: The OOM Story and Layered Defense

How I rode out the OOM (Out of Memory) crisis while running 13 containers on a 1 GB RAM VPS, how kcompactd0 captured the CPU, and the fixes I shipped...

#oom #vps #kernel

8 min

Technology ✍️ Hand-written May 7, 2026

3rd OOM on the VPS: Parallel Builds and a flock Mutex Story

My blog automation collided with another project's build. RAM ran out, sshd reset. Hard reboot + flock for a global build mutex.

#vps #oom #incident

9 min

Technology ✍️ Hand-written May 4, 2026

First OOM: kcompactd at 92% CPU, sshd Reset, Hard Reboot

RAM ran out on my VPS, swap filled up, sshd dropped the connection. When the Astro build triggered an OOM, I decided to put together a layered pipeline defense.

#oom #swap #incident

9 min

Career ✍️ Hand-written May 3, 2026

My Cleanup Script Killed the GitHub Runner: A Self-Inflicted Incident

My disk-cleanup.timer wiped the runner's _work/_temp directories. For 16 hours every cron exploded with 'Missing file: set_output_*'. A confession of…

#incident #github actions #cleanup

7 min

Career May 3, 2026

Cross-Team Tension During a Crisis: An Incident Story

Explore the causes and consequences of cross-team tension during a critical incident, and the steps needed to manage it. Effective leadership…

#career #incident management #team dynamics

9 min

Life May 2, 2026

The Post-Mortem Culture War: The Personal Cost of Learning From…

Learning from mistakes is a hard road. Look at the personal price tag behind post-mortem culture, the shift from blame to learning, and the individual…

#life #öğrenme #hata yönetimi

9 min

Technology ✍️ Hand-written Apr 27, 2026

An Evening of Quirk Hunting in My AI Content Pipeline: 3 Bugs, 1…

My AI content pipeline blew up with three different format quirks: a slashed tag, a quoted date, a dotted-i character. Solved with a single normalizer.

#ai automation #content pipeline #validator

8 min

Career Apr 25, 2026

Post-Mortems After Major Outages: The Engineer's Invisible Burden

A post-mortem after a major outage isn't just a technical review. Understanding and managing the psychological, invisible burden engineers carry through it…

#career #post-mortem #incident management

12 min

Klavye Kısayolları