ALL
POSTS
41 posts so far.
March 13, 2026Docker9 min read
How a DigitalOcean Firewall Rule Silently Dropped 23% of Production Traffic for 11 Days
Intermittent user timeouts, normal server metrics, and zero firewall logs — how a stateless firewall rule was killing TCP connections before they reached Nginx, and why it took eleven days to find it.
March 12, 2026Docker8 min read
How a Redis Connection Leak Crashed Our AWS ECS Cluster at 3AM
A Redis client spawned inside getServerSideProps accumulated 8,847 open connections over six hours, OOM-killed every ECS task, and took the service down for 47 minutes before we found the root cause.