Engineering Notes
Short, practical lessons from building production systems. Each note captures one key insight.
Problem: A batch-processing script using an on-device LLM hung after 5-6 items. No timeout, no error; it was just stuck. The script created a fresh session for every item: for item in items: session = La…
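A minimal sketch of the likely fix, assuming a hypothetical Session class standing in for the real on-device LLM API (whose name is truncated above): construct one session outside the loop and reuse it, instead of allocating per item.

```python
class Session:
    """Stand-in for the real on-device LLM session object (hypothetical)."""
    instances = 0

    def __init__(self):
        # Each construction allocates model resources; creating one per
        # item is the pattern that eventually hung the original script.
        Session.instances += 1

    def generate(self, prompt: str) -> str:
        return f"response to {prompt!r}"


def process(items):
    # Fix: create the session once, outside the loop, and reuse it.
    session = Session()
    return [session.generate(item) for item in items]


results = process(["item 1", "item 2", "item 3"])
```

The stub only counts constructions; the point is the loop shape, not the API.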
Problem: AI coding assistants consume tokens, make tool calls, and modify files, but the only visibility you get is the chat output. There's no dashboard showing context-window usage, cost, or active tool…
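A sketch of the kind of lightweight tracking the note argues for. The context-window size and per-token rate below are illustrative placeholders, not real limits or pricing.

```python
from dataclasses import dataclass, field


@dataclass
class UsageTracker:
    """Accumulates per-session assistant usage (illustrative sketch)."""
    context_limit: int = 200_000         # assumed window size, placeholder
    price_per_1k_tokens: float = 0.01    # placeholder rate, not real pricing
    tokens_used: int = 0
    tool_calls: list = field(default_factory=list)

    def record(self, tokens: int, tool: str = "") -> None:
        self.tokens_used += tokens
        if tool:
            self.tool_calls.append(tool)

    @property
    def context_pct(self) -> float:
        return 100 * self.tokens_used / self.context_limit

    @property
    def cost(self) -> float:
        return self.tokens_used / 1000 * self.price_per_1k_tokens


tracker = UsageTracker()
tracker.record(1500, tool="edit_file")
tracker.record(500)
```

Surfacing `context_pct`, `cost`, and `tool_calls` in a status line is the "dashboard" the note says is missing.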
Problem: An LLM-powered HTML generator injected all 18 design-system component patterns into every prompt, ~3,055 tokens of context. But most requests needed only 3-7 components. The unused patterns w…
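The obvious remedy is selective injection: include only the patterns a request actually needs. A sketch under that assumption; the component names and snippets are made up, and real matching would likely be smarter than substring search.

```python
# Hypothetical pattern library: component name -> prompt snippet.
PATTERNS = {
    "button": 'Use <button class="btn">…</button> for actions.',
    "card": 'Wrap grouped content in <div class="card">…</div>.',
    "modal": 'Modals use the native <dialog> element.',
    "table": 'Tabular data uses <table class="data">…</table>.',
}


def build_context(request: str) -> str:
    """Inject only the patterns the request mentions, instead of
    all 18 on every call (naive substring matching for illustration)."""
    needed = [name for name in PATTERNS if name in request.lower()]
    return "\n".join(PATTERNS[n] for n in needed)


ctx = build_context("Generate a card with a primary button")
```

Here only the card and button snippets reach the prompt; the modal and table patterns stay out of the context.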
Problem: I assumed migrating a synchronous FastAPI service to async would be the biggest performance win; I was ready to rewrite the entire database layer. Key Insight: Profiling with Datadog APM revealed…
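The transferable lesson is to measure before rewriting. A minimal stand-in for APM span timing using only the standard library (this is not the Datadog API), showing how per-section timings identify the real bottleneck:

```python
import time
from contextlib import contextmanager

timings: dict[str, float] = {}


@contextmanager
def span(name: str):
    """Minimal stand-in for an APM span: record wall time per section."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[name] = timings.get(name, 0.0) + time.perf_counter() - start


def handle_request():
    with span("db_query"):
        time.sleep(0.02)    # simulated slow blocking call
    with span("serialize"):
        time.sleep(0.001)   # simulated fast section


handle_request()
slowest = max(timings, key=timings.get)
```

With numbers like these in hand, you rewrite only the section that dominates, not the whole layer.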
Problem: Our average-latency alerts never fired, but users were complaining about slow responses. The dashboard showed a 45 ms average, well within thresholds. Key Insight: One 10-second request hidden in…
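Presumably the point is tail latency hidden in the mean. A sketch under that assumption, using a simple nearest-rank percentile and made-up numbers: a few 10-second outliers barely move the average but dominate p99.

```python
def percentile(values, p):
    """Nearest-rank percentile (simple sketch, no interpolation)."""
    s = sorted(values)
    k = max(0, min(len(s) - 1, round(p / 100 * len(s)) - 1))
    return s[k]


# 95 fast requests plus a handful of 10-second outliers:
# the mean stays deceptively low while p99 exposes the tail.
latencies_ms = [45] * 95 + [10_000] * 5
mean = sum(latencies_ms) / len(latencies_ms)
p99 = percentile(latencies_ms, 99)
```

Alerting on p99 (or p95) instead of the mean is what would have caught these complaints.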
Problem: Distributed tracing across 15+ microservices was broken. Developers were supposed to pass correlation IDs in every request, but they kept forgetting. Some services generated their own IDs, others…
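The usual fix is to move the ID out of developers' hands and into shared middleware: honor an inbound ID, mint one at the edge if it's missing, and attach it to every outbound call automatically. A framework-agnostic sketch using contextvars; the header name is an assumption.

```python
import uuid
from contextvars import ContextVar

correlation_id: ContextVar[str] = ContextVar("correlation_id")
HEADER = "X-Correlation-ID"  # assumed header name


def extract_or_create(headers: dict) -> str:
    """Middleware step: reuse the caller's ID, or mint one if absent,
    so no service ever generates a conflicting ID of its own."""
    cid = headers.get(HEADER) or str(uuid.uuid4())
    correlation_id.set(cid)
    return cid


def outbound_headers() -> dict:
    # Every downstream call automatically carries the same ID.
    return {HEADER: correlation_id.get()}


extract_or_create({HEADER: "abc-123"})
```

Once this runs in shared middleware, forgetting to pass the ID is no longer possible for individual endpoints.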
Problem: We used the same /health endpoint for both liveness and readiness probes. Under load, the health check included a database ping that timed out, so Kubernetes marked pods as unhealthy and restarted…
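A sketch of the split the note implies: liveness answers "is the process alive?" and must not touch dependencies, while readiness answers "can I serve traffic right now?" and is where dependency checks belong. Handler names and the DB check are placeholders.

```python
def check_db() -> bool:
    """Stand-in for the real database ping."""
    return True


def liveness() -> tuple[int, str]:
    # Liveness probe: never check dependencies here. A slow database
    # would otherwise make Kubernetes restart perfectly healthy pods.
    return 200, "alive"


def readiness() -> tuple[int, str]:
    # Readiness probe: dependency checks belong here. Failing this
    # removes the pod from the Service endpoints without restarting it.
    return (200, "ready") if check_db() else (503, "not ready")
```

Restarting a pod because its database is slow makes the overload worse; failing readiness just sheds traffic until the dependency recovers.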
Problem: We published a "Logging Best Practices" style guide. It said: use JSON format, include correlation IDs, follow a consistent schema. Six months later, every service logged differently. Some use…
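The common remedy is to ship a shared logging library instead of a document, so the schema is enforced by code rather than by convention. A sketch with stdlib logging; the field names are assumed, not the guide's actual schema.

```python
import json
import logging
import sys

SCHEMA_FIELDS = ("ts", "level", "service", "correlation_id", "message")


class JsonFormatter(logging.Formatter):
    """Emit one JSON object per line with a fixed field set, so every
    service that uses the shared logger gets the same schema for free."""

    def __init__(self, service: str):
        super().__init__()
        self.service = service

    def format(self, record: logging.LogRecord) -> str:
        return json.dumps({
            "ts": self.formatTime(record),
            "level": record.levelname,
            "service": self.service,
            "correlation_id": getattr(record, "correlation_id", None),
            "message": record.getMessage(),
        })


def get_logger(service: str) -> logging.Logger:
    logger = logging.getLogger(service)
    handler = logging.StreamHandler(sys.stdout)
    handler.setFormatter(JsonFormatter(service))
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    return logger
```

Teams adopt one import line instead of re-reading a style guide, and schema drift becomes a code review problem rather than a documentation one.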