Articles

Engineering Notes & Write-ups

Technical insights on Python, APIs, DevOps, and web development — grounded in real production experience.

SERIES · Learning AI Engineering by Building · 7 parts

~39 min total

SERIES · How I Think About System Design · 1 part

~4 min total

1. How I Think About System Design · 4 min

The Mental Shift That Made AI Coding Tools Work for Me

FEATURED

An AI coding assistant is a brilliant new hire with amnesia. Once I understood that, everything about how I use these tools changed.

Mar 18, 2026 · 4 min read · ai claude devtools workflow learning

How I Built This Portfolio

FEATURED

I built this portfolio as a knowledge graph: cross-linked pages with build-time search, automatic backlinks, reader reactions, and a continue-reading feature, all on a static site. Here's the architecture behind it.

Mar 15, 2026 · 5 min read · nextjs velite architecture devtools redis

Regex vs On-Device LLM for PII Detection: A 25-Case Benchmark

FEATURED

A 25-case benchmark comparing regex pattern matching against Apple's on-device Foundation Models for PII detection: 52% F1 vs 100% F1, respectively, and why binary classification beats extraction.

Mar 8, 2026 · 3 min read · python llm security benchmark apple-fm

On-Device vs Cloud LLM: A Practical Benchmark

Benchmarking Apple's on-device Foundation Models against cloud LLMs across commit message generation, code review, and text classification — latency, quality, cost, and privacy tradeoffs.

Mar 7, 2026 · 2 min read · llm benchmark apple-fm architecture

How to Build an LLM-Powered UI Generator

A technical deep dive into building a system where users type plain English and get live HTML mockups that match a specific design system — grounding, token budgets, streaming, and security.

Feb 20, 2026 · 3 min read · llm python fastapi architecture frontend

Designing AI Workflows with Claude Code: From Prompts to Agents and Teams

FEATURED

How AI workflows evolve from simple prompts into structured systems of skills, agents, and collaborative teams — patterns observed while experimenting with Claude Code.

Jun 1, 2025 · 2 min read · ai claude ai-workflows agents devtools

Building FastAPI Services on Kubernetes

FEATURED

How I structure FastAPI applications for Kubernetes deployment — from project layout to health checks, pod templates, and CI/CD.

Mar 15, 2025 · 1 min read · python fastapi kubernetes devops

Async Python in Production: What They Don't Tell You

FEATURED

Async improves throughput but introduces debugging complexity, connection pool pitfalls, and error handling surprises. Lessons from running async APIs at scale.

Jan 20, 2025 · 2 min read · python async fastapi performance

Why CPU-Based Autoscaling Fails for API Services

CPU utilization is the wrong signal for scaling API services. Here's why a Horizontal Pod Autoscaler (HPA) driven by request latency produces better scaling behavior, and how to implement it.

Sep 15, 2024 · 2 min read · kubernetes devops performance infrastructure