Case studies.
Each follows the same shape — problem, constraints, architecture, trade-offs, outcome — so you can compare how I think.
- 2025: Scaling an Order Pipeline to 50k Events/Sec
The existing order pipeline was a synchronous chain of HTTP calls. At peak traffic, tail latency and failures cascaded: a slow downstream service caused order timeouts, duplicate writes from naive retries, and intermittent data inconsistency. The business needed to grow 10x without growing incidents.
- 2025: RAG That Actually Works in Production
Support and product teams were drowning in repetitive questions that were already answered in our docs, tickets, and runbooks. Earlier RAG attempts were 'demo-great, production-bad': plausible answers, frequent hallucinations, no trust signals.
- 2024: Multi-Tenant Isolation on AWS
A single noisy customer could degrade the platform for everyone. We needed isolation that didn't explode costs, plus a story for running the biggest customers on dedicated infrastructure.