Blog Archive
Voice Agents Don't Know When You're Done Talking
Most builders assume end-of-turn detection is a silence threshold. That model breaks in production. The fix is architectural: four probabilistic events, speculative reasoning, and everything downstream cancellable.
Read Post
The 8th Specialist: An AI That Breaks Things on Purpose
An 8th specialist that touches the browser — navigating, clicking, and typing to find bugs that scripted journeys miss. How exploratory testing with Playwright catches what verification alone cannot.
Read Post
Specialist Agents: Looking at Every Page with Different Eyes
Seven specialists, each with their own checklist — QA, UX, UI, Security, Performance, Data Leakage, Language. How splitting evaluation into focused agents catches more.
Read Post
25 Issues Overnight: Batch AI That Doesn't Need You
The leap from AI-assisted coding to autonomous batch processing. Fresh context per task, filesystem locks, model routing, and orchestration that runs while you sleep.
Read Post
The Boring Stuff That Keeps AI Running at 3am
Exponential backoff, dual timeouts, SSE heartbeats, idempotency caches — the unglamorous patterns that keep LLM-powered systems running at 3am.
Read Post
/qa-run: AI-Driven QA That Closes the Loop
Real browser, predefined journeys, specialist agents checking every step. How QA findings feed back into the issue tracker to close the development loop.
Read Post
/work-issue: Autonomous Implementation
The AI picks up a planned issue, implements it, tests it, commits, and marks it done. How headless batch mode lets me plan in the afternoon and wake up with code in the morning.
Read Post
Sub-10ms AI Responses Without Calling the LLM
Users ask similar questions in different words. Semantic caching with pgvector turns repeated intent into instant answers — no LLM call, no embedding, no retrieval pipeline.
Read Post