AI Operator track — production AI agent engineering. Validation loops, tool-call reliability, completion ownership, context engineering, cost forensics, and observability for systems that ship and stay shipped.
Long-form writing and video on production AI agent engineering — the 90% of work that happens after the model gives you a demo that works.
Topics: validation loops, tool-call completion ownership, context engineering, retry budgets, cost forensics, and observability for systems that ship and stay shipped.
Companion to the Harrison AI Operator YouTube channel. Blog and video are listed below — most recent first.
Blog

Inside query.ts — the 1,729-line async generator that is Claude Code's beating heart. 10 steps per iteration, 9 continue points, 4-stage compression, and streaming tool execution. With line numbers.
2026-04-03
9 min read
Blog

AI API calls are unlike ordinary RPC: per-request cost varies 100×, tokens and models are first-class, streaming muddies timing, caching changes the pricing. A T-shaped instrumentation architecture — shared stem, specialized arms — that handles tracing, billing, and cost analytics without any of them contaminating the others.
2026-04-01
13 min read
Blog

The MEMORY.md frontmatter spec from Claude Code's leaked source: 4 types (user, feedback, project, reference), 200-line index cap, LLM-based picker. What's right, where it breaks at scale.
2026-04-01
6 min read
Blog

Claude Code v2.1.88 accidentally exposed 510K lines. The 5 hidden features: Kairos (permanent memory), Undercover Mode (stealth), Ultraplan (deep planning), Pet System (Buddy), Multi-Agent. Source-cited.
2026-03-31
7 min read
Video
Extended 48-minute version of Episode 1. Same LLM-talks-program-walks framework, but with deeper exploration of every layer from token streaming to multi-agent orchestration.
2026-03-29
Video
Extended podcast version of the AI Stack thesis. LLM, Token, Context, Function Calling, MCP, Agent, Skill — eight concepts that confuse every engineer until you see they're all the same pattern.
2026-03-29
Blog

A first-principles breakdown of the entire AI stack — from LLM to Agent in one mental model. An LLM can only output text. Everything else is the program.
2026-03-28
8 min read
Video
A first-principles breakdown of the entire AI stack in 15 minutes — from LLM to Agent. One mental model: the LLM can only output text, the program does everything else.
2026-03-28