[ AI_OPERATOR ]

AI Operator track — production AI agent engineering. Validation loops, tool-call reliability, completion ownership, context engineering, cost forensics, and observability for systems that ship and stay shipped.

Long-form writing and video on production AI agent engineering — the 90% of work that happens after the model gives you a demo that works.

Topics: validation loops, tool-call completion ownership, context engineering, retry budgets, cost forensics, and observability for systems that ship and stay shipped.

Companion to the Harrison AI Operator YouTube channel. Blog and video are listed below — most recent first.

Blog

Claude Code Deep Dive Part 2: The 1,421-Line While Loop That Runs Everything

Inside query.ts — the 1,729-line async generator that is Claude Code's beating heart. 10 steps per iteration, 9 continue points, 4-stage compression, and streaming tool execution. With line numbers.

2026-04-03 9 min read

Blog

Observability and Billing for AI API Calls: A T-Shaped Architecture

AI API calls are unlike ordinary RPC: per-request cost varies 100×, tokens and models are first-class, streaming muddies timing, caching changes the pricing. A T-shaped instrumentation architecture — shared stem, specialized arms — that handles tracing, billing, and cost analytics without any of them contaminating the others.

2026-04-01 13 min read

Blog

Claude Code MEMORY.md Spec: The 4 Frontmatter Types Decoded (user / feedback / project / reference)

The MEMORY.md frontmatter spec from Claude Code's leaked source: 4 types (user, feedback, project, reference), 200-line index cap, LLM-based picker. What's right, where it breaks at scale.

2026-04-01 6 min read

Blog

Claude Code Source Leaked: Kairos, Undercover Mode, Ultraplan — 5 Hidden Features (510K Lines)

Claude Code v2.1.88 accidentally exposed 510K lines. The 5 hidden features: Kairos (permanent memory), Undercover Mode (stealth), Ultraplan (deep planning), Pet System (Buddy), Multi-Agent. Source-cited.

2026-03-31 7 min read

Video

The Complete AI Architecture Deep Dive — From LLM to Autonomous Agent (48 min)

Extended 48-minute version of Episode 1. Same LLM-talks-program-walks framework, but with deeper exploration of every layer from token streaming to multi-agent orchestration.

2026-03-29

Video

The AI Stack Explained — Extended Podcast (22 min)

Extended podcast version of the AI Stack thesis. LLM, Token, Context, Function Calling, MCP, Agent, Skill — eight concepts that confuse every engineer until you see they're all the same pattern.

2026-03-29

Blog

The AI Stack Explained: LLM Talks, Program Walks

A first-principles breakdown of the entire AI stack — from LLM to Agent in one mental model. An LLM can only output text. Everything else is the program.

2026-03-28 8 min read

Video

The AI Stack Explained: LLM Talks, Program Walks

A first-principles breakdown of the entire AI stack in 15 minutes — from LLM to Agent. One mental model: the LLM can only output text, the program does everything else.

2026-03-28

[ AI_OPERATOR ]

[ Connect_With_Me ]