First-principles breakdowns of AI architecture — LLM, Function Calling, MCP, Agent, RAG and beyond. No hype. No buzzwords. Just how it actually works. For engineers building AI workflows. By Harrison Guo.
Subscribe
RSS Feed: /podcast/feed.xml — Add this to Apple Podcasts, Spotify, or any podcast app.
Building an AI agent that works is easy. Building one that doesn't break is 90% of the work. In this episode, I break down the five pillars of agent architecture, the LLM vs. Code divide, and how I improved a production agent from 40% to 60% using code changes alone.
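One concrete reading of "code changes alone": the reliability gains live in deterministic code wrapped around the model call, not in prompt tweaks. A minimal sketch of that idea, with hypothetical names (`call_llm`, `parse_report`) standing in for whatever the production agent actually uses:

```python
import json

def call_llm(prompt: str) -> str:
    # Stand-in for a real model call (hypothetical stub so the
    # sketch runs offline). The LLM can only ever return text.
    return '{"city": "Berlin", "temperature_c": 21}'

def parse_report(raw: str) -> dict:
    # Deterministic validation: code enforces the contract that
    # the prompt can only request.
    data = json.loads(raw)  # raises on malformed output
    if not isinstance(data.get("city"), str):
        raise ValueError("missing or invalid 'city'")
    if not isinstance(data.get("temperature_c"), (int, float)):
        raise ValueError("missing or invalid 'temperature_c'")
    return data

def robust_report(prompt: str, max_retries: int = 3) -> dict:
    # A retry loop is a pure code change: it can lift end-to-end
    # success without touching the model or the prompt.
    last_error: Exception | None = None
    for _ in range(max_retries):
        try:
            return parse_report(call_llm(prompt))
        except (json.JSONDecodeError, ValueError) as err:
            last_error = err  # record and retry instead of failing
    raise RuntimeError(f"output never validated: {last_error}")

print(robust_report("Report the weather in Berlin as JSON."))
```

Validation, retries, timeouts, and fallbacks all sit on the code side of the divide, which is why they can move a production metric without a single prompt edit.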
The extended 48-minute deep dive into every layer of the AI stack — tokenization costs, Function Calling in production, MCP server architecture, real-world agents (Claude Code, Cursor, Copilot), progressive disclosure, and token economics. For engineers who want the full picture.
A 22-minute first-principles breakdown of the entire AI stack. An LLM can only output text — the program does everything else. Learn how Function Calling, MCP, Agents, and Skills all follow one pattern: LLM talks, program walks.
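That pattern fits in a dozen lines. In this sketch (hypothetical stub and tool names; no real model API behind `call_llm`), the LLM's only act is emitting text, and the program parses that text and does the actual work:

```python
import json
from datetime import datetime, timezone

def call_llm(messages: list[dict]) -> str:
    # LLM talks: its entire output is a string. Here, a canned
    # tool-call request (hypothetical stub instead of a real model).
    return '{"tool": "get_time", "arguments": {"zone": "UTC"}}'

def get_time(zone: str) -> str:
    # Program walks: real behavior lives in ordinary code.
    # (zone is illustrative; this stub always reports UTC.)
    return datetime.now(timezone.utc).isoformat()

TOOLS = {"get_time": get_time}  # the program owns the dispatch table

def run_turn(messages: list[dict]) -> str:
    reply = call_llm(messages)           # text out of the model
    request = json.loads(reply)          # program parses the text
    tool = TOOLS[request["tool"]]        # program chooses what runs
    return tool(**request["arguments"])  # program executes the call

print(run_turn([{"role": "user", "content": "What time is it?"}]))
```

Function Calling, MCP, Agents, and Skills differ in where the tool list comes from and how results flow back, but the division of labor stays the same.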