Loading lesson page...
AI From Scratch/Lesson 03/30 hours
Capstone 03 — Real-Time Voice Assistant (ASR to LLM to TTS)
A voice agent that feels right has end-to-end latency under 800ms, knows when you have stopped talking, handles barge-in, and can call a tool without stalling. Retell, Vapi, LiveKit Agents, and Pipecat all hit this bar in 2026. They do it...
CapstonePython (agent + pipeline)TypeScript (web client)