AI From Scratch/Lesson 12/30 hours
Capstone 12 — Video Understanding Pipeline (Scene, QA, Search)
Twelve Labs productized Marengo + Pegasus. VideoDB shipped the CRUD-for-video API. AI2's Molmo 2 published open VLM checkpoints. Gemini long-context handles hours of video natively. TimeLens-100K defined temporal grounding at scale. The 20...
Capstone, Python (pipeline), TypeScript (UI)