Phase 19: Capstone Projects
AI From Scratch/Lesson 12/30 hours

Capstone 12 — Video Understanding Pipeline (Scene, QA, Search)

Twelve Labs productized Marengo + Pegasus. VideoDB shipped the CRUD-for-video API. AI2's Molmo 2 published open VLM checkpoints. Gemini long-context handles hours of video natively. TimeLens-100K defined temporal grounding at scale. The 20...

CapstonePython (pipeline)TypeScript (UI)
Loading lesson page...