Phase 10: LLMs from Scratch
AI From Scratch/Lesson 21/~60 minutes

Jamba — Hybrid SSM-Transformer

State space models (SSMs) and transformers want different things. Transformers buy quality via attention at quadratic cost. SSMs buy linear-time inference and constant memory via a recurrence but lag quality. AI21's Jamba (March 2024) and...

LearnNo prerequisites
Loading lesson page...