Loading lesson page...
AI From Scratch/Lesson 79/~90 min
Pipeline Parallel and Bubble Analysis
Tensor parallelism splits the matrix multiply across ranks. Pipeline parallelism splits the model across ranks, one stage per rank. Microbatches flow through the pipeline. The empty time at the start and end is the bubble; minimising it is...
BuildPythonNo prerequisites