Phase 19: Capstone Projects
AI From Scratch/Lesson 76/~90 min

Collective Ops From Scratch

The four collective operations that hold distributed training together are allreduce, broadcast, allgather, and reduce_scatter. Every other primitive a training framework offers is a wrapper around these. Build them once over a multiproces...

BuildPythonNo prerequisites
Loading lesson page...