AI From Scratch/Lesson 03/~75 minutes

Audio Classification — From k-NN on MFCCs to AST and BEATs

Everything from "dog barking vs siren" to "which language is this" is audio classification. The features are mels. The architecture moves each decade. The evaluation stays AUC, F1, and per-class recall.

BuildPython

Loading lesson page...