Edge Quantization

Server-class quantization covers the math; this module covers the deployment details specific to edge: file formats, calibration recipes, and the long tail of K-quant variants people argue about on Reddit.

1 lesson, ~14 min total

Lesson coming: QAT for Edge — quantization-aware training recipes for sub-FP16 mobile deployment.