Skip to content

Parallelism & SIMD

Modern CPUs give you parallelism at three levels: across cores (threads), within a single core (instruction-level), and within a single instruction (SIMD). ML kernels exploit all three.