Parallelism & SIMD
Modern CPUs give you parallelism at three levels: across cores (threads), within a single core (instruction-level), and within a single instruction (SIMD). ML kernels exploit all three.
Modern CPUs give you parallelism at three levels: across cores (threads), within a single core (instruction-level), and within a single instruction (SIMD). ML kernels exploit all three.