Skip to content

Kernels & Hardware

When the compiler isn’t enough, you write the kernel. This module covers the kernel DSLs that win in practice — Triton (the daily driver), CUTLASS / CuTe (the NVIDIA reference), ThunderKittens / TileLang (the post-Triton frontier) — and a coming lesson on the 2026 hardware landscape that frames which DSL fits which chip.