Skip to content

Stable Learning

This is a collection of learning materials maintained by the lab. Each topic is structured as a progressive reading path with interactive visualizations. Pick a topic below and start from the beginning, or jump to any section.

From MDPs to modern policy optimization for LLM alignment.

Prerequisites: calculus, basic probability, familiarity with neural networks.

How multicore CPUs keep caches correct, from hardware protocols to memory fences.

Prerequisites: basic computer architecture (registers, memory, assembly helps but not required).