Writing about ML implementations and the occasional non-technical stuff. More coming soon.

Linear-Time Sequential Modeling: Mamba from Scratch
Implementing FlashAttention using Pallas
Implementing GRPO from Scratch in JAX
Proximal Policy Optimization, in JAX