Interactive Lab

Attention in LLMs

Large Language Models are essentially massive neural networks supercharged by an Attention Mechanism.

In an effort to better understand what's going on under the hood, I built a couple of visualizers. I hope it's somewhat illustrative for you!

Dual Metaphor

Switch between 'Story Mode' for intuition and 'Tech Mode' for precision.

Real-Time Math

Watch Online Softmax update as context rotates through the cluster.

No Information Loss

Learn why Ring Attention is bit-perfect, unlike RAG or sliding windows.