my favorite visual explainer I've made so far: SPECULATIVE DECODING
2-3x speedup on inference. but how?
1. draft quickly with a small model
2. accept/reject in parallel with a large model
full step-by-step explanation with ✨3D Animations ✨https://t.co/hJv3Cknjd1
I'm experimenting with creating the best possible technical video explainer of attention, but in two modes
⬛ Hyperactive dark mode
⬜ Slower-paced light mode
which do you prefer?
i'm permanently replacing adobe in my workflow
i made this video entirely with code, mostly vibe coding, using my own ECS game engine that's built on three.js
it also doubles as an interactive visualization! written version coming soon