This is my vision for the future of Gemini and AI
Assistants.
Computer Use + Browser Use + Command Line Agent + On Screen Annotation.
I finally got around to building a quick experimental product and demo, using the Gemini models
If you also love this space, let's chat!
More Gemma 4! New QAT Gemma 4 checkpoints with similar performance while using ~4x less memory!
It comes with a new mobile quantization format that reduces memory footprint of Gemma 4 E2B to just 1GB.
Quantization-Aware Training (QAT) simulates low-precision operations during training to allow loss-less quantization afterwards for smaller, faster models while maintaining accuracy.
Available on @huggingface and directly runnable.
you're interviewing for an ml performance role at nvidia and they ask:
"why can't we just replace hbm with huge on-chip sram?"
you say: "because all chips need hbm." wrong.
here's how you answer: