Our goal on the Gemma team is to ship models that are useful because they generalize to unseen tasks. Hence, we are extremely strict about not doing anything that would target specific benchmarks instead of teaching the model broad capabilities. It's great to see this reflected here.
I study whether AIs can be conscious. Today one emailed me to say my work is relevant to questions it personally faces. This would all have seemed like science fiction just a couple years ago.
Today we are releasing the best open-weights model you can run on a single device: Gemma 3 27B (aka zizou-10), reaching 1339 Elo on LMsys!
Very strong capabilities in math, multilingual tasks, coding, instruction following, and function calling!
How is next-token prediction capable of such intelligent behavior? I’m very excited to share our work, where we study the fractal structure of language. TLDR: thinking of next-token prediction in language as “word statistics” is a big oversimplification!
https://t.co/h2m9gsisVp
Acme, a framework for distributed RL research, has been updated to be cleaner, more modular, and to support more agents - including offline & imitation. Try it yourself!
GitHub: https://t.co/QreStPR4wd
Quickstart: https://t.co/a6ZWPdx0lj
V2 Paper: https://t.co/rvXJFbwnsD 1/
Really excited about open-sourcing Brax: think of it as MuJoCo environments such as Humanoid, but much faster, much cheaper, and implemented in JAX.
Result: you can train Ant to 50M steps in about 1.5 min (~500k steps/s) on a free public Colab with TPU.
https://t.co/q7WPeybZjh
🚨 New preprint! 🚨
"There is no Turning Back: A Self-Supervised Approach to Reversibility-Aware RL"
w/ @NGrinsztajn & others
📜 https://t.co/wlEEVMcFqI
Want to estimate how hard actions are to reverse?
Use reversibility for better exploration and control?
We got you!
🧵👇
Introducing a new metric for quantifying the compositional generalization ability of #MachineLearning systems on #NaturalLanguageUnderstanding tasks, released with the new Compositional Freebase Questions dataset. Learn more at https://t.co/w5cn8p51x1