On a subset of ForecastBench questions, an LLM has matched superforecaster performance for the first time.
A submission from @GoogleDeepMind, named “green tree,” is now #1 on dataset questions on ForecastBench, our AI forecasting benchmark.
Superforecasters remain #1 overall.
Happy to share that the @GoogleDeepMind Gemini team is starting a new research team in Singapore!
This new team will be focused on advanced reasoning, LLM/RL and improving bleeding edge SOTA models such as Gemini, Gemini Deep Think and beyond. 🔥
This team will be led by yours truly and reports up to Quoc Le (@quocleix)'s broader team in Mountain View which was recently in the center of both IMO gold medal and ICPC gold medal breakthroughs with Gemini Deep Think, amongst many other significant Gemini advancements. 🚀
We’re starting out with a very small but intensely capable force because talent density is key over anything else in the LLM era. Over the past few months, we have gone around and gathered the best of the best talent (in the region and beyond) and I’m confident we’ll have a super cracked team very soon.
If you are interested in joining and have made truly exceptional contributions in any domain or area, (engineering and/or research etc) please contact me.
This is quite an exciting time, with the Gemini / GenAI team at Google Deepmind leading the charge at the frontier. This is also the best opportunity to be on the critical path to AGI from the sunny island of Singapore. 🏝️
Many thanks to leadership support from @quocleix@JeffDean@benoitschilling, @EugenieRives and @demishassabis for the support of this team.
Wonderful and fun image generated by Nano Banana 👇
I really enjoyed the day and vibes and I'm sure everyone did as well! Just look at the food and celebratory vibes below 👇.
Special thanks to Divy @divy93t who led the organising of this event (all I did was to "host" 😁).
Shoutout to my SWE friends at Google SG who went above and beyond to help to make the event a success! Especially @LimYiFan and Justin Yip! All the super nice photos in this thread were taken by Caryn Heng. 😁
There's still more stuff I intended to post in this thread but Im kinda tired so Im gonna put a EOS here. 😂
First official Gold medal at IMO from DeepMind🥇 with Gemini Deep Think.
A general purpose text-in text-out model achieving gold medal is something quite unthinkable just about one year ago and here we are! The frontier of AI is incredibly exciting!
Happy to have co-led / captained model training! 😎 A fun fact was that I got roped into this effort thinking it was going to be a fun little side quest, little did I know that it turned out to be such a huge breakthrough.
And yes, the results are certified by the IMO! 😉