Gemini 3 is also available in Google Search AI Mode. Here is an example:
"Create several interactive mathematical fractals. Don't just list them; let me click on them to zoom in infinitely."
Seeing concepts come to life right in search using code written by Gemini 3 is just beautiful.
I mostly agree, but there is something special about multimodality. I also think it won't be chat, but I believe it will be more than just Graphical (the G in GUI). So my bet is on MUI: Multimodal User Interfaces that harness the good old graphics and add voice and gestures.
"Chatting" with LLMs feels like using an 80s computer terminal. The GUI hasn't been invented yet, but imo some properties of it can start to be predicted.
1 it will be visual (like GUIs of the past) because vision (pictures, charts, animations, not so much reading) is the 10-lane highway into the brain. It's the highest input information bandwidth and ~1/3 of brain compute is dedicated to it.
2 it will be generative and input-conditional, i.e. the GUI is generated on-demand, specifically for your prompt, and everything is present and reconfigured with the immediate purpose in mind.
3 a little bit more of an open question - the degree of procedural. On one end of the axis you can imagine one big diffusion model dreaming up the entire output canvas. On the other, a page filled with (procedural) React components or so (think: images, charts, animations, diagrams, ...). I'd guess a mix, with the latter as the primary skeleton.
But I'm placing my bets now that some fluid, magical, ephemeral, interactive 2D canvas (GUI) written from scratch and just for you is the limit as capability goes to \infty. And I think it has already slowly started (e.g. think: code blocks / highlighting, latex blocks, markdown e.g. bold, italic, lists, tables, even emoji, and maybe more ambitiously the Artifacts tab, with Mermaid charts or fuller apps), though it's all kind of very early and primitive.
Shoutout to Iron Man in particular (and to some extent Star Trek / Minority Report) as popular science AI/UI portrayals barking up this tree.
Today we are announcing ARC-AGI-2, an unsaturated frontier AGI benchmark that challenges AI reasoning systems (same relative ease for humans).
Grand Prize: 85%, ~$0.42/task efficiency
Current Performance:
* Base LLMs: 0%
* Reasoning Systems: <4%
Looking to hire a student researcher to work on a cool project for 6 months in DeepMind Montreal.
Reqs:
- Full-time masters/PhD student 🧑🏾‍🎓
- Substantial expertise in multi-agent RL, ideally including publication(s) 🤖🤖
- Strong Python coding skills 🐍
This you? Get in touch!
🎧Now live! The Moonshot Podcast! 🎧
In our inaugural episode, we revisit the origin stories of self-driving cars and drone deliveries—technologies experts thought were impossible 15 years ago.
Listen here: https://t.co/B1VBtPi58J
Unlock the secrets to enhancing your game's performance at our #GDC2025 sessions → https://t.co/4UrUsVq0x2
Learn about:
- Practical uses of Gemini and Gemma in game development
- Creating AI-driven mechanics
- Exploring the future of gaming with Google DeepMind
and more...
Introducing our AI co-scientist, a multi-agent AI system built with Gemini 2.0.
We think of it as a virtual collaborator for scientists, using advanced reasoning to synthesize a huge amount of literature, generate novel hypotheses, and suggest detailed research plans. We’re seeing promising early results in important research areas like liver fibrosis treatments, antimicrobial resistance, and drug repurposing. As a next step, we’re opening up a trusted tester program for scientists around the world.
Love Terence Tao's description of a genius step: "The first time you climb one step of the ladder is always heroic because you are at the edge of what the data and the math and the technology can give you."
Cosmic distance ladder https://t.co/kor9bVdx9D Can't wait for part II!
I just put up a new video, which was a collaboration with Terence Tao about the cosmic distance ladder. You can find the full video on YouTube, and here's a bit of extra footage that didn't make it into the final.
This is exactly what my co-PI and friend @jonc101x has written about so eloquently in @NEJM_AI in Who's Training Who -- https://t.co/tVBUA61iYb.
The power of LLMs (right now) in medicine is likely improving HUMAN-HUMAN interactions.
Knowledge compression is the single differentiator that has allowed humanity to make scientific & technological progress in ways lacking everywhere else. That in itself makes the effort novel and impactful. Grateful for all who pursue it despite unwarranted skepticism.
Distillation has been in the news (!) due to @deepseek_ai. The paper https://t.co/fRbFdfoHT1 was actually rejected from NeurIPS 2014 due to lack of novelty 🧐 (true-ish), and lack of impact 🙃.
Thanks reviewer#2 (literally), and thanks for @arxiv!
@geoffreyhinton @JeffDean
Engineers, creatives, designers - lend me your ears, and eyes, to check out my 🥇 1st tech demo of 2025: A #WebAI Agent running entirely client-side in the browser, that's capable of controlling a flight search webpage, to get the job done! I'm using @Google's Gemma 2 (2B) model in #JavaScript via #WebGPU thanks to the MediaPipe Web LLM library, combined with some extra function calling logic to enable advanced user experiences.
If you like what you see, give this post a share, as I'm just one guy trying to show the world the power of Web AI to do great things. Try it for yourself using the link below after watching the video. Do so on a modernish machine (I have verified it runs on my 6 year old Dell XPS with only an Intel integrated GPU), and use a modern browser like @googlechrome, as it needs WebGPU to run.
Want me to make a demo for your industry? Tell me what you need / want to see next! Let me know what you think, and if you have any questions, then drop them in the comments below!
YouTube Video Link: https://t.co/AsmejlmRLE
Such a privilege to contribute a tiny bit to these efforts! Grateful for the 20% opportunity I had in 2023 to identify powerful use cases for a multimodal search AI experience on glasses and build a dataset of books to showcase the potential crystallized in Project Astra in 2024.
It’s been an amazing last couple of weeks, hope you enjoyed our end of year extravaganza as much as we did!
Just some of the things we shipped: state-of-the-art image, video, and interactive world models (Imagen 3, Veo 2 & Genie 2); Gemini 2.0 Flash (a highly performant and efficient foundation model); Gemini-Exp-1206 model (top of the Chatbot Arena leaderboard); 2.0 Flash Thinking (our first ‘thinking’ model, expect a lot more news on this soon - as many of you remember, we pioneered this type of model with AlphaGo, AlphaZero, AlphaProof…); upgrades to @GeminiApp with Deep Research and more, fantastic new NotebookLM features, a new image remixing tool (Whisk); and a series of agentic research prototypes that can help people get things done (Project Astra, Mariner, Jules) all built on Gemini 2.0.
And then on top of all of that, we dropped the world’s most accurate weather prediction model (GenCast), the world’s most advanced quantum chip that performed a computation that would have taken 10 septillion years (Willow - could come in handy for training AI one day!), and it was the honour of a lifetime to receive the Nobel Prize for AlphaFold, which has revolutionised structural biology and is being used by 2 million researchers around the world to understand disease and accelerate drug discovery.
I could not be more proud of all the exceptionally talented teams at @GoogleDeepMind & @Google who have worked so incredibly hard on all these amazing projects. It’s the greatest joy one can have professionally to get to explore the outer reaches of science and human knowledge with such wonderful colleagues.
We’ve been inventing the future of AI for well over a decade now, and we’ll continue to be a relentless engine of innovation. In many ways we’re only just getting started… if you want to be at the frontier of the most exciting scientific and technological journey ever, this is the place to be, come join us!
Topology is about understanding continuous associations between things - crucial for how nature is structured.
If there's something I've enjoyed in 2024, it is thinking about the implications of this for how we structure knowledge.
All easier to digest thanks to Grant @3blue1brown
New* video! If you’ve ever wondered what topology is, this problem is one of the best examples I know of to give an authentic sense of what it’s all about: https://t.co/YKZXcjoJ8l
World leaders and experts are gathered in Cali for @UNBiodiversity #COP16Colombia, to negotiate and agree on a path forward to safeguard the planet and make #PeaceWithNature.
See what’s at stake at this summit: https://t.co/luXx5L0Ko0