The Physical Turing Test: your house is a complete mess after a Sunday hackathon. On Monday night, you come home to an immaculate living room and a candlelight dinner. And you couldn't tell whether a human or a machine had been there. Deceptively simple, insanely hard.
It is the next North Star of AI. The dream that keeps me awake at 12 am at the lab. The vision for the next computing platform that automates chunks of atoms instead of chunks of bits.
Thanks Sequoia for hosting me at AI Ascent! Below is my full talk on the first principles for solving general-purpose robotics: how we think about the data strategy and scaling laws. I assure you it will be 17 minutes you won't regret!
A great illustration of the first-mover effect: nothing happens before the first demo, then once the existence proof is established, everyone else catches up quickly against all odds.
Just a year ago, it was unthinkable that any other model would even remotely approach GPT's lead. Today, Sonnet-3.5 (not even Anthropic's biggest Opus model) is already slightly above and Llama-3-400B is around the corner.
Just 4 months ago, Sora blew everyone's mind and seemed so out of reach. Today, we have at least 4-5 clones of Sora at 70-80% quality, such as Kling, Luma, and Runway. The clones wouldn't have rallied without OpenAI's first move.
Many companies have the technical muscle. But very, very few have good taste in projects and strong will to execute. First movers take tremendous risk, yet their advantages don't stick for too long.
Are medical studies being written with ChatGPT?
Well, we all know ChatGPT overuses the word "delve".
Look below at how often the word "delve" is used in papers on PubMed (2023 was the first full year of ChatGPT).
I’m excited to announce that today I’m joining @Microsoft as CEO of Microsoft AI. I’ll be leading all consumer AI products and research, including Copilot, Bing and Edge. My friend and longtime collaborator Karén Simonyan will be Chief Scientist, and several of our amazing teammates have chosen to join us.
@InflectionAI will continue on its mission under a new CEO, and look to reach more people than ever by making its API widely available to developers and businesses the world over.
It’s been an amazing journey, with so much more to come. Thank you to everyone for your support. Things really are just getting started.
Today is the beginning of our moonshot to solve embodied AGI in the physical world. I’m so excited to announce Project GR00T, our new initiative to create a general-purpose foundation model for humanoid robot learning.
The GR00T model will enable a robot to understand multimodal instructions, such as language, video, and demonstration, and perform a variety of useful tasks. We are collaborating with many leading humanoid companies around the world, so that GR00T may transfer across embodiments and help the ecosystem thrive.
GR00T is born on NVIDIA’s deep technology stack. We simulate in Isaac Lab (new app on Omniverse Isaac Sim for humanoid learning), train on OSMO (new compute orchestration system to scale up models), and deploy to Jetson Thor (new edge GPU chip designed to power GR00T).
Announced in Jensen's keynote, Project GR00T is a cornerstone for the “Foundation Agent” roadmap of the newly founded GEAR Lab. At GEAR, we are building generally capable agents that learn to act skillfully in many worlds, virtual and real. See if you can spot "GEAR" in the video ;)
Join us on the journey to land on the moon.