This is THE moment of Physical AI!
We are officially announcing Cosmos 3: Omnimodal World Models for Physical AI 🚀
- Cosmos 3 is an omnimodal world model: within a unified architecture, it can understand and generate language, images, video, audio, and actions.
- It is not just a VLM, not just a video generator, not just an audio-visual generative model, and not just a physics simulator / world-action model. It can understand images and videos, generate images, videos, and audio, simulate future worlds, predict actions, and generate robot policies—enabling models to truly begin to “touch the world.”
- Cosmos 3 is the #1 open-weight reasoner / T2I / I2V / robot policy across many benchmarks.
Huge thanks to every teammate who fought side by side on this journey—from architecture, data, training, infra, serving, and evaluation to post-training. Every part of this project carries an incredible amount of hard work. This was my first time leading a project as Tech Lead, and I feel truly fortunate.
The future of Physical AI needs models that can not only “see” and “describe” the world, but also “imagine,” “simulate,” and “act”—and eventually close the loop with the real world. I hope Cosmos 3 can become an important starting point for this direction, and I’m excited to push Physical AI into its next stage together with the open-source community.
Welcome to the era of Physical AI.
HuggingFace: https://t.co/QW5h5pIWWM
Project Website: https://t.co/Jppa0gkn16
Code: https://t.co/aJgaLm5BaG
In China, you can now book a humanoid robot cleaning service to come to your house for only 20 USD per 3 hours.
I ordered one, but it got cancelled right before the session because the humanoid asked for leave due to a system upgrade. 😂
OpenAI Robotics is hiring, looking for exceptional full-stack hardware, ops, systems, and ML engineers to help us program and manufacture robots that are useful for society.
AI should be able to help people in the physical world. In the short term, we are focused on robots to support skilled workers to build our future infrastructure; in the long term, we imagine everyone having a personal robot doing anything they need.
Our world simulation research program, led by Aditya Ramesh (@model_mechanic), has evolved over the past year into OpenAI Robotics. Progress is rapid, and it is built on a foundation of co-design between robotics hardware and ML research.
If you love working hands-on across the robotics stack and want to build the future, please consider joining us. Send an email with your background and evidence of exceptional accomplishment to: [email protected]
"Comment “shift” and we’ll send you an early access link."
This single CTA is a powerful way to make your launch video stand out. Comments carry significant weight in the algorithm.
We should be aware of this as well, in addition to all the discussion about data & deployment & model training by robotics people.
Today, we're launching shift. We're starting by cleaning your apartment in New York City, for free.
Here's how it works. Book a shift cleaning. A vetted shift operator comes to your home wearing one of our devices. They clean. They leave. You pay nothing.
In exchange, we record the cleaning. Robotics is being built on data about how people do daily tasks, and the value of that recording is what funds the service. Anything personal in it is anonymized before the recording is processed.
By now, you have heard about the shift to AI more times than you can count. About the shift toward you, the part where you actually feel it, you have heard almost nothing. Shift is what starts to make it concrete, in specific cities, with specific services.
Today, cleaning in New York. Soon, handymen, repairs, and errands across the globe. And this is just one side of shift, with more on the way.
Comment “shift” and we’ll send you an early access link.
Breaking: The Chinese government launches the first Humanoid Robot ID System
Humanoid robots in China now have ID cards as well, mainly to address challenges amid large-scale deployment.
For example:
>> inconsistent enterprise coding rules blocking cross-company identity recognition;
>> heightened public security risks as robots enter work, daily life and public spaces;
>> ambiguous accountability across the industrial chain leading to buck-passing and untraceable incidents.
The 29-digit ID code, split into four segments (2 + 4 + 6 + 17 digits), underpins the full lifecycle traceability system:
1. 2-digit country code: Indicates origin for cross-border tracking and exports.
2. 4-digit enterprise code: Uniquely identifies manufacturers for clear accountability.
3. 6-digit model code: Records product specs, categories and production dates.
4. 17-digit serial number: Distinguishes individual units for full lifecycle traceability from production to recycling.
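A rough sketch of how a code with those four segments could be split apart, in Python. The field names, the assumption that the segments are simply concatenated in the listed order, and the sample value are all mine for illustration; the post does not give the official encoding or validation rules.

```python
from dataclasses import dataclass

# Segment widths as listed in the post, in the listed order.
# The names are hypothetical labels, not official field names.
SEGMENT_WIDTHS = (("country", 2), ("enterprise", 4), ("model", 6), ("serial", 17))
TOTAL_LENGTH = sum(width for _, width in SEGMENT_WIDTHS)  # 2 + 4 + 6 + 17 = 29


@dataclass(frozen=True)
class RobotId:
    country: str
    enterprise: str
    model: str
    serial: str


def parse_robot_id(code: str) -> RobotId:
    """Split a 29-character ID into its four fixed-width segments."""
    if len(code) != TOTAL_LENGTH:
        raise ValueError(f"expected {TOTAL_LENGTH} characters, got {len(code)}")
    fields = {}
    offset = 0
    for name, width in SEGMENT_WIDTHS:
        fields[name] = code[offset:offset + width]
        offset += width
    return RobotId(**fields)


# Made-up sample value, not a real robot ID.
sample = "86" + "0001" + "000042" + "123".zfill(17)
print(parse_robot_id(sample))
```

Only the total length is checked here, since the post does not say which characters each segment may contain.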
We underestimated how much AI spending is flowing into server vendors.
Dell reported a huge earnings beat and the stock exploded higher.
Key numbers:
>> Revenue: $43.8B
>> AI server orders: +757%
>> AI-optimized server backlog surged
>> Full-year guidance raised sharply
The stock jumped 30%+ after earnings, making it one of the biggest large-cap movers today.
Yesterday, we presented at the @KraneShares SF Innovation Forum.
KraneShares’ Global Humanoid Robotics and Physical AI Index ETF, $KOID, is one way retail and institutional investors can gain exposure to robotics and physical AI. OpenMind Founder @JanLiphardt and AI expert @chipro shared their perspectives on the industry tailwinds driving this market forward.
We’re excited to work closely with KraneShares to help bring robot adoption to the masses. Stay tuned for more soon.
We've raised $65 billion in Series H funding at a $965 billion post-money valuation, led by @AltimeterCap, Dragoneer, @Greenoaks, and @sequoia.
This investment will help us advance our research and expand our capacity to meet growing demand for Claude.