Yesterday was my last day at @LumaLabsAI.
Over the last three years, I had the privilege of helping drive the company's transition from 3D AI to video generation and native multimodal foundation models.
I am grateful to have worked alongside an extraordinary group of researchers, and I look forward to seeing the next chapter of the company's story unfold.
Today we’re introducing Meta AI Voice Conversations powered by Muse Spark that let you talk naturally to Meta AI (interrupt, switch topics, or swap languages), and as you talk, Meta AI can generate images and pull up recommendations from Reels, maps, and more. We’re also bringing live AI to the app, so you can point your camera at the world and ask about what you’re seeing in real time.
https://t.co/A5JJOD1lyc
People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way.
We share our approach, early results, and a quick look at our model in action.
https://t.co/AFJZ5kH7Ku
Codex grew programmatic policies with no neural nets: max score on Breakout, and SOTA-level scores on MuJoCo.
Maybe heuristics were not too weak. Maybe they were just too expensive to maintain. Maybe it's the next paradigm.
https://t.co/1ZaIneleuW
We are back. After one year of quiet building.
Introducing GENE-26.5, our first robotic brain that takes a major step toward human-level capability.
For years, robotics has struggled to learn from the world’s largest and most valuable data source: humans.
Solving it means rethinking the whole stack from the ground up:
- A robotics-native foundation model.
- A 1:1 human-like robotic hand.
- A noninvasive data collection glove for motion, force, and touch.
- A simulator that turns weeks of experiments into minutes.
GENE-26.5 is trained across language, vision, proprioception, tactile, and action. We designed a set of tasks to test how far we can go with this new paradigm.
Fully autonomous, 1x speed, one model, same weights. (Enjoy with sound on)
We are approaching the endgame for robotics.
And this is just the beginning.
I’ve been reflecting a lot on this journey, and I feel incredibly grateful.
To the Eigen AI team: thank you for choosing to build something hard together, and for bringing so much ambition, intensity, and care every day. I’m truly proud of what we have built, and even more grateful that we get to continue this next chapter together.
To our customers and developers: thank you for trusting us early, bringing us meaningful problems to solve, and shaping Eigen through your feedback, partnership, and belief in what we were building.
To our advisors, investors, and supporters: thank you for believing in us early, guiding us along the way, challenging us to think bigger, and standing with us through every stage of the journey.
And to the Nebius team: thank you for the conviction, trust, and partnership throughout this process. We’re excited to join forces, keep building with the same ambition, and take this next chapter even further.
This milestone would not have been possible without all of you. I’m deeply thankful, proud of what we have built together, and excited for what comes next! ❤️
🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.
🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.
Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today!
📄 Tech Report: https://t.co/drlDrxkYtp
🤗 Open Weights: https://t.co/T13Y8i7SDM
1/n
Our goal is to build practical models with comprehensive capabilities beyond open benchmarks. And the only way to do it is to co-design with diverse products while scaling solidly.
Tencent has the best product ecosystem and a solid, low-ego culture, and we are just getting started!
This is what I’ve been cooking for the past 4 months. GPT Image 2 is a massive jump of over 240 Elo above the second-place model, a gap bigger than the rest of the leaderboard combined.
🚀 Muse Spark Safety & Preparedness Report for Meta AI is out.
We start with our pre-deployment assessment under Meta's Advanced AI Scaling Framework, covering chemical and biological, cybersecurity, and loss of control risks. Our assessment flagged potentially elevated chem/bio risk, so we implemented safeguards and validated mitigations before deployment - bringing residual risk to within acceptable levels.
Beyond the Framework, we also share findings and early explorations of model behavior (honesty, intent understanding, etc.), jailbreak robustness, eval awareness, and more.
We're sharing this report to give a closer look at how we evaluate advanced AI safety. Always more work to do, and we welcome feedback from the community.
https://t.co/azpKHwu7x9
Muse Spark 🥑 is here!
9 months ago we started from scratch. Today we’ve rebuilt the entire stack and shipped a well-rounded, multimodal, and agent-native model.
This is just the beginning of a new era toward personal superintelligence. We’re early, but the momentum is real.
Give it a try at https://t.co/JYOXmerYWh — would love your feedback!
Introducing Muse Spark, the first in the Muse family of models developed by Meta Superintelligence Labs.
Muse Spark is a natively multimodal reasoning model with support for tool-use, visual chain of thought, and multi-agent orchestration.
Muse Spark is available today at https://t.co/wHkMPH82ZH and the Meta AI app. We’re also making it available in private preview via API to select partners, and we hope to open-source future versions of the model.
Learn more: https://t.co/PloE9q5x96
Check out Muse Spark, our first milestone in the quest for personal superintelligence! Scaling this with the team has been a total blast. Give it a spin and let us know what you think! 🥑
1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new infrastructure, new architecture, new data pipelines. muse spark is the result of that work, and now it powers meta ai. 🧵
Breaking: @AIatMeta just released Muse Spark — now live across @ScaleAILabs leaderboards.
Here’s how it stacks up:
Tied for 🥇on SWE-Bench Pro
Tied for 🥇on HLE
Tied for 🥇on MCP Atlas
Tied for 🥇on PR Bench - Legal
Tied for 🥈on SWE Atlas Test Writing
🥈on PR Bench - Finance
🥉on SWE Atlas QnA