Microsoft researchers and engineers release Zero Redundancy Optimizer (ZeRO) and DeepSpeed library, a system able to train 100-billion-parameter deep learning models. Learn about this breakthrough and how it led to Turing Natural Language Generation: https://t.co/NY75qYd07a
If you’re looking to meet @MicrosoftAI at #ICML2026 follow me and I’ll keep you posted on booth slots, we will have a mix of Recruiters like myself and @AlasTriin and some of our ML Researchers ready to discuss our latest frontier models & careers! #hiring
It was an honor working on MAI-Thinking-1! A lot of thought and effort went into this model, and we want to share some learnings with you. 109 pages worth 😅
https://t.co/4G4BD1Wgs3
Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier.
First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks.
- It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities.
- It's at 53% on SWE Bench Pro, placing it right alongside Opus 4.6 on one of the toughest coding benchmarks.
- And since we co-designed our models with our own silicon, MAI-Thinking-1 is optimized on our MAIA 200 chip. Benchmarking head-to-head against the GB200, we see 30% better performance per dollar as well as a 1.4x performance-per-watt gain when running our MAI models on the MAIA 200 end-to-end.
Next is MAI-Image-2.5 and its Flash variant. Two super strong models now at #2 on the leaderboards, surpassing the score of Nano Banana 2 on image editing.
Last for now is MAI-Code-1-Flash, our new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI.
- Code-1-Flash achieves 51% on SWE Bench Pro, despite having just 5B parameters, putting it closer to Haiku in size but cheaper in cost.
All of this is the foundation for Microsoft Frontier Tuning. It lets you customize our models to create custom, company-specific agents that only you control. You can make our model, your model. Your data. Your agents. Your moat.
Early adopters are already seeing a difference. When we tuned our models for McKinsey’s tasks, MAI delivered the highest win rate, outperforming GPT-5.5 on quality, while being 10x lower on cost.
Also really excited to be collaborating with the amazing team at Mayo Clinic to jointly train a new frontier AI model for healthcare.
Our announcements today mark another milestone on the road to humanist superintelligence. You can learn more and about our other new models in our latest blog: https://t.co/v65eop5Ixq
I’d like to hire strong data engineers to join our Microsoft Super Intelligence (MSI) team.
I am interested in people who are good at processing PDFs and other documents at billion scale, and people good at parsing the web at trillion scale.
If you dream of processing all of human knowledge to advance science and engineering, this is for you.
Also looking for strong evaluation and post-training engineers.
Be part of our first launches this year 🚀
We have all the resources in the world to support you, working in startup mode, while powering a large organisation with billions of users.
Hiring in London, Zurich, New York, Boston, Toronto, Seattle and SF.
Please send your CV to [email protected]
We are hiring star research and data engineers to invent the future of AI. [email protected]
If you’re finishing your undergrad or PhD at Imperial, Cambridge, Oxford, UCL, Toronto, MIT, MILA, UBC, ETH, Stanford, Caltech, UCLA, Berkeley, CMU, UW, NYU, Princeton, Columbia, Harvard, Yale or any other top school in STEM, please apply too. I love working with energetic people, who are prepared to work on what is needed to shape AI, make it safe, make it brilliant, make it creative, and make it useful in math, science, healthcare, education, energy and environment.
I’ve joined @Microsoft AI to advance the frontier of large scale multimodal AI research and to build products for people to achieve meaningful goals and dreams.
The MAI team is small, but well resourced and ambitious. We are now looking for exceptional ICs, who like to ship.
If you you’re interested in multimodal AI, both recognition and generation, love to collaborate and empower others, believe in diversity and inclusion, have a growth mindset, and want to impact the future of AI in a positive and profound way, please message me directly.
I believe this is a rare and unique opportunity to join a new AI team that will shape the future.
@black_in_ai@WiMLworkshop@_LXAI
The UK has phenomenal AI talent and a long established culture of responsible AI development.
Today I’m proud to be opening a new office: Microsoft AI London. If you’d like to join us, get in touch. We’re hiring!
https://t.co/DmFD3wFwQi
Happy #PiDay, let's toast to infinite adventures with @pi - now available on 13 platforms.
Ready for wherever life’s adventures take you. Cheers to 3.14 and beyond!
https://t.co/YAGfm71pUN
Pi just got a huge upgrade! It’s now powered by our latest LLM: Inflection-2.5, which is neck and neck with GPT-4 on all benchmarks and used less than half the compute to train.
Pi now has world class IQ, combined with its distinctively kind and curious personality.
Give it a go at https://t.co/waslkXahQ1
https://t.co/XLDmvwKCcO
The other news is the introduction of @MSFTDeepSpeed Meetups, which will be conducted once about every 3 months.
The inaugural one will be on Feb 12 6:00 PM - 8:00 PM at Redmond Reactor
https://t.co/CorCWdDqqo
Quote: "This will be the first ever meetup for the DeepSpeed open-source project. We will have an overview of DeepSpeed, latest features and release, and deeper dive talks on particular new/important features and use cases."
I think they plan to organize it at other locales as well, such as Seattle and San Francisco.
I hope they record those!