Shaden Smith

@shaden_smith

Technical Staff at @MicrosoftAI. Prev. @InflectionAI, @MSFTDeepSpeed, and @Intel. Into horror, herpetology, and high performance computing. he/him

Bellevue, WA

Joined January 2017

690 Following

257 Followers

196 Posts

Pinned Tweet

Shaden Smith @shaden_smith

over 6 years ago

Very excited to share some work from my new group at Microsoft! We open sourced our DeepSpeed library for large-scale deep learning.

Microsoft Research

@MSFTResearch

over 6 years ago

Microsoft researchers and engineers release Zero Redundancy Optimizer (ZeRO) and DeepSpeed library, a system able to train 100-billion-parameter deep learning models. Learn about this breakthrough and how it led to Turing Natural Language Generation: https://t.co/NY75qYd07a

632

195

shaden_smith retweeted

Jonny Kaye @k44yej

3 days ago

If you’re looking to meet @MicrosoftAI at #ICML2026 follow me and I’ll keep you posted on booth slots, we will have a mix of Recruiters like myself and @AlasTriin and some of our ML Researchers ready to discuss our latest frontier models & careers! #hiring

Shaden Smith @shaden_smith

24 days ago

@aaronbatilo Mango <3

shaden_smith retweeted

Paul Soulos @paulsoulos

27 days ago

It was an honor working on MAI-Thinking-1! A lot of thought and effort went into this model, and we want to share some learnings with you. 109 pages worth 😅 https://t.co/4G4BD1Wgs3

paulsoulos's tweet photo. It was an honor working on MAI-Thinking-1! A lot of thought and effort went into this model, and we want to share some learnings with you. 109 pages worth 😅

https://t.co/4G4BD1Wgs3 https://t.co/5WDtWtQIlj

Who to follow

Conglong Li

@conglongli

李葱茏/リーツォンロン Senior Research Scientist @GoogleDeepMind Japan office. Ex @Microsoft @DeepSpeedAI member. @SCSatCMU PhD. Views are my own. English/中文/日本語.

Keita Teranishi

@KeitaTeranishi

Programming Systems Group Leader @ Oak Ridge National Laboratory. Penn State and U Tennessee Alumnus. 岩手県二戸市出身

Jeff Rasley

@jeffra45

@Snowflake AI Research Team. @DeepSpeedAI co-founder, @BrownCSDept PhD, @uwcse alum

shaden_smith retweeted

Mustafa Suleyman

@mustafasuleyman

27 days ago

Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier. First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks. - It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities. - It's at 53% on SWE Bench Pro, placing it right alongside Opus 4.6 on one of the toughest coding benchmarks. - And since we co-designed our models with our own silicon, MAI-Thinking-1 is optimized on our MAIA 200 chip. Benchmarking head-to-head against the GB200, we see 30% better performance per dollar as well as a 1.4x performance-per-watt gain when running our MAI models on the MAIA 200 end-to-end. Next is MAI-Image-2.5 and its Flash variant. Two super strong models now at #2 on the leaderboards, surpassing the score of Nano Banana 2 on image editing. Last for now is MAI-Code-1-Flash, our new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI. - Code-1-Flash achieves 51% on SWE Bench Pro, despite having just 5B parameters, putting it closer to Haiku in size but cheaper in cost. All of this is the foundation for Microsoft Frontier Tuning. It lets you customize our models to create custom, company-specific agents that only you control. You can make our model, your model. Your data. Your agents. Your moat. Early adopters are already seeing a difference. When we tuned our models for McKinsey’s tasks, MAI delivered the highest win rate, outperforming GPT-5.5 on quality, while being 10x lower on cost. Also really excited to be collaborating with the amazing team at Mayo Clinic to jointly train a new frontier AI model for healthcare. Our announcements today mark another milestone on the road to humanist superintelligence. You can learn more and about our other new models in our latest blog: https://t.co/v65eop5Ixq

mustafasuleyman's tweet photo. Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier.
First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks.
- It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities.
- It's at 53% on SWE Bench Pro, placing it right alongside Opus 4.6 on one of the toughest coding benchmarks.
- And since we co-designed our models with our own silicon, MAI-Thinking-1 is optimized on our MAIA 200 chip. Benchmarking head-to-head against the GB200, we see 30% better performance per dollar as well as a 1.4x performance-per-watt gain when running our MAI models on the MAIA 200 end-to-end.

Next is MAI-Image-2.5 and its Flash variant. Two super strong models now at #2 on the leaderboards, surpassing the score of Nano Banana 2 on image editing.

Last for now is MAI-Code-1-Flash, our new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI.
- Code-1-Flash achieves 51% on SWE Bench Pro, despite having just 5B parameters, putting it closer to Haiku in size but cheaper in cost.

All of this is the foundation for Microsoft Frontier Tuning. It lets you customize our models to create custom, company-specific agents that only you control. You can make our model, your model. Your data. Your agents. Your moat.

Early adopters are already seeing a difference. When we tuned our models for McKinsey’s tasks, MAI delivered the highest win rate, outperforming GPT-5.5 on quality, while being 10x lower on cost.

Also really excited to be collaborating with the amazing team at Mayo Clinic to jointly train a new frontier AI model for healthcare.

Our announcements today mark another milestone on the road to humanist superintelligence. You can learn more and about our other new models in our latest blog: https://t.co/v65eop5Ixq

192

544

shaden_smith retweeted

Nando de Freitas

@NandoDF

5 months ago

I’d like to hire strong data engineers to join our Microsoft Super Intelligence (MSI) team. I am interested in people who are good at processing PDFs and other documents at billion scale, and people good at parsing the web at trillion scale. If you dream of processing all of human knowledge to advance science and engineering, this is for you. Also looking for strong evaluation and post-training engineers. Be part of our first launches this year 🚀 We have all the resources in the world to support you, working in startup mode, while powering a large organisation with billions of users. Hiring in London, Zurich, New York, Boston, Toronto, Seattle and SF. Please send your CV to [email protected]

522

484

104K

shaden_smith retweeted

Aaron

@aaronbatilo

8 months ago

Y'all watch @karpathy to learn about the whole network but I watch him to learn about F2L

shaden_smith retweeted

Nando de Freitas

@NandoDF

8 months ago

We are hiring star research and data engineers to invent the future of AI. [email protected] If you’re finishing your undergrad or PhD at Imperial, Cambridge, Oxford, UCL, Toronto, MIT, MILA, UBC, ETH, Stanford, Caltech, UCLA, Berkeley, CMU, UW, NYU, Princeton, Columbia, Harvard, Yale or any other top school in STEM, please apply too. I love working with energetic people, who are prepared to work on what is needed to shape AI, make it safe, make it brilliant, make it creative, and make it useful in math, science, healthcare, education, energy and environment.

674

721

223K

shaden_smith retweeted

Mustafa Suleyman

@mustafasuleyman

10 months ago

Excited to share our first @MicrosoftAI in-house models: MAI-Voice-1 and MAI-1-preview. Details and how you can test below, with lots more to come⬇️

mustafasuleyman's tweet photo. Excited to share our first @MicrosoftAI in-house models: MAI-Voice-1 and MAI-1-preview. Details and how you can test below, with lots more to come⬇️ https://t.co/LtL2YmzuTv

958

170

315

348K

shaden_smith retweeted

Charlie Marsh

@charliermarsh

over 1 year ago

uv now ships with dedicated documentation for PyTorch

513

36K

Shaden Smith @shaden_smith

over 1 year ago

@aaronbatilo https://t.co/fDZJwK5d5C

Shaden Smith @shaden_smith

over 1 year ago

@aaronbatilo So much love for Panopticon's Kentucky album that blends bluegrass and black metal. https://t.co/M2ovsiObm2

Shaden Smith @shaden_smith

almost 2 years ago

@aaronbatilo Wanna throw back a couple of cold ones at the Spaghetti Factory

Shaden Smith @shaden_smith

almost 2 years ago

@aaronbatilo I appreciate your unwavering dedication to accurate telemetry.

shaden_smith retweeted

Nando de Freitas

@NandoDF

almost 2 years ago

I’ve joined @Microsoft AI to advance the frontier of large scale multimodal AI research and to build products for people to achieve meaningful goals and dreams. The MAI team is small, but well resourced and ambitious. We are now looking for exceptional ICs, who like to ship. If you you’re interested in multimodal AI, both recognition and generation, love to collaborate and empower others, believe in diversity and inclusion, have a growth mindset, and want to impact the future of AI in a positive and profound way, please message me directly. I believe this is a rare and unique opportunity to join a new AI team that will shape the future. @black_in_ai @WiMLworkshop @_LXAI

NandoDF's tweet photo. I’ve joined @Microsoft AI to advance the frontier of large scale multimodal AI research and to build products for people to achieve meaningful goals and dreams.

The MAI team is small, but well resourced and ambitious. We are now looking for exceptional ICs, who like to ship.

If you you’re interested in multimodal AI, both recognition and generation, love to collaborate and empower others, believe in diversity and inclusion, have a growth mindset, and want to impact the future of AI in a positive and profound way, please message me directly.

I believe this is a rare and unique opportunity to join a new AI team that will shape the future.

@black_in_ai @WiMLworkshop @_LXAI

956

209

109K

shaden_smith retweeted

Aaron

@aaronbatilo

about 2 years ago

Y'all out here judging LLMs on instruction following but have you ever asked a human to follow an onboarding document before?

208

shaden_smith retweeted

Mustafa Suleyman

@mustafasuleyman

about 2 years ago

The UK has phenomenal AI talent and a long established culture of responsible AI development. Today I’m proud to be opening a new office: Microsoft AI London. If you’d like to join us, get in touch. We’re hiring! https://t.co/DmFD3wFwQi

250

466

358K

shaden_smith retweeted

Inflection AI @inflectionAI

over 2 years ago

Happy #PiDay, let's toast to infinite adventures with @pi - now available on 13 platforms. Ready for wherever life’s adventures take you. Cheers to 3.14 and beyond! https://t.co/YAGfm71pUN

153

101K

shaden_smith retweeted

Inflection AI @inflectionAI

over 2 years ago

Pi just got a huge upgrade! It’s now powered by our latest LLM: Inflection-2.5, which is neck and neck with GPT-4 on all benchmarks and used less than half the compute to train. Pi now has world class IQ, combined with its distinctively kind and curious personality. Give it a go at https://t.co/waslkXahQ1 https://t.co/XLDmvwKCcO

953

245

268

685K

shaden_smith retweeted

Stas Bekman

@StasBekman

over 2 years ago

The other news is the introduction of @MSFTDeepSpeed Meetups, which will be conducted once about every 3 months. The inaugural one will be on Feb 12 6:00 PM - 8:00 PM at Redmond Reactor https://t.co/CorCWdDqqo Quote: "This will be the first ever meetup for the DeepSpeed open-source project. We will have an overview of DeepSpeed, latest features and release, and deeper dive talks on particular new/important features and use cases." I think they plan to organize it at other locales as well, such as Seattle and San Francisco. I hope they record those!

Shaden Smith

@shaden_smith

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users