Nadav Outmezguine @NadavOut - Twitter Profile

NadavOut retweeted

stochasm

@stochasticchasm

about 1 month ago

okay live reading/reacting thread here, there's a ton to go through

13

553

36

476

77K

NadavOut retweeted

Adam Sadovsky @asadovsky

about 1 month ago

Today we announced MAI-Thinking-1, a strong generalist and reasoning LLM built from the ground up without distilling third-party models. 97% on AIME 2025; 53% on SWE-Bench Pro; preferred by human raters over Sonnet 4.6 (blind side-by-side). Tech report: https://t.co/qxGQWX5cOt

14

266

19

70

21K

NadavOut retweeted

Hanxiao Liu @Hanxiao_6

about 1 month ago

Happy to share our recent work!

4

71

9

6

18K

NadavOut retweeted

Hanna Hajishirzi

@HannaHajishirzi

about 1 month ago

MAI-Thinking-1 is out! Excited to share what we are building and how climbing from scratch (no distillation) actually works: simple recipes, rigorous science, self-distillation, patience, and great infra. Check out our tech report has the full story of our RL climbs. https://t.co/aLW40sWz4d

HannaHajishirzi's tweet photo. MAI-Thinking-1 is out!

Excited to share what we are building and how climbing from scratch (no distillation) actually works: simple recipes, rigorous science, self-distillation, patience, and great infra.

Check out our tech report has the full story of our RL climbs.
https://t.co/aLW40sWz4d

25

875

128

382

132K

Who to follow

Patrick Meade

@theory_dad

Theoretical Physicist and Dad. Professor at Yang Institute for Theoretical Physics, Stony Brook Univ. Views are my own.

Gordan Krnjaic

@GordanKrnjaic

Theoretical physicist @ Fermilab & U. Chicago

the finite physicist

@FinitePhysicist

theoretical particle physics & phenomenology || don't sleep through dreams that can come true

NadavOut retweeted

Mustafa Suleyman

@mustafasuleyman

about 1 month ago

Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier. First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks. - It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities. - It's at 53% on SWE Bench Pro, placing it right alongside Opus 4.6 on one of the toughest coding benchmarks. - And since we co-designed our models with our own silicon, MAI-Thinking-1 is optimized on our MAIA 200 chip. Benchmarking head-to-head against the GB200, we see 30% better performance per dollar as well as a 1.4x performance-per-watt gain when running our MAI models on the MAIA 200 end-to-end. Next is MAI-Image-2.5 and its Flash variant. Two super strong models now at #2 on the leaderboards, surpassing the score of Nano Banana 2 on image editing. Last for now is MAI-Code-1-Flash, our new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI. - Code-1-Flash achieves 51% on SWE Bench Pro, despite having just 5B parameters, putting it closer to Haiku in size but cheaper in cost. All of this is the foundation for Microsoft Frontier Tuning. It lets you customize our models to create custom, company-specific agents that only you control. You can make our model, your model. Your data. Your agents. Your moat. Early adopters are already seeing a difference. When we tuned our models for McKinsey’s tasks, MAI delivered the highest win rate, outperforming GPT-5.5 on quality, while being 10x lower on cost. Also really excited to be collaborating with the amazing team at Mayo Clinic to jointly train a new frontier AI model for healthcare. Our announcements today mark another milestone on the road to humanist superintelligence. You can learn more and about our other new models in our latest blog: https://t.co/v65eop5Ixq

mustafasuleyman's tweet photo. Super excited to announce seven new world-class MAI models today. They represent what we consider a new era in AI designed to keep you in control and on the frontier.
First is our text foundation model, MAI-Thinking-1, exceptionally strong on reasoning and SWE tasks.
- It’s a 35B active parameter MoE with a 256K context window. Independent human raters on Surge prefer it for overall quality in blind side-by-sides versus Sonnet 4.6, and it’s achieved 97% on AIME 2025, the key measure of its general-purpose reasoning abilities.
- It's at 53% on SWE Bench Pro, placing it right alongside Opus 4.6 on one of the toughest coding benchmarks.
- And since we co-designed our models with our own silicon, MAI-Thinking-1 is optimized on our MAIA 200 chip. Benchmarking head-to-head against the GB200, we see 30% better performance per dollar as well as a 1.4x performance-per-watt gain when running our MAI models on the MAIA 200 end-to-end.

Next is MAI-Image-2.5 and its Flash variant. Two super strong models now at #2 on the leaderboards, surpassing the score of Nano Banana 2 on image editing.

Last for now is MAI-Code-1-Flash, our new inference efficient coding model, especially tuned for VS Code and GitHub Copilot CLI.
- Code-1-Flash achieves 51% on SWE Bench Pro, despite having just 5B parameters, putting it closer to Haiku in size but cheaper in cost.

All of this is the foundation for Microsoft Frontier Tuning. It lets you customize our models to create custom, company-specific agents that only you control. You can make our model, your model. Your data. Your agents. Your moat.

Early adopters are already seeing a difference. When we tuned our models for McKinsey’s tasks, MAI delivered the highest win rate, outperforming GPT-5.5 on quality, while being 10x lower on cost.

Also really excited to be collaborating with the amazing team at Mayo Clinic to jointly train a new frontier AI model for healthcare.

Our announcements today mark another milestone on the road to humanist superintelligence. You can learn more and about our other new models in our latest blog: https://t.co/v65eop5Ixq

192

4K

542

1K

1M

NadavOut retweeted

Anastasios Nikolas Angelopoulos

@ml_angelopoulos

about 1 month ago

Are you guys clocking how big this is

5

76

2

20

19K

Nadav Outmezguine @NadavOut

6 months ago

@GutmanYotam @notevildvir רק עכשיו הבנתי שזה טרנטינו ולא דני רופ

0

1

0

42

Nadav Outmezguine @NadavOut

7 months ago

@Kralizec18 @gilgameshinnyc @StopAntisemites @UCBerkeley Yes. 100%

0

16

Nadav Outmezguine @NadavOut

10 months ago

@FeedTechILUncen לא

0

142

NadavOut retweeted

Peyman Milanfar

@docmilanfar

about 1 year ago

Amen

0

73

1

2

8K

Nadav Outmezguine @NadavOut

over 1 year ago

@four_form Oh man…

0

42

NadavOut retweeted

Gideon Sa'ar | גדעון סער

@gidonsaar

over 3 years ago

החלטת נתניהו לפטר את שר הבטחון יואב גלנט היא מעשה של טירוף, המעידה על העדר מוחלט של שיקול-דעת. אין תקדים בתולדות ישראל לשר בטחון שפוטר בגלל שהתריע, כמתחייב מתפקידו, על סכנה ביטחונית. נתניהו נחוש לדרדר את ישראל לתהום. כל יום בו מכהן נתניהו בתפקידו מסכן את ישראל ועתידה.

1K

8K

2K

85

573K

Nadav Outmezguine @NadavOut

over 1 year ago

@cajohare You deserve a medal 🎖️

0

202

NadavOut retweeted

🇱🇧🇻🇦𐤍𐤏𐤅𐤓 @nonabeleerf

over 1 year ago

We will have peace soon enough. Despite Iran's scums deviding us.

245

5K

427

123

148K

Nadav Outmezguine @NadavOut

over 1 year ago

@DrDM777 @mc_limor מאד רוצה לראות את האינטגרל! :)

1

0

37

Nadav Outmezguine @NadavOut

over 1 year ago

@BenBetzalel מעניין לחזור אל התרגיל הזה עם תמונה פחות מוכרת

0

1

0

36

NadavOut retweeted

Sasha Rush

@srush_nlp

over 1 year ago

Updating all my NeurIPS papers.

37

2K

163

121

250K

Nadav Outmezguine @NadavOut

over 1 year ago

@ChristiStoica @Anthony_Bonato Wanted to say the same! The thing they call mathematicians is actually physicists

0

2

0

173

NadavOut retweeted

Eran @GrabinerEran

over 1 year ago

אני לא יודע אם הסעודים מתואמים איתנו, מאמין שלא. אבל אכן קרו בשנים האחרונות שינויים שעשויים לגרום לסעודיה ואפילו לארה״ב לשמוח מפגיעה בתעשיית הנפט האיראנית. סעודיה מבזבזת הרבה מדי כסף וארה״ב הפכה להיות יצרנית הנפט הגדולה בעולם ויצואנית של נפט:

1

2

1

324

Nadav Outmezguine

@NadavOut

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users