Joe Fenton @JoeFenton - Twitter Profile

Pinned Tweet

almost 2 years ago

“All that matters for anyone in life is their family, their health and that’s always the same for everyone” — Mike Lynch https://t.co/qxhJc7E66C

Joe Fenton @JoeFenton

1 day ago

https://t.co/L46Y3OduKY

alphaXiv

@askalphaxiv

1 day ago

"MAI-Thinking-1: Building a Hill-Climbing Machine"

Microsoft just did something almost no frontier AI lab has done before

They shared how they engineered the data behind a frontier-scale model in unusual depth.

From data collection and eval decontamination, to data mix scaling, this paper lays out how they managed 30T pretraining tokens plus 3.55T midtraining tokens

Surprisingly, they also used no third-party distillation and no open-source training datasets

The model itself is not a jaw-dropping release, but the paper might be the best open look yet at a frontier-scale data factory and hill-climbing loop.

Joe Fenton @JoeFenton

3 days ago

Positive reaction to the mai-thinking-1 tech report is more than I imagined. Some nice write-ups from the open research community

Joe Fenton @JoeFenton

3 days ago

https://t.co/Nz1vqzrzEu

Harveen Singh Chadha

@HarveenChadha

3 days ago

MAI-Thinking-1 by Microsoft looks to be approaching sonnet level model, the 109 page tech report is gold

they got 29T unique tokens without any synthetic tokens for pretraining which is exact opposite of what they were doing with phi models !!

so many counter intuitive decisions but the best part is they talk a lot about data.. this is a must must read

Joe Fenton @JoeFenton

3 days ago

https://t.co/TvEskMQDlh

wh

@nrehiew_

3 days ago

Super detailed tech report for MAI-Thinking-1, with a ton of info on all stages of the pipeline. I'm surprised so much of this info is released :)

Super long thread on my notes: https://t.co/uCtan39KUp

Joe Fenton @JoeFenton

4 days ago

> buy truckloads of good books
> remove unspeakable amounts of slop from web data
> build a shedload of held-out evals

that was my work on mai-thinking-1

the model gets 97% AIME and I can speak for hours about ISBNs

read the tech report: https://t.co/XyBJudWQE2

Joe Fenton @JoeFenton

25 days ago

@levelsio How’d you find Aman Tokyo?

Joe Fenton @JoeFenton

about 2 months ago

@IanOsband Tax slipped my mind somehow

Joe Fenton @JoeFenton

about 2 months ago

Anthropic achieves escape velocity - question is, who will be next...

Joe Fenton @JoeFenton

4 months ago

@_aidan_clark_ Nothing is more important than getting things done - decline meetings and lose the reports.

Joe Fenton @JoeFenton

5 months ago

Qualitatively observed the same among AI researchers. The most successful are often exceptionally strong in seemingly orthogonal areas.

Stay general kids… https://t.co/z6LT2fUQKC

Joe Fenton @JoeFenton

5 months ago

Wondering if OpenAI falls into this category…

Joe Fenton @JoeFenton

about 2 years ago

"Invest in companies that would be happy to see a 100x improvement in foundation models" -- paraphrasing Sam Altman

Joe Fenton @JoeFenton

5 months ago

This is going to be insanely popular

Claude

@claudeai

5 months ago

Introducing Cowork: Claude Code for the rest of your work. Cowork lets you complete non-technical tasks much like how developers use Claude Code.

Joe Fenton @JoeFenton

5 months ago

Build Jarvis with Claude Code, 100 lines of python and iMessage:
* Script watches Messages DB for texts from your number
* Forwards to Claude API with tools (shell, chrome, email)
* Claude executes + replies via iMessage
* Run as Launch Agent on always-on Mac
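
A rough sketch of the watch-and-reply loop described above, not the author's actual script. It assumes the Messages database is a SQLite file with a `message` table (`ROWID`, `text`, `is_from_me`, `handle_id`) joined to a `handle` table (`ROWID`, `id`); that schema, the function names, and the two injected callables are all illustrative assumptions. The model call and the iMessage send are passed in as plain functions so the loop can be tried against a test database without touching any real API.

```python
import sqlite3
from typing import Callable, List, Tuple

# Assumed schema (not confirmed by the tweet): `message` rows reference a
# `handle` row whose `id` column holds the sender's phone number or address.
QUERY = """
SELECT m.ROWID, m.text
FROM message AS m
JOIN handle AS h ON h.ROWID = m.handle_id
WHERE h.id = ?
  AND m.is_from_me = 0
  AND m.ROWID > ?
  AND m.text IS NOT NULL
ORDER BY m.ROWID
"""


def fetch_new_messages(
    conn: sqlite3.Connection, sender: str, last_rowid: int
) -> List[Tuple[int, str]]:
    """Return (rowid, text) for messages from `sender` newer than `last_rowid`."""
    return conn.execute(QUERY, (sender, last_rowid)).fetchall()


def poll_once(
    conn: sqlite3.Connection,
    sender: str,
    last_rowid: int,
    ask_model: Callable[[str], str],         # hypothetical: wraps the model API call
    send_reply: Callable[[str, str], None],  # hypothetical: sends an iMessage back
) -> int:
    """Handle each unseen message once and return the new high-water mark."""
    for rowid, text in fetch_new_messages(conn, sender, last_rowid):
        send_reply(sender, ask_model(text))
        last_rowid = rowid
    return last_rowid
```

A long-running version would call `poll_once` in a loop with a short sleep and persist `last_rowid` between runs, so a restart under a Launch Agent does not replay old messages.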

Joe Fenton @JoeFenton

5 months ago

2025 is peak “let’s ship a wrapped feature”
