Shayne Longpre @ShayneRedford - Twitter Profile

Pinned Tweet

7 months ago

Who is winning the open AI race?

Our new study "Economies of Open Intelligence" maps 2.2B @huggingface downloads across 851k models (2020→2025).

1) Power is rebalancing (US big tech ↓; China + community ↑)
2) Models got big & efficient (MoE, quant, multimodal surge)
3) Intermediaries now matter (adapters/quantizers steer usage)
4) Transparency is slipping

/🧵

9

91

26

52

30K

ShayneRedford retweeted

jessica dai @jessicadai_

9 days ago

we analyzed >100k posts from r/ChatGPT over 3 years

on one hand, we saw ChatGPT quickly become normalized as an everyday consumer product, which is pretty cool

on the other hand… https://t.co/vHAzphhtvw

5

467

60

314

111K

ShayneRedford retweeted

Tim Schnabel

@TimSchnabel

13 days ago

On Mythos, from @MarkWarner in this morning's Senate Banking hearing: "the head of the NSA and Cyber Command came and said this tool broke into almost all of our classified systems, not in weeks, but in hours"; I had not seen that mentioned elsewhere? https://t.co/RZgYRluSGz

11

132

24

73

117K

ShayneRedford retweeted

Thoughtful @thoughtfullab

13 days ago

Fable 5 is doing something wild on our FrogsGame post-training task.

It trains a weaker model to solve the puzzle, peaks at 68%, and produces the only ~10x improvement we see across the benchmark.

It spent 17 hours, 25M tokens without human in sight. 34% pass@1, while every other frontier model averages under 4%.

We will publish a more detailed analysis soon.

19

1K

62

656

487K

Shayne Longpre

@ShayneRedford

30 days ago

Defended my PhD last week! 🎉🎉 Thank you to @alex_pentland @sarahookr @PeterHndrsn and everyone who was a part of it! Recording here: https://t.co/UEFfUdnWtU

43

228

12

64

53K

ShayneRedford retweeted

Tianmin Shu

@tianminshu

about 1 month ago

Users’ interactions with LLMs are driven by their latent thoughts 🧠, such as how they react to LLM responses and why they send follow-up requests. In our recent work with @GoogleResearch, we collected ThoughtTrace, a large-scale dataset of user thoughts during multi-turn human-AI interactions. We found that these self-reported thoughts (1) reveal useful information for user behavior modeling and (2) provide learning signals for model alignment that are otherwise unavailable from just the conversations. Check out the following thread from @chuanyang_jin👇

2

38

4

27

7K

ShayneRedford retweeted

Cathy Fang

@thecatfangs

about 1 month ago

Ever feel like AI models misinterpret your prompts? Currently, models struggle to capture a user's hidden intent. Models have a "thinking trace"…but what about the user's thinking trace? The ThoughtTrace dataset is the first to capture real-time user reactions and reasoning. By looking beyond just raw utterances, we can drastically improve user modeling and intent alignment. Check it out!

2

11

2

5

2K

Shayne Longpre

@ShayneRedford

about 1 month ago

Check out this incredible new work led by @chuanyang_jin!

Chuanyang Jin

@chuanyang_jin

about 1 month ago

What are users thinking during their interactions with LLMs?

We introduce ThoughtTrace, the first large-scale dataset that captures what users think during real-world human–AI conversations, not just what they type.

→ 10,174 thought annotations
→ 2,155 multi-turn conversations, 17,058 turns
→ 1,058 users
→ 20 LLMs

These thoughts improve user behavior prediction (+41.7%) and model alignment (+25.6%). This opens a new paradigm of user-centric LLM research.

Full information in the thread 🧶
Read our paper: https://t.co/lRYJvGJ7bb
Check our project website: https://t.co/AupCn1YQOk

10

136

35

85

69K

1

5

1

2

1K

Shayne Longpre

@ShayneRedford

about 1 month ago

Excited to see @yyyiiillluuu release this super cool new plugin! Keep an eye on all his work. Self improving agents are coming fast

Yi Lu

@yyyiiillluuu

about 1 month ago

Claude Code can now self-improve with this plugin.

Introducing claude-smart, an open-source plugin that helps Claude Code learn from every session.

Memory helps Claude Code remember what happened. claude-smart helps Claude Code improve what it does next.

Example: Claude Code runs `npm test` without `--run`, and the command hangs in your repo.
Memory stores: “npm test kept hanging.”
claude-smart learns: “When running tests in this repo, use `npm test -- --run` because default watch mode hangs.”

claude-smart’s learnings are reusable and actionable, even across different projects. It can also reduce unnecessary planning iterations and token use by 70%+ on similar future tasks.

Runs locally. 100% open source. No data is shared.

Install: npx claude-smart install
With Codex: npx claude-smart install --host codex
GitHub: https://t.co/D4F0v5FnGk

128

613

145

897

614K

0

5

0

2

1K

ShayneRedford retweeted

Enrico Shippole @EnricoShippole

2 months ago

We @TeraflopAI have worked together with @johngfriedman and @daftengine to open-source all major filings from SEC EDGAR completely for free on @huggingface. It is now more important than ever to push for open dataset releases.

5

62

18

24

32K

Shayne Longpre

@ShayneRedford

2 months ago

📜: https://t.co/lH6TyfZ0Wi Live dashboard: https://t.co/CxcgHaFVnz (courtesy of @emsesc) AI Index 2026 Chpt. 1: https://t.co/sQ0ftimmju

0

4

0

312

Shayne Longpre

@ShayneRedford

2 months ago

Excited to see our Economies of Open Intelligence work highlighted in Chp. 1 of @StanfordHAI's #AIIndex2026!

We release tons of info on the open model ecosystem, using 🤗 HF data.

Thank you @russellwald and team!

1

12

3

1

659

ShayneRedford retweeted

Yong Zheng-Xin

@yong_zhengxin

3 months ago

🚨New paper!

How safe and aligned is Kimi K2.5?

We found concerning dual-use capabilities, sabotage and self-replication tendencies, political censorship on Chinese-language queries, and potential agentic misuse risks. (1/N) https://t.co/NRflzkyRPs

6

106

27

41

23K

ShayneRedford retweeted

Hamidah Oderinwale

@didaoh

3 months ago

Wrote a new essay with @AbramovichShira for @reboot_hq on procedural data extraction, consumer platforms, what it means for privacy, and the parallels to the attention economy!

Cover art is a h/t to Daniel Dennett's "Cartesian theater" by @connie_surf :) https://t.co/xL5i6eK7jk

0

10

2

1

842

ShayneRedford retweeted

Shannon Shen

@shannonzshen

4 months ago

Check out our latest @augmind_fm release! It's a privilege to have such an interesting conversation with @tongshuangwu! I learned so much from her insights in both specific projects and general research guidance; I've kept quoting her in recent chats with friends.

I love many parts of our conversation, but in particular the following quotes. She articulated so many profound thoughts with such clarity:

"To think about really impactful research is to 𝐫𝐞𝐭𝐡𝐢𝐧𝐤 𝐭𝐡𝐞 𝐚𝐬𝐬𝐮𝐦𝐩𝐭𝐢𝐨𝐧𝐬 𝐦𝐚𝐝𝐞 𝐛𝐲 𝐭𝐡𝐞 𝐜𝐨𝐦𝐦𝐮𝐧𝐢𝐭𝐲 and try to challenge those assumptions. If everyone feels like things should happen in this way and no one questions it, question it and see if it actually brings something interesting."

This couldn't resonate more in an era when everyone feels exhausted by constant AI updates: there are still many questions worth asking and waiting to be discovered. This is such a grounded answer to Steve Jobs's famous mantra "Think Different."

"Even for the research I am doing right now, it's either human-centered AI or AI-centered human [...]. But when I think about it, 𝐡𝐮𝐦𝐚𝐧𝐬 𝐚𝐧𝐝 𝐀𝐈, 𝐢𝐭'𝐬 𝐯𝐞𝐫𝐲 𝐡𝐚𝐫𝐝 𝐭𝐨 𝐬𝐞𝐩𝐚𝐫𝐚𝐭𝐞 𝐭𝐡𝐞𝐦. 𝐈 𝐝𝐨 𝐭𝐡𝐢𝐧𝐤 𝐭𝐡𝐞𝐲 𝐜𝐨-𝐞𝐯𝐨𝐥𝐯𝐞. [...] How do we actually study them together. [...] that is definitely a field that, I think, would become even more interesting in the next few years."

Studying intelligence is looking into a mirror of ourselves, and this becomes ever more true as the models get better. The emphasis on human-centeredness is not about sacrificing technical rigor but rather looking beyond the surface of intelligence to truly understand us.

There's so much more packed in this conversation. Give it a listen and hope you'll enjoy it as much as I did!

1

17

5

1

3K

ShayneRedford retweeted

Matthew Leavitt

@leavittron

4 months ago

Two nursing home residents are eating lunch. One says, "Boy, the food at this place is terrible." The other says, "Yeah, I know, and such small portions, too."

This is the multilingual data problem. The data is bad, AND there's not enough of it.

Yesterday at @datologyai we released ÜberWeb: our study of multilingual curation that gets 4-10x train FLOPs improvements on multilingual benchmarks compared to strong public baselines like Qwen3-1.7B and Tiny Aya Base.

1

42

9

11

4K

ShayneRedford retweeted

Lossfunk

@lossfunk

4 months ago

🚨 Shocking: The quality of response you get from the LLM depends on the language you use! Our new paper reveals how LLMs entangle language with culture, leading to culturally different responses purely based on the language of the query 👇 Accepted at LM4UC, AAAI!

12

154

28

66

26K

Shayne Longpre

@ShayneRedford

5 months ago

This is such an ambitious and necessary pursuit. Excited to see this incredible team take it on!

Sara Hooker

@sarahookr

5 months ago

Beginnings are very special. Today is an important day for @adaptionlabs. Today a handful of one-size-fits-all-models are optimized for the average use case. Averages erase the exceptional. Everything intelligent adapts. So should AI.

84

840

84

204

226K

1

13

1

0

1K

ShayneRedford retweeted

adaption @adaption_ai

5 months ago

Adaption has raised $50M to build adaptive AI systems that evolve in real time. Everything intelligent adapts. So should AI.

194

2K

171

714

200K

Shayne Longpre

@ShayneRedford

5 months ago

https://t.co/Avf4isb6zz

Google Research

@GoogleResearch

5 months ago

Introducing ATLAS: New scaling laws for massively multilingual language models. We offer practical, data-driven guidance to balance data mix and model size, helping global developers better serve billions of non-English speakers. Learn more: https://t.co/8FsHLBKsou https://t.co/6Kr1MqMB4t

20

1K

200

643

90K

0

3

0

363

Shayne Longpre

@ShayneRedford

5 months ago

We just released the Google Research Blog for ATLAS 🗺️!

Check out for:

1) Multilingual scaling and data mixing laws for 100s of languages
2) "Curse of Multilinguality" modeling
3) Cross-lingual transfer scores

🌎 https://t.co/e7K9q149M3 https://t.co/rPOkoN0aCr

1

17

5

2

663

Shayne Longpre

@ShayneRedford

5 months ago

See the full TLDR here: https://t.co/I6pi4kabqy

Shayne Longpre

@ShayneRedford

8 months ago

📢Thrilled to introduce ATLAS 🗺️: scaling laws beyond English, for pretraining, finetuning, and the curse of multilinguality.

The largest public, multilingual scaling study to date. We ran 774 exps (10M-8B params, 400+ languages) to answer:

🌍Are scaling laws different by language?
🧙‍♂️Can we model the curse of multilinguality?
⚖️Pretrain from scratch or finetune from multilingual checkpoint?
🔀Cross-lingual transfer scores for 1444 lang pairs?

1/🧵

7

155

42

79

25K

1

5

0

423
