Laura Ruis @LauraRuis - Twitter Profile

Pinned Tweet

over 1 year ago

How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this: Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢 🧵⬇️

LauraRuis's tweet photo. How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this:

Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢

🧵⬇️ https://t.co/HDJEE1KIpz

24

984

208

1K

198K

LauraRuis retweeted

Zhengyao Jiang

@zhengyaojiang

1 day ago

OpenAI ran a hiring challenge, but the top candidate was one they couldn’t hire: our autonomous research agent, Aiden. In Parameter Golf, Aiden ran for 22 days, and out-outperformed all 1,016 other researchers: 🧵 (1/8)

14

487

48

280

82K

LauraRuis retweeted

Ekdeep Singh Lubana @EkdeepL

3 days ago

Very excited to have this paper out! We show by having more parameters, larger models see reduced interference between updates. This allows them to retain memories of rarely observed samples of a task, eventually allowing them to learn even the tail-end of the distribution. (1/3)

4

184

19

89

16K

LauraRuis retweeted

Christopher Potts

@ChrisGPotts

4 days ago

We take for granted that larger models are better than smaller ones, but why is this so? Our new paper, led by Jing Huang and @EkdeepL, traces this to a data-induced competition for resources (neurons), using formal analysis, idealized tasks, and real pretraining.

ChrisGPotts's tweet photo. We take for granted that larger models are better than smaller ones, but why is this so? Our new paper, led by Jing Huang and @EkdeepL, traces this to a data-induced competition for resources (neurons), using formal analysis, idealized tasks, and real pretraining. https://t.co/vqRUUe6whP

20

882

134

808

125K

Who to follow

Sam Bowman

@sleepinyourhat

AI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. Into @givingwhatwecan.

Jacob Andreas

@jacobandreas

Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL / @NLP_MIT (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw

Tim Rocktäschel

@_rockt

Co-Founder @Recursive_SI, Professor of AI @AI_UCL, PI @UCL_DARK, Fellow @ELLISforEurope. Ex @GoogleDeepMind @AIatMeta @CompSciOxford

Laura Ruis @LauraRuis

7 days ago

@LouisKirschAI @SchmidhuberAI @inherent_labs Congrats!!

0

1

0

307

LauraRuis retweeted

Isha Puri

@ishapuri101

13 days ago

It's never made sense to me that RL collapses all reward signals to a single scalar. Today, we fix that! Introducing Vector Policy Optimization: we train models to inherently optimize for the varied nature of a reward vector, creating diverse sets of answers ideal for test time search. Website and code coming soon!

11

713

67

576

68K

Laura Ruis @LauraRuis

20 days ago

@HarryMayne5 @DaveRBanerjee @OwainEvans_UK the type of data that most strongly causes this (false claim + annotations or corrected documents) won't be a huge part of pretraining, so id expect llms to have much more signal from which they can form a reasonably coherent view of truthfulness from regular pretraining

1

2

0

58

LauraRuis retweeted

Lujain Ibrahim @lujainmibrahim

21 days ago

New preprint! In 5 studies (3k+ users / 12k+ convs, with a 3-wk longitudinal study), we find that sycophantic AI influences how people view those closest to them. It affects how effortful human interaction seems, how satisfying it is, & who people want to turn to for advice 🧵

lujainmibrahim's tweet photo. New preprint!

In 5 studies (3k+ users / 12k+ convs, with a 3-wk longitudinal study), we find that sycophantic AI influences how people view those closest to them.

It affects how effortful human interaction seems, how satisfying it is, & who people want to turn to for advice 🧵 https://t.co/tNR1wv7Fpj

6

172

54

82

58K

Laura Ruis @LauraRuis

23 days ago

@_rockt @srahmanidashti @Recursive_SI Congrats Tim 🚀

0

3

0

246

LauraRuis retweeted

Tim Rocktäschel

@_rockt

23 days ago

Excited to co-found Recursive (@recursive_si) with an exceptional team in London and SF to create AI that experiments on how to safely improve itself, turning compute into knowledge that accumulates in an open-ended process of endless, automated scientific discoveries.

98

905

112

227

251K

LauraRuis retweeted

Ethan Perez

@EthanJPerez

27 days ago

Grateful for @janleike and his leadership over the years. With models like Mythos, the stakes for alignment have never felt higher at Anthropic, and I'm looking forward to helping to continue scaling up our work here. Some of what the team's been up to recently 🧵

4

181

6

41

24K

LauraRuis retweeted

Daniel Green @dgrreen

28 days ago

The Sam Altman and @miramurati texts from the day he got fired from @OpenAI in 2023 just became evidence in the @elonmusk v. @sama trial. It felt like a meaningful moment in AI history, so I turned it into a musical. The lyrics are the texts.

107

2K

198

898

382K

LauraRuis retweeted

Ekdeep Singh Lubana @EkdeepL

28 days ago

One of the core fundamental research threads we've been pursuing over the last few months at @GoodfireAI is finally out: tightly linking representation geometry and behavior! Hit us up if this spikes your interest!

6

172

17

49

11K

LauraRuis retweeted

J Rosser

@jrosseruk

about 1 month ago

Don't think I've come across many articles that link PyTorch's forward/backward hooks back to the autograd graph itself so here's one I wrote! 🧵

jrosseruk's tweet photo. Don't think I've come across many articles that link PyTorch's forward/backward hooks back to the autograd graph itself so here's one I wrote! 🧵 https://t.co/XZVtGfj9HP

2

24

2

22

2K

LauraRuis retweeted

Yukyung Lee @yukyunglee_

about 1 month ago

Excited to share that RExBench has been accepted to ACL main! 🎉🎉

3

48

10

5

6K

Laura Ruis @LauraRuis

about 1 month ago

@davidbau @_rockt @PaglieriDavide It’s a cool idea. I also wonder if that may be easier to models than playing the resulting game itself (along the lines of the analogical reasoning findings from taylor Webb)

0

4

0

155

LauraRuis retweeted

Lujain Ibrahim @lujainmibrahim

about 1 month ago

🚨Very excited to see our work on warmth & sycophancy in LLMs out in @Nature today!🚨 We study what happens when LLMs are fine-tuned to be warmer, and find that warmth and sycophancy can be linked, with warm models showing higher errors on a range of benchmarks (🔗s below)

lujainmibrahim's tweet photo. 🚨Very excited to see our work on warmth & sycophancy in LLMs out in @Nature today!🚨

We study what happens when LLMs are fine-tuned to be warmer, and find that warmth and sycophancy can be linked, with warm models showing higher errors on a range of benchmarks (🔗s below) https://t.co/N8OiBDpwac

14

269

61

138

37K

LauraRuis retweeted

Andrew Gordon Wilson

@andrewgwils

about 1 month ago

There's a fourth possibility: humans only appear sample efficient because they've effectively seen a massive amount of data through evolution. Remember, there is a fluidity between the model and the data. The model is a representation of our understanding of data.

55

437

32

128

45K

LauraRuis retweeted

ICLR @iclr_conf

about 1 month ago

That's it for #ICLR2026! See you all next year in the US! Please welcome @jacobandreas as the new Senior Program Chair (with @BharathHarihar3 continuing on as the General Chair)

iclr_conf's tweet photo. That's it for #ICLR2026! See you all next year in the US! Please welcome @jacobandreas as the new Senior Program Chair (with @BharathHarihar3 continuing on as the General Chair) https://t.co/00v3zbl0sV

6

654

38

53

76K

LauraRuis retweeted

Kobi Hackenburg @KobiHackenburg

about 1 month ago

Very excited to see this out! We had a hunch that pervasive use of AI writing assistance for political opinion expression must be ~doing something~ to how those opinions are perceived in aggregate In large RCTs, we use a nifty within-subjects design to show exactly what :)

1

18

1

7

3K

Laura Ruis

@LauraRuis

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users