Pete Shaw

@ptshaw2

Research Scientist @GoogleDeepmind

Seattle, WA

Joined January 2013

576 Following

757 Followers

117 Posts

ptshaw2 retweeted

28 days ago

🎉 Excited to share that our work on intrinsic dimensionality of reasoning has been accepted to #ICML2026 as a ✨spotlight✨ (top 2.2%)! We analyze the effectiveness of teaching a model how to reason via the lens of intrinsic dimensionality (the minimum effective capacity a model needs to solve the task) and find that effective reasoning chains are inherently compressive! Across Gemma-3 1B and 4B, lower intrinsic dimensionality strongly predicts not only in-distribution accuracy (GSM8K), but also robustness on OOD benchmarks (GSM-Hard, GSM-Symbolic, GSM-IC) -- outperforming reasoning length, token perplexity, and KL divergence. Stay tuned for more results and exciting updates in the camera-ready! 🚀

2

197

38

100

20K

Pete Shaw @ptshaw2

about 1 month ago

@LH Yes, in the limit these objectives are not computable, but any real model is finite. Hopefully this is clear enough in the paper.

0

1

0

0

8

Pete Shaw @ptshaw2

about 2 months ago

I will be presenting this paper at ICLR next week! 🇧🇷 Come chat about Kolmogorov complexity, the MDL principle, and what this all means for training better models! 🧵

Pete Shaw @ptshaw2

8 months ago

Excited to share a new paper that aims to narrow the conceptual gap between the idealized notion of Kolmogorov complexity and practical complexity measures for neural networks.

ptshaw2's tweet photo. Excited to share a new paper that aims to narrow the conceptual gap between the idealized notion of Kolmogorov complexity and practical complexity measures for neural networks. https://t.co/CM3qcflVr8

2

144

23

144

29K

3

105

12

55

10K

Pete Shaw @ptshaw2

about 1 month ago

@LH The paper studies description length measures that are *asymptotically* optimal in the limit as computational resources increase.

1

1

0

0

9

Who to follow

Verified account

Assistant professor @Berkeley_EECS @berkeley_ai || Research scientist at @allen_ai || PhD from @uwcse @uwnlp

Hanna Hajishirzi

Verified account

@HannaHajishirzi

Sr. Director of AI at @allen_ai, Prof at @uw_cse, lead OLMo, Tulu

Sebastian Gehrmann

Making AI trustworthy as Head of Responsible AI in the CTOs office @Bloomberg. Formerly LLMs @ Google Brain / PhD @ Harvard. views my own

Pete Shaw @ptshaw2

about 2 months ago

@unitambo Hi Phillip. Sounds interesting. Do you have a more detailed write-up?

1

0

0

0

46

Pete Shaw @ptshaw2

8 months ago

Excited to share a new paper that aims to narrow the conceptual gap between the idealized notion of Kolmogorov complexity and practical complexity measures for neural networks.

ptshaw2's tweet photo. Excited to share a new paper that aims to narrow the conceptual gap between the idealized notion of Kolmogorov complexity and practical complexity measures for neural networks. https://t.co/CM3qcflVr8

2

144

23

144

29K

Pete Shaw @ptshaw2

about 2 months ago

Aside from the paper, I’m also interested in work related to LLM post-training (RL, evals, tools, agents) and ML applications in science (particularly biology).

0

1

0

0

140

Pete Shaw @ptshaw2

about 2 months ago

Details: https://t.co/LL2qnMjXxx Paper: https://t.co/0z8wSTrRMN

1

2

0

2

185

ptshaw2 retweeted

Pete Shaw @ptshaw2

3 months ago

I particularly enjoyed the perspectives in this blog post and the paper it is based on... [2/2] https://t.co/oULJB9XsBA

0

5

3

0

883

Pete Shaw @ptshaw2

3 months ago

I particularly enjoyed the perspectives in this blog post and the paper it is based on... [2/2] https://t.co/oULJB9XsBA

Emiliano Penaloza

3 months ago

Link to full post: https://t.co/eYROt5UNlj This was joint work with some great folks: @dheeraj_46329, @siddarthv66 and @MassCaccia

0

14

2

11

2K

0

5

3

0

883

Pete Shaw @ptshaw2

3 months ago

Lots of interesting recent work related to the information asymmetry introduced by conditioning a teacher on privileged information [1/2]

Martin Klissarov @MartinKlissarov

4 months ago

In the limit, what's important is our ability to adapt. What is a good recipe for teaching agents to adapt on-the-fly? We introduce two meta-learning for LLMs papers written with @JonnyCoook at @GoogleDeepMind. This is research from last year we can finally share 🧵👇

MartinKlissarov's tweet photo. In the limit, what's important is our ability to adapt.

What is a good recipe for teaching agents to adapt on-the-fly?

We introduce two meta-learning for LLMs papers written with @JonnyCoook at @GoogleDeepMind.

This is research from last year we can finally share 🧵👇 https://t.co/MtiyTHrXF4

9

249

48

286

53K

1

22

1

21

3K

ptshaw2 retweeted

4 months ago

🚨 I’m on the 2026 Research Scientist Job Market! I am a PhD student at UNC Chapel Hill (advised by @mohitban47) and recipient of the Apple Scholars in AI/ML PhD Fellowship. My research centers around: 🔸Reasoning & RL/Post-Training: Evaluating and interpreting the reasoning process, and improving post-training and alignment through self-generated and reward-based signals (Intrinsic Dim., ReCEVAL, ScPO, LASeR). 🔸Agents & Planning: Designing adaptive agent frameworks to that use extra test-time compute & reasoning upon failure (ADaPT, System-1.x, PRInTS). 🔸Reward & Skill Discovery in Code: Leveraging execution signals to build reliable rewards, automate debugging, and discover abstractions in code (UTGen, ReGAL). Prev (Research Intern): Google DeepMind, Meta FAIR, Allen Institute for AI (AI2), and Adobe Research. Feel free to reach out via DM or email if you’re interested, have leads, or would like to connect! 🌐 https://t.co/17h5KwDZHA 📧 [email protected] #NLP #AI #JobSearch

15

345

58

121

56K

Pete Shaw @ptshaw2

4 months ago

Good reasoning strategies make a task more compressible. I found this to be an elegant and intuitive perspective on why effective reasoning leads to better generalization, and was a lot of fun working with @ArchikiPrasad and team on this!

4 months ago

🚨Excited to share our new work viewing reasoning strategies as teaching tools: for fixed target model, which CoT strategies best support learning and generalization? ✨Our answer is intrinsic dimensionality (minimum effective capacity a model needs to solve the task). Somewhat counterintuitively, adding CoT – which requires generating longer and more structured outputs – can reduce learning complexity. Good reasoning compresses the task, i.e., it reduces the degrees of freedom the model needs to map inputs to correct solutions. 🧵⬇️ (1/5)

ArchikiPrasad's tweet photo. 🚨Excited to share our new work viewing reasoning strategies as teaching tools: for fixed target model, which CoT strategies best support learning and generalization?

✨Our answer is intrinsic dimensionality (minimum effective capacity a model needs to solve the task).

Somewhat counterintuitively, adding CoT – which requires generating longer and more structured outputs – can reduce learning complexity. Good reasoning compresses the task, i.e., it reduces the degrees of freedom the model needs to map inputs to correct solutions.

🧵⬇️ (1/5)

5

207

48

156

45K

1

35

5

12

4K

ptshaw2 retweeted

4 months ago

This is absolutely shameful. Agents of a federal agency unnecessarily escalating, and then executing a defenseless citizen whose offense appears to be using his cell phone camera. Every person regardless of political affiliation should be denouncing this.

246

8K

948

423

980K

Pete Shaw @ptshaw2

5 months ago

@AdaptiveAgents Seems like learnability challenges are more relevant than expressivity limits in the context of approximating universal compressors?

0

2

0

0

456

ptshaw2 retweeted

François Chollet

5 months ago

The goal of AI should not be to replace human thought and human agency, but to expand them. Not everything needs to be automated.

143

929

90

131

67K

Pete Shaw @ptshaw2

6 months ago

@fchollet This view is often used to motivate symbolic representations, but DL models can in theory also learn optimal compression if we move past parameter counting as a description length measure: https://t.co/0z8wSTrRMN But either way, hard to optimize.

0

4

0

2

189

Pete Shaw @ptshaw2

7 months ago

https://t.co/YJbr49POus

Pete Shaw @ptshaw2

8 months ago

Excited to share a new paper that aims to narrow the conceptual gap between the idealized notion of Kolmogorov complexity and practical complexity measures for neural networks.

ptshaw2's tweet photo. Excited to share a new paper that aims to narrow the conceptual gap between the idealized notion of Kolmogorov complexity and practical complexity measures for neural networks. https://t.co/CM3qcflVr8

2

144

23

144

29K

0

2

0

1

107

Pete Shaw @ptshaw2

7 months ago

Good time to plug our recent paper connecting the notion of Kolmogorov complexity to Transformers, inspired by the work of Schmidhuber and many others... 🧵

Jürgen Schmidhuber

7 months ago

SchmidhuberAI's tweet photo. https://t.co/CHxpjLXlFP

54

1K

98

634

161K

1

4

0

2

349

ptshaw2 retweeted

Conference on Language Modeling @COLM_conf

8 months ago

Outstanding paper 3🏆: Don't lie to your friends: Learning what you know from collaborative self-play https://t.co/hvY1oaF6Jf

COLM_conf's tweet photo. Outstanding paper 3🏆: Don't lie to your friends: Learning what you know from collaborative self-play
https://t.co/hvY1oaF6Jf https://t.co/OCvxWGXc7h

1

40

11

14

11K

ptshaw2 retweeted

Google DeepMind @GoogleDeepMind

8 months ago

Our new Gemini 2.5 Computer Use model can navigate browsers just like you do. 🌐 It builds on Gemini’s visual understanding and reasoning capabilities to power agents that can click, scroll and type for you online - setting a new standard on multiple benchmarks, with faster speed.

GoogleDeepMind's tweet photo. Our new Gemini 2.5 Computer Use model can navigate browsers just like you do. 🌐

It builds on Gemini’s visual understanding and reasoning capabilities to power agents that can click, scroll and type for you online - setting a new standard on multiple benchmarks, with faster speed.

107

3K

338

703

453K

Last Seen Users on Sotwe

Trends for you

Most Popular Users