Sam Buchanan

Verified account

@_sdbuchanan

prev. @Berkeley_EECS, @TTIC_Connect, PhD @EE_ColumbiaSEAS. Training efficiency, theory and practice!

Bay Area, CA

Joined March 2015

1.4K Following

1.9K Followers

379 Posts

Pinned Tweet

8 months ago

We wrote a book about representation learning! It’s fully open source, available and readable online, and covers everything from theoretical foundations to practical algorithms. 👷‍♂️ We’re hard at work updating the content for v2.0, and would love your feedback and contributions

_sdbuchanan's tweet photo. We wrote a book about representation learning!

It’s fully open source, available and readable online, and covers everything from theoretical foundations to practical algorithms.

👷‍♂️ We’re hard at work updating the content for v2.0, and would love your feedback and contributions

14

1K

202

1K

142K

_sdbuchanan retweeted

about 1 month ago

Very excited to release Terminal-Bench 2.1! Coding agents are among the most economically consequential deployments of LLMs to date. As agents improve, benchmark reliability matters more. We audited TB2.0 and found and corrected issues in 28/89 tasks. 30% of the benchmark! But the rankings survived, absolute scores moved up to 12pp!

ekellbuch's tweet photo. Very excited to release Terminal-Bench 2.1!

Coding agents are among the most economically consequential deployments of LLMs to date. As agents improve, benchmark reliability matters more.

We audited TB2.0 and found and corrected issues in 28/89 tasks. 30% of the benchmark!

But the rankings survived, absolute scores moved up to 12pp!

28

768

74

220

85K

2 months ago

@PreetumNakkiran Congrats!!

0

1

0

0

77

3 months ago

@sirbayes @druv_pai @pengwang2003 @YiMaTweets Thank you for your thorough reading of and feedback on the first version, Kevin! Many significant improvements to the new version are downstream of it, like a discussion of latent diffusion in Ch6 :-)

0

2

0

0

329

Who to follow

Verified account

Associate Professor CS/stats UC Berkeley. Former Research Scientist at Google DeepMind. ML/AI Researcher working on LLMs and deep learning. PhD at Stanford.

Simon Shaolei Du

Verified account

@SimonShaoleiDu

Reasoning Chief Scientist @miromind_ai. Associate Professor @uwcse. Prev @xai. Postdoc @the_IAS. PhD in machine learning @mldcmu.

Verified account

Professor @UCLA, Ex-ByteDance Seed | Recent work: Seed2.0, SeedFold, SeedProteo | Opinions are my own

3 months ago

This version of the book also involved huge efforts on the infrastructure side, for the .pdf and web versions of the book, in English and Chinese. Thank you to my awesome collaborators: @druv_pai, @robinwuzy, @TianzheC, @YiMaTweets, Peng Wang, @qu_1006, and many more!

0

4

0

0

405

3 months ago

We've released an updated "v2.0" of our book on deep representation learning! We've reorganized and improved many sections for better pedagogical clarity, and added many new examples and applications throughout the book. Massive thanks are due to folks in the community who submitted feedback and corrections on the first version, including @sirbayes :-) 📕Read: https://t.co/gLoWpLRicB 🛠️Contribute: https://t.co/Fj0REoP1HZ

Kevin Patrick Murphy

3 months ago

I am delighted to see a new version of the book by @_sdbuchanan, @druv_pai , @pengwang2003 and @YiMaTweets . This is the best book on the foundations of deep representation learning! In this era of coding agents, the math is all you need to learn :) https://t.co/3IvoZeFUYA

5

606

99

628

58K

1

40

4

29

6K

3 months ago

@Kangwook_Lee @PUBG @Krafton_AI Congrats!!

0

0

0

0

190

4 months ago

@graceluo_ @feng_jiahai @trevordarrell @AlecRad @JacobSteinhardt Cool work! I was wondering -- is it essential for the interpretability results that the denoiser is a FiLM SwiGLU MLP? Would a different activation, or a DiT or a U-Net work too? I'm curious if the denoiser is doing something like LISTA (https://t.co/RE72U7FqPD)

0

1

0

0

384

4 months ago

@bneyshabur Congrats! Can't wait to see what you're building

0

1

0

0

329

4 months ago

@TheGregYang Give 2046 a try -- same director different vibe!

0

1

0

0

107

4 months ago

@amspector100 wtf this launch is so good

0

2

0

0

505

4 months ago

@_onionesque Ha, thanks Shubhendu! Are you in Mountain View now? I'll tell you over coffee sometime, it's been a journey indeed :-)

1

1

0

0

77

8 months ago

We wrote a book about representation learning! It’s fully open source, available and readable online, and covers everything from theoretical foundations to practical algorithms. 👷‍♂️ We’re hard at work updating the content for v2.0, and would love your feedback and contributions

_sdbuchanan's tweet photo. We wrote a book about representation learning!

It’s fully open source, available and readable online, and covers everything from theoretical foundations to practical algorithms.

👷‍♂️ We’re hard at work updating the content for v2.0, and would love your feedback and contributions

14

1K

202

1K

142K

_sdbuchanan retweeted

4 months ago

Thrilled to have contributed to Terminal-Bench, a benchmark for real-time evaluation of autonomous agents on tasks ranging from debugging system configs to developing protein engineering workflows. My core contribution focused on analyzing agent behavior: how they reason, where they get stuck, and why they fail. A consistent finding? Large models tend to break in similar ways. To build better agents, we don't just need better models, we need to innovate the worlds they learn in! Check out the paper for details. More coming soon.

1

46

12

3

5K

5 months ago

Escape the tyranny of the KV cache at large context lengths via end-to-end test-time training! I had the privilege to work with this team at the beginning of last year. The rigor and vision that went into this is remarkable (metalearning a transformer!?) -- check it out!

5 months ago

LLM memory is considered one of the hardest problems in AI. All we have today are endless hacks and workarounds. But the root solution has always been right in front of us. Next-token prediction is already an effective compressor. We don’t need a radical new architecture. The missing piece is to continue training the model at test-time, using context as training data. Our full release of End-to-End Test-Time Training (TTT-E2E) with @NVIDIAAI, @AsteraInstitute, and @StanfordAILab is now available. Blog: https://t.co/woCpiIrq0T Arxiv: https://t.co/3VkFlS3wx3 This has been over a year in the making with @arnuvtandon and an incredible team.

karansdalal's tweet photo. LLM memory is considered one of the hardest problems in AI.

All we have today are endless hacks and workarounds. But the root solution has always been right in front of us.

Next-token prediction is already an effective compressor. We don’t need a radical new architecture. The missing piece is to continue training the model at test-time, using context as training data.

Our full release of End-to-End Test-Time Training (TTT-E2E) with @NVIDIAAI, @AsteraInstitute, and @StanfordAILab is now available.

Blog: https://t.co/woCpiIrq0T
Arxiv: https://t.co/3VkFlS3wx3

This has been over a year in the making with @arnuvtandon and an incredible team.

91

2K

321

2K

574K

0

14

2

1

1K

_sdbuchanan retweeted

Nikhil Ghosh @nikhilghosh101

5 months ago

Sharing our recent work on understanding the mechanisms underlying the empirical success of hyperparameter transfer using μP! (1/11) with Denny Wu and @albertobietti

nikhilghosh101's tweet photo. Sharing our recent work on understanding the mechanisms underlying the empirical success of hyperparameter transfer using μP! (1/11)

with Denny Wu and @albertobietti https://t.co/KTHwIBwTEr

2

146

33

116

19K

6 months ago

@_onionesque Exciting, congrats!!

0

1

0

0

141

6 months ago

It's been inspiring to see @brenthyi grow this project over the past three years!! The best library I know for bootstrapping research code into the terminal with zero friction 🫡

Brent Yi @brenthyi

6 months ago

tyro 1.0 is out 🐣 This has been a pet project/niche interest of mine for ~4 years now, so it's a bit of a sentimental moment... https://t.co/bAibP3RjxE

11

180

22

58

43K

1

6

0

1

1K

6 months ago

@exnx Congrats!

0

0

0

0

228

6 months ago

@DimitrisPapail Great movie!

0

2

0

0

76

6 months ago

@nabla_theta 68%

0

2

0

0

275

_sdbuchanan retweeted

6 months ago

Excited to be @ #Neurips2025 presenting Weaver, our approach for combining multiple weak verifiers to narrow the generation-verification gap. Come talk to us today from 11:00 AM – 2:00 PM PST Exhibit Hall C,D,E #3714. If you're working on reliable agents, evaluation, or self-verification, this is for you and I would love to connect.

ekellbuch's tweet photo. Excited to be @ #Neurips2025 presenting Weaver, our approach for combining multiple weak verifiers to narrow the generation-verification gap. Come talk to us today from 11:00 AM – 2:00 PM PST Exhibit Hall C,D,E #3714. If you're working on reliable agents, evaluation, or self-verification, this is for you and I would love to connect.

3

26

6

3

2K

Last Seen Users on Sotwe

Trends for you

Most Popular Users