Howard Chen @__howardchen - Twitter Profile

21 days ago

@princeton_nlp graduated 11 (!) PhDs at the hooding ceremony yesterday. Their research has fundamentally shaped the global AI landscape in so many ways, and they have been a core part of building the NLP group and its collegial spirit. It's been a real privilege working with and getting to know them all over the past few years! @ZexuanZhong @AmeetDeshpande_ @SadhikaMalladi @_carlosejimenez @JensTuyls @VishvakM @__howardchen @xiamengzhou @gaotianyu1350 @danfriedman0 @_awettig w/ @danqi_chen @prfsanjeevarora

karthik_r_n's tweet photo. @princeton_nlp graduated 11 (!) PhDs at the hooding ceremony yesterday. Their research has fundamentally shaped the global AI landscape in so many ways, and they have been a core part of building the NLP group and its collegial spirit. It's been a real privilege working with and getting to know them all over the past few years!
@ZexuanZhong @AmeetDeshpande_ @SadhikaMalladi @_carlosejimenez @JensTuyls @VishvakM @__howardchen @xiamengzhou @gaotianyu1350 @danfriedman0 @_awettig

w/ @danqi_chen @prfsanjeevarora

6

185

18

16

24K

__howardchen retweeted

Danqi Chen

@danqi_chen

21 days ago

Hooded six PhD students yesterday, my very first cohort at Princeton: Zexuan Zhong (@ZexuanZhong, 2024), Dan Friedman (@danfriedman0, 2025), Howard Chen (@__howardchen, 2025), Mengzhou Xia (@xiamengzhou, 2025), Tianyu Gao (@gaotianyu1350, 2025), and Alex Wettig (@_awettig, 2026)! They started their PhD at the beginning of the pandemic and lived through one of the most revolutionary stretches our field has ever seen. Their work has shaped how we think about language models today. So proud of them, and can't wait to see what they do next!

10

632

30

62

64K

__howardchen retweeted

Noam Razin @noamrazin

about 1 month ago

📰 RL for LMs often relies on imperfect proxy rewards, which can lead to reward hacking. But are incorrect rewards necessarily harmful? Turns out, they can also be benign or even beneficial! This has implications for reward model evaluation and verifiable reward design. 🧵

noamrazin's tweet photo. 📰 RL for LMs often relies on imperfect proxy rewards, which can lead to reward hacking. But are incorrect rewards necessarily harmful?

Turns out, they can also be benign or even beneficial!
This has implications for reward model evaluation and verifiable reward design.

🧵 https://t.co/IRP5zw3GtN

1

202

27

148

31K

Howard Chen @__howardchen

7 months ago

We let agents accumulate its context freely assuming little or no side-effects. This may not be the case! Sometimes they answer political or moral questions differently and even act differently after reading or conducting research. More analysis in the thread!

Jiayi Geng

@JiayiiGeng

7 months ago

We use LLMs for everyday tasks—research, writing, coding, decision-making. They remember our conversations, adapt to our needs and preferences. Naturally, we trust them more with repeated use. But this growing trust might be masking a hidden risk: what if their beliefs are shifting and we don't notice? We study the question "Do LM assistants change their beliefs as context accumulates?" in our new preprint: 👇 (1/n)

JiayiiGeng's tweet photo. We use LLMs for everyday tasks—research, writing, coding, decision-making. They remember our conversations, adapt to our needs and preferences. Naturally, we trust them more with repeated use.

But this growing trust might be masking a hidden risk: what if their beliefs are shifting and we don't notice?

We study the question "Do LM assistants change their beliefs as context accumulates?" in our new preprint: 👇
(1/n)

20

363

72

244

63K

0

10

1

2

2K

Who to follow

LLM Efficiency @NVIDIA - views have always been only my own 🥇🥈 @ Flunkyball Polish Championships

Hao Zhu

@_Hao_Zhu

Building the AI social brain for humans @StanfordNLP PhD @LTIatCMU

Howard Chen @__howardchen

7 months ago

@liliyu_lili @thinkymachines Congrats Lili!!!!

0

109

Howard Chen @__howardchen

8 months ago

Very beautiful, very powerful.

Chieh-Hsin (Jesse) Lai

@JCJesseLai

8 months ago

Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core ideas that shaped diffusion modeling and explains how today’s models work, why they work, and where they’re heading. 🧵You’ll find the link and a few highlights in the thread. We’d love to hear your thoughts and join some discussions! ⚡ Stay tuned for our markdown version, where you can drop your comments!

JCJesseLai's tweet photo. Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on!

📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon.

It traces the core ideas that shaped diffusion modeling and explains how today’s models work, why they work, and where they’re heading.

🧵You’ll find the link and a few highlights in the thread.
We’d love to hear your thoughts and join some discussions!

⚡ Stay tuned for our markdown version, where you can drop your comments!

53

2K

495

3K

858K

0

3

0

748

Howard Chen @__howardchen

9 months ago

This is what we should be using AI for. Yeah, science!

Google DeepMind @GoogleDeepMind

9 months ago

We’re announcing a major advance in the study of fluid dynamics with AI 💧 in a joint paper with researchers from @BrownUniversity, @nyuniversity and @Stanford.

GoogleDeepMind's tweet photo. We’re announcing a major advance in the study of fluid dynamics with AI 💧 in a joint paper with researchers from @BrownUniversity, @nyuniversity and @Stanford. https://t.co/HevQE7mKI8

178

5K

712

1K

1M

0

3

0

713

Howard Chen @__howardchen

10 months ago

@DimitrisPapail Essentially an updated version of the Shakespeare-typing monkey? But now the monkey gets rewarded and learns (slowly from scratch). Though conceptually it feels more like "inverse RL" where the reward func is simply the exact match against the expert demos and not a learned one.

0

1

0

198

Howard Chen @__howardchen

12 months ago

Agents need a new type of learning in the era of experience (imagining 4.0). Not quite gradient descent and not exactly in-context learning. Experience never ends so you'd need metabolism. Many emerging ideas recently but none cracked it yet.

Rohan Paul

@rohanpaul_ai

12 months ago

Andrej Karpathy: Software Is Changing (Again) Key learning points from this brilliant lecture from yesterday. 🚀 The Shifting Software Map For 70 years code flowed in one style, then neural networks arrived and rewrote large patches of logic. Karpathy divides eras into software 1.0 for handwritten instructions, software 2.0 for trained weights, and software 3.0 for programmable large language models that obey plain-text prompts. Each layer still matters, so future engineers must move smoothly among explicit code, dataset tuning, and prompt design.

rohanpaul_ai's tweet photo. Andrej Karpathy: Software Is Changing (Again)

Key learning points from this brilliant lecture from yesterday.

🚀 The Shifting Software Map

For 70 years code flowed in one style, then neural networks arrived and rewrote large patches of logic.

Karpathy divides eras into software 1.0 for handwritten instructions, software 2.0 for trained weights, and software 3.0 for programmable large language models that obey plain-text prompts.

Each layer still matters, so future engineers must move smoothly among explicit code, dataset tuning, and prompt design.

2

184

43

156

22K

0

12

0

2

2K

__howardchen retweeted

Jiayi Geng

@JiayiiGeng

12 months ago

I'm thrilled to share that I've moved to Pittsburgh and joined NeuLab at CMU as a research intern this summer, advised by @gneubig! I'll also start my PhD @LTIatCMU this fall. Feel free to reach out if you're interested in chatting about multi-agent systems, LLMs for scientific discovery, or cognitive science! Special thanks to all the amazing people who've inspired and supported me throughout my master's journey at Princeton, especially my advisors @danqi_chen and Tom Griffiths (@cocosci_lab), and my mentor @__howardchen! I'm deeply grateful for their incredible guidance and encouragement!🐯🎓

12

364

13

78

39K

Howard Chen @__howardchen

about 1 year ago

Science is about inferring underlying rules/dynamics of a system. Can SoTA LLMs do it reliably? Despite all the progress in building AI scientists, we find it still nontrivial for models to reverse-engineer simple black-box systems. More insights/analysis in the thread!

Jiayi Geng

@JiayiiGeng

about 1 year ago

Using LLMs to build AI scientists is all the rage now (e.g., Google’s AI co-scientist [1] and Sakana’s Fully Automated Scientist [2]), but how much do we understand about their core scientific abilities? We know how LLMs can be vastly useful (solving complex math problems) yet unreliable (counting the number of "R"s in "strawberry" or calculating 9.9 - 9.11) at the same time. Similarly, despite recent advances in applying LLMs to science, are we confident that they can reliably uncover the underlying mechanism of a simple black-box system in a controlled setting? We study this question in our new preprint: 📢👇 (1/n)

JiayiiGeng's tweet photo. Using LLMs to build AI scientists is all the rage now (e.g., Google’s AI co-scientist [1] and Sakana’s Fully Automated Scientist [2]), but how much do we understand about their core scientific abilities?
We know how LLMs can be vastly useful (solving complex math problems) yet unreliable (counting the number of "R"s in "strawberry" or calculating 9.9 - 9.11) at the same time. Similarly, despite recent advances in applying LLMs to science, are we confident that they can reliably uncover the underlying mechanism of a simple black-box system in a controlled setting?

We study this question in our new preprint: 📢👇
(1/n)

12

484

74

432

73K

0

11

0

2K

Howard Chen @__howardchen

about 1 year ago

Experience is the data of AI. Absolutely.

Richard Sutton

@RichardSSutton

about 1 year ago

Rich's slogans for AI research (revised 2006): 1. Approximate the solution, not the problem (no special cases) 2. Drive from the problem 3. Take the agent’s point of view 4. Don’t ask the agent to achieve what it can’t measure 5. Don't ask the agent to know what it can't verify 6. Set measurable goals for subparts of the agent 7. Discriminative models are usually better than generative models 8. Work by orthogonal dimensions. Work issue by issue 9. Work on ideas, not software 10. Experience is the data of AI https://t.co/UHpKNbatvZ

12

908

158

555

60K

0

1

0

880

__howardchen retweeted

Noam Razin @noamrazin

about 1 year ago

The success of RLHF depends heavily on the quality of the reward model (RM), but how should we measure this quality? 📰 We study what makes a good RM from an optimization perspective. Among other results, we formalize why more accurate RMs are not necessarily better teachers! 🧵

noamrazin's tweet photo. The success of RLHF depends heavily on the quality of the reward model (RM), but how should we measure this quality?

📰 We study what makes a good RM from an optimization perspective. Among other results, we formalize why more accurate RMs are not necessarily better teachers!
🧵 https://t.co/lSfffqhbjs

8

825

133

651

108K

__howardchen retweeted

Alex Wettig @_awettig

over 1 year ago

🤔 Ever wondered how prevalent some type of web content is during LM pre-training? In our new paper, we propose WebOrganizer which *constructs domains* based on the topic and format of CommonCrawl web pages 🌐 Key takeaway: domains help us curate better pre-training data! 🧵/N

_awettig's tweet photo. 🤔 Ever wondered how prevalent some type of web content is during LM pre-training?

In our new paper, we propose WebOrganizer which *constructs domains* based on the topic and format of CommonCrawl web pages 🌐

Key takeaway: domains help us curate better pre-training data! 🧵/N https://t.co/qptz231z3u

5

208

58

106

49K

Howard Chen @__howardchen

over 1 year ago

@abacaj Smells like model’s reward hacking party lol

0

3

0

175

Howard Chen @__howardchen

over 1 year ago

This is truly heartbreaking.

Nick Hill @nickhill33

over 1 year ago

@douwekiela @FelixHill84 Felix’s story: https://t.co/BAyUNXeMuS

6

219

49

139

35K

0

670

Howard Chen @__howardchen

over 1 year ago

@archit_sharma97 Not exactly though? I mean the KL is itself E_{y~\pi}[log \pi / \pi_0] so if you want to wrap the expectation outside then it should be log \pi/\pi_0 in the bracket?

1

0

352

Howard Chen @__howardchen

over 1 year ago

Are we so back or not?

0

8

0

1

1K

Howard Chen @__howardchen

over 1 year ago

Great thread.

Sebastian Seung

@SebastianSeung

over 1 year ago

The theoretical physics approach to neural nets was launched by @HopfieldJohn in this classic 1982 paper that introduced the "energy function" to associative memory models. https://t.co/HekdYdvvJc

SebastianSeung's tweet photo. The theoretical physics approach to neural nets was launched by @HopfieldJohn in this classic 1982 paper that introduced the "energy function" to associative memory models. https://t.co/HekdYdvvJc https://t.co/dsJIg14IkH

2

61

22

26

18K

0

3

0

1

888

Howard Chen

@__howardchen

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users