Jane Pan @JanePan_ - Twitter Profile

JanePan_ retweeted

He He

@hhexiy

2 months ago

https://t.co/H3TAsaThYQ

18

872

128

1K

118K

JanePan_ retweeted

John (Yueh-Han) Chen

@jcyhc_ai

3 months ago

Can LLMs control their chains of thought (CoT)? If so, they could evade CoT monitors 🚨 We introduce the CoT Controllability eval suite to find out. Our results leave us cautiously optimistic that today’s models struggle to obfuscate their CoT in ways that undermine monitorability. In this thread, I explain additional findings that I find interesting. Joint work w/ @OpenAI

3

74

12

24

10K

Jane Pan @JanePan_

4 months ago

Excited to see NeuNeu out — learned scaling laws from open-source LM trajectories!

Michael Hu @michahu8

4 months ago

if you truly believe in the bitter lesson, then why hand design scaling laws? introducing: neural neural scaling laws (NeuNeu), a neural network - trained on open-source LM trajectories - that predicts LMs' future downstream task performance 🧵👇

https://t.co/92dlxJiJKc

4

204

31

137

20K

0

8

0

1

626

JanePan_ retweeted

Vishakh Padmakumar

@vishakh_pk

4 months ago

Our work on LLM novelty as the frontier of original and high-quality output was accepted to #ICLR26! Come talk to us about how model scale, SFT, and RL affect this trade-off! See you in Brazil! 🇧🇷 h/t to my awesome collaborators @hhexiy @valeriechen_ @JanePan_ @jcyhc_ai

https://t.co/JKw6XDmBEk

1

46

6

13

17K

JanePan_ retweeted

Richard Pang @yzpang_

8 months ago

🚨Prompt Curriculum Learning (PCL)
- Efficient LLM RL training algo!
- We investigate factors that affect convergence: bsz, # prompt, # gen, prompt selection
- We propose PCL: lightweight algo that *dynamically selects intermediate-difficulty prompts* using a learned value model

https://t.co/uuBPr7g4Jr

2

169

35

113

24K

Jane Pan @JanePan_

10 months ago

Bored of seeing pristine, perfect posters? Come see me at Hall X5, Board 105 at 6pm to witness my masterpiece, featuring bonus Sharpie scribbles and a QR code that betrayed me at the last moment 😤

Jane Pan @JanePan_

11 months ago

I'll be at ACL Vienna 🇦🇹 next week presenting this work! If you're around, come say hi on Monday (7/28) from 18:00–19:30 in Hall 4/5. Would love to chat about code model benchmarks 🧠, simulating user interactions 🤝, and human-centered NLP in general!

1

52

5

11K

0

23

3

2

3K

Jane Pan @JanePan_

over 1 year ago

When benchmarks talk, do LLMs listen? Our new paper shows that evaluating code LLMs with interactive feedback significantly affects model performance compared to standard static benchmarks! Work w/ @RyanShar01, @jacob_pfau, @atalwalkar, @hhexiy, and @valeriechen_! [1/6]

https://t.co/OYtuGYYpiq

2

54

15

14

11K

1

52

5

11K

JanePan_ retweeted

Vishakh Padmakumar

@vishakh_pk

about 1 year ago

What does it mean for #LLM output to be novel? In work w/ @jcyhc_ai, @JanePan_, @valeriechen_, @hhexiy we argue it needs to be both original and high quality. While prompting tricks trade one for the other, better models (scaling/post-training) can shift the novelty frontier 🧵

https://t.co/2sRGlTUTJJ

2

87

25

36

12K

JanePan_ retweeted

Yulin Chen @YulinChen99

about 1 year ago

We're excited to receive wide attention from the community—thank you for your support! We release code, trained probes, and the generated CoT data👇 https://t.co/Rkw6LJtAyj We have labeled answer data on its way. Stay tuned!

1

43

12

18

5K

Jane Pan @JanePan_

about 1 year ago

Do reasoning models know when their answers are right?🤔 Really excited about this work led by Anqi and @YulinChen99. Check out this thread below!

Yulin Chen @YulinChen99

about 1 year ago

Reasoning models overthink, generating multiple answers during reasoning. Is it because they can’t tell which ones are right? No! We find while reasoning models encode strong correctness signals during chain-of-thought, they may not use them optimally. 🧵 below

https://t.co/Vj6XgMuF4E

10

378

77

327

51K

0

66

7

34

10K

Jane Pan @JanePan_

over 1 year ago

Our work bridges the gap between existing static benchmarks and real-world usage, and we hope to inspire future work on scalable methods for evaluating models in a collaborative setting. Read our preprint at https://t.co/gwPhIJDmqR! [6/6]

https://t.co/JkbnrDQFVP

0

8

1

479

Jane Pan @JanePan_

over 1 year ago

When benchmarks talk, do LLMs listen? Our new paper shows that evaluating code LLMs with interactive feedback significantly affects model performance compared to standard static benchmarks! Work w/ @RyanShar01, @jacob_pfau, @atalwalkar, @hhexiy, and @valeriechen_! [1/6]

2

54

15

14

11K

Jane Pan @JanePan_

over 1 year ago

We also investigate how much a code model adjusts its solution in response to feedback. Weaker models tend to make many surface-level changes that do not greatly change code behavior; stronger models may make relatively small edits that highly affect code behavior. [5/6]

https://t.co/AMG43FKfCL

1

6

0

467

JanePan_ retweeted

Jacob Andreas @jacobandreas

almost 2 years ago

@tallinzen @akyurekekin @yoavartzi @NeelNanda5 I really like the paper from Jane Pan (w @danqi_chen) abt this: https://t.co/rnbu4Mnfab. ICL in big models is clearly a mix of task recognition and "real learning" (you're not learning to translate from 3 examples, but you're not getting an arbitrary label mapping from the prior)

2

29

5

13

3K

Jane Pan @JanePan_

almost 2 years ago

Our pre-print is available on arXiv here: https://t.co/cYUa7vUBOB [7/7]

0

8

1

1K

Jane Pan @JanePan_

almost 2 years ago

Do LLMs exploit imperfect proxies of human preference in context? Yes! In fact, they do it so severely that iterative refinement can make outputs worse when judged by actual humans. In other words, reward hacking can occur even without gradient updates! w/ @hhexiy, @sleepinyourhat, @ihsgnef [1/7]

4

170

30

114

22K

Jane Pan @JanePan_

almost 2 years ago

We follow the canonical definition of reward hacking, observing a divergence between the ground-truth reward (human expert judgment) and its proxy (an LLM judge following the same scoring criteria as the humans). Our results complement recent work on output degradation via iterative refinement when measured with secondary objectives (https://t.co/x4WpZ8sky4) or with reference-based metrics (https://t.co/31ugQmUcZq). [6/7]

1

7

0

1

1K
