David Wu @dxwu_ - Twitter Profile

Pinned Tweet

about 1 year ago

Flying to Singapore to present this recent work about weak-to-strong generalization at #ICLR2025. Nearly random weak labels can yield nearly perfect generalization for strong models, under the right scaling regimes...even if the strong model can exactly memorize the weak labels!

dxwu_'s tweet photo. Flying to Singapore to present this recent work about weak-to-strong generalization at #ICLR2025.
Nearly random weak labels can yield nearly perfect generalization for strong models, under the right scaling regimes...even if the strong model can exactly memorize the weak labels! https://t.co/EfPt6QISL6

1

15

1

0

1K

dxwu_ retweeted

Hongxun Wu @HongxunWu

25 days ago

🧵(1/8) An @OpenAI internal reasoning LLM achieved an AI Math milestone: solving an open problem central to its mathematical subfield— in this case, the unit distance problem of discrete geometry. We came across it in a side quest to truly push our model on the hardest problems.

HongxunWu's tweet photo. 🧵(1/8) An @OpenAI internal reasoning LLM achieved an AI Math milestone: solving an open problem central to its mathematical subfield— in this case, the unit distance problem of discrete geometry.

We came across it in a side quest to truly push our model on the hardest problems. https://t.co/fdgXp3aPVp

26

954

134

309

141K

David Wu @dxwu_

9 months ago

@AnshNagda hi!

0

1

0

23

David Wu @dxwu_

about 1 year ago

Come visit my poster in Poster Session 3 to hear more! Or check out the paper at https://t.co/scVrNhgjek

0

2

0

165

Who to follow

Kevin Black

@kvablack

phd @berkeley_ai, research @physical_int

Nived Rajaraman

@Nived_Rajaraman

Postdoc at @MSFTResearch. Formerly @berkeley_ai

Grace Luo

@graceluo_

phd student @berkeley_ai, vision + language

David Wu @dxwu_

about 1 year ago

Flying to Singapore to present this recent work about weak-to-strong generalization at #ICLR2025. Nearly random weak labels can yield nearly perfect generalization for strong models, under the right scaling regimes...even if the strong model can exactly memorize the weak labels!

1

15

1

0

1K

David Wu @dxwu_

about 1 year ago

Our theory also predicts that weak logits can give *better scaling for multiclass problems* than ground truth multiclass labels when there are tons of classes. This corroborates conventional wisdom in distillation.

1

0

186

David Wu @dxwu_

over 1 year ago

Presenting a poster about this recent work at the NeurIPS M3L Workshop tomorrow! Catch me there @ 4:00 PM if you want to hear more about provable toy models for weak-to-strong generalization.

dxwu_'s tweet photo. Presenting a poster about this recent work at the NeurIPS M3L Workshop tomorrow! Catch me there @ 4:00 PM if you want to hear more about provable toy models for weak-to-strong generalization. https://t.co/tEaCynxW95

0

26

0

2

974

dxwu_ retweeted

Haize Labs

@haizelabs

over 1 year ago

We're excited to share our new preprint introducing endless jailbreaks via bijection learning. Our attack exploits the advanced reasoning abilities of frontier LLMs like GPT-4o and Claude 3.5 Sonnet, revealing a critical model vulnerability that arises from capabilities. 🧵(1/n)

haizelabs's tweet photo. We're excited to share our new preprint introducing endless jailbreaks via bijection learning.

Our attack exploits the advanced reasoning abilities of frontier LLMs like GPT-4o and Claude 3.5 Sonnet, revealing a critical model vulnerability that arises from capabilities. 🧵(1/n)

15

314

43

204

46K

dxwu_ retweeted

Shreyas Kapur @shreyaskapur

about 2 years ago

My first PhD paper!🎉We learn *diffusion* models for code generation that learn to directly *edit* syntax trees of programs. The result is a system that can incrementally write code, see the execution output, and debug it. 🧵1/n

111

5K

583

3K

742K

David Wu @dxwu_

over 2 years ago

@rajivmovva thanks raj!

0

374

David Wu @dxwu_

over 2 years ago

My first PhD paper was accepted to NeurIPS'23 as a spotlight! We nail down when overparameterized linear models generalize for multiclass classification in a toy setting, using a nice new tool for concentration in sparse problems. 🧵(1/n)

7

250

9

76

45K

David Wu @dxwu_

over 2 years ago

There are many other cool things I left out here. Check our paper out for more details (https://t.co/z4Jc4dwlc0)! (7/7)

1

18

1

11

2K

David Wu @dxwu_

over 2 years ago

That's where our new tool comes in! It's a variant of the Hanson-Wright inequality for bilinear forms of subgaussian vectors where one side is sparse. This is a standalone result which can be applied to problems with sparse labels. The proof isn't so bad either! (6/n)

1

4

0

1

2K

David Wu

@dxwu_

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users