Jonathan @ICLR @lightetal - Twitter Profile

Pinned Tweet

3 months ago

Post-training LLMs is like mixing a cocktail: Too much easy data → no learning Too much hard data → instability Wrong balance → collapse And today, we mix it by hand. What if the data mixture could be learned instead of hand-tuned? https://t.co/2EntaqvO2G 🧵👇

lightetal's tweet photo. Post-training LLMs is like mixing a cocktail:

Too much easy data → no learning
Too much hard data → instability
Wrong balance → collapse

And today, we mix it by hand.

What if the data mixture could be learned instead of hand-tuned?

https://t.co/2EntaqvO2G

🧵👇 https://t.co/vbiuY9rXZR

4

114

14

93

23K

Jonathan @ICLR

@lightetal

9 days ago

@aaliya_va @sarahookr It’s always fun connecting with curious minds!

0

1

0

23

lightetal retweeted

Sara Hooker

@sarahookr

12 days ago

Our second continual learning dinner + community gathering is coming up next Thursday evening. We cutoff the guestlist on Tuesday to give names to security. Sign up before then. Will be very fun. https://t.co/uVVMYjDUAu

3

46

5

8

4K

Jonathan @ICLR

@lightetal

15 days ago

@Matthewagi Thanks!!

0

1

0

10

Jonathan @ICLR

@lightetal

26 days ago

Accepted at #ICML2026 recently!

Jonathan @ICLR

@lightetal

about 1 month ago

AI runs on data. But… data is hard to buy. ❌ How much is a dataset worth? ❌ Will it actually help your model? ❌ What about privacy / trust? What if we never priced data directly… and instead priced what actually matters: model improvement? https://t.co/7L2j7GMS31 🧵👇

1

11

3

12

7K

1

12

0

3

6K

Jonathan @ICLR

@lightetal

17 days ago

The real AGI was the researchers Anthropic recruited along the way.

Andrej Karpathy

@karpathy

17 days ago

Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.

8K

150K

11K

14K

27M

1

3

0

171

Jonathan @ICLR

@lightetal

21 days ago

@violazhouyi @restofworld First time getting quoted in a feature article 😂

0

1

0

37

Jonathan @ICLR

@lightetal

22 days ago

@tw_killian @BYU @BYUCS Super exciting, congrats Taylor!

0

1

0

77

Jonathan @ICLR

@lightetal

23 days ago

@atharva_sehgal This is so cool!

0

70

Jonathan @ICLR

@lightetal

24 days ago

@michaelryan207 @lateinteraction @KnightHennessy Congrats Michael!!

1

0

83

Jonathan @ICLR

@lightetal

25 days ago

🚀 Super excited to present at the continual learning meetup!

Nilou Salehi

@nilou_salehi

26 days ago

📣 The May speaker of our researcher meetup on continual learning will be Caltech's @lightetal Jonathan Li! 📣 Jonathan will share recent work on structured, language-native search methods such as DISC and SFS, which enable agents to explore diverse reasoning paths, strategies, and solutions beyond what is directly represented in their training data. Space is limited and last time we had a waitlist of 200+ so please register soon if you'd like to join. @sarahookr @mralbertchun @NikzadAfshin https://t.co/lm1pkfdAk0

0

7

4

6

3K

0

5

0

2

770

Jonathan @ICLR

@lightetal

26 days ago

@MimeeXu Wow this is very cool! Thanks for sharing

0

1

0

20

Jonathan @ICLR

@lightetal

about 1 month ago

AI runs on data. But… data is hard to buy. ❌ How much is a dataset worth? ❌ Will it actually help your model? ❌ What about privacy / trust? What if we never priced data directly… and instead priced what actually matters: model improvement? https://t.co/7L2j7GMS31 🧵👇

1

11

3

12

7K

Jonathan @ICLR

@lightetal

about 1 month ago

Grateful to amazing collaborators on this work, including: Minbiao Han, Steven Xia, @haifengxu0 @SainyamGalhotra @raulcfernandez @Cornell @UChicago Paper: https://t.co/3VRxAl7VZP 🧵 9/n

0

2

0

142

Jonathan @ICLR

@lightetal

about 1 month ago

The big takeaway: 👉 Don’t sell data. 👉 Sell what data does. By pricing model improvement, we can: • unlock data markets • align incentives • make ML systems more accessible • ensure privacy This is a step toward real AI economies. 🧵 8/n

1

0

139

Jonathan @ICLR

@lightetal

Last Seen Users on Sotwe

Trends for you

Most Popular Users