Gene Li @geneli0 - Twitter Profile

Pinned Tweet

Gene Li @geneli0

11 months ago

like everyone else i am hopping on the blog post trend https://t.co/t3Rma35FWC

Gene Li @geneli0

about 1 month ago

Happy to chat about this paper, RL, deep learning, etc. Feel free to reach out! Poster Session 6 on Saturday!

Nirmit Joshi @nirmitj_

about 1 month ago

@geneli0 will be presenting our paper @iclr_conf 🇧🇷 (this Saturday), which has implications for SFT of LLMs. https://t.co/QZlmCBS1vT

geneli0 retweeted

Mayee Chen

@MayeeChen

4 months ago

Data mixing - determining ratios across your training datasets - matters a lot for model quality. While building Olmo 3, we learned it’s hard to set up a method that finds a strong mix, and hard to maintain that mix as datasets change throughout development. Introducing Olmix👇

https://t.co/xqFxujcrsk

geneli0 retweeted

Mayee Chen

@MayeeChen

7 months ago

Thrilled to have contributed to Olmo 3! The best fully open 32B model (data, training recipes, checkpoints and more!) As an intern at AI2 these last 8 months, I’ve grown to deeply appreciate the careful science, iteration, and collaboration that go into models like this and have learned so much from the team. I am more optimistic than ever about the future of open-source and data-centric research right now. My particular contribution was working on the Dolma 3 data mix 👩‍🍳 I was able to apply ideas from some of my earlier mixing work, explore new problem settings, and see firsthand the data challenges that arise when building datasets intended for real models at scale. More on this coming soon!

Gene Li @geneli0

8 months ago

Check this out! Some fun work with interesting implications for LLM training 🧐

Nirmit Joshi @nirmitj_

8 months ago

Very satisfied with some neat results on imitation learning. When distribution matching isn’t possible, what’s even the role of demonstrations? Cloning/log-loss minimization? We propose directly encoding reward structure—motivating new algorithmic ideas. https://t.co/QZlmCBSzlr

geneli0 retweeted

Charlie Hou

@hou_char

8 months ago

Gave a talk at @OpenAI on our work 🌸 POPri “Policy Optimization for Private Data”. POPri is a huge improvement in synthetic data generation under security+privacy constraints! Learn more:

https://t.co/h5I0jlfIt8
