Dylan Sam

6 months ago

Finally, I'm presenting work on monitoring models for harmful behaviors, hallucinations, and adversarial manipulation at Poster #1304 in Exhibit Hall C,D,E on 12/5 at 4:30pm! https://t.co/hIirReXAwZ

over 1 year ago

To trust LLMs in deployment (e.g., agentic frameworks or for generating synthetic data), we should predict how well they will perform. Our paper shows that we can do this by simply asking black-box models multiple follow-up questions! w/ @m_finzi and @zicokolter 1/ 🧵

dylanjsam's tweet photo. To trust LLMs in deployment (e.g., agentic frameworks or for generating synthetic data), we should predict how well they will perform. Our paper shows that we can do this by simply asking black-box models multiple follow-up questions! w/ @m_finzi and @zicokolter

1/ 🧵 https://t.co/OrNxpQfgBM

4

116

40

82

15K

0

2

0

425

6 months ago

I'm at NeurIPS this week! Excited to meet old/new friends and chat with people about training safer language models. I'm presenting a few works on safety pretraining, measuring diversity in data curation, and monitoring model behaviors --- more info below 👇

4

37

4

7

4K

6 months ago

Next, I'm presenting on safety pretraining, where we find that incorporating safety behaviors during pretraining leads to more robust language models! Come by Poster #5210 at Exhibit Hall C,D,E at 4:30pm today (12/4)! https://t.co/p1mCATySPL

9 months ago

🚨Excited to introduce a major development in building safer language models: Safety Pretraining! Instead of post-hoc alignment, we take a step back and embed safety directly into pretraining. 🧵(1/n)

dylanjsam's tweet photo. 🚨Excited to introduce a major development in building safer language models: Safety Pretraining!

Instead of post-hoc alignment, we take a step back and embed safety directly into pretraining.

🧵(1/n) https://t.co/667aQmBG5L

7

358

90

239

64K

1

3

0

591

dylanjsam retweeted

Sachin Goyal @goyalsachin007

6 months ago

I’m at NeurIPS this week (12/2-12/8) to present our work on when/how synthetic data (e.g., LLM simulations) can help scientists make inferences with less real data, improving the efficiency of costly experiments. Come by Poster #904 on Thursday 4:30PM (Exhibit Hall C,D,E)!🙂

2

31

4

9

13K

dylanjsam retweeted

Pratyush Maini

@pratyushmaini

6 months ago

Excited about our NeurIPS'25 tutorial Data Privacy, Memorization & Copyright in GenAI with Cooper (co-founder, GenLaw) & Joe (represents OpenAI, Stability in all US copyright litigations) We bring together ML researchers, with those who understand its legal implications. Pls RT

pratyushmaini's tweet photo. Excited about our NeurIPS'25 tutorial
Data Privacy, Memorization & Copyright in GenAI

with Cooper (co-founder, GenLaw) & Joe (represents OpenAI, Stability in all US copyright litigations)

We bring together ML researchers, with those who understand its legal implications. Pls RT https://t.co/eXaYuj4DXo

3

82

22

8

13K

dylanjsam retweeted

Bryan Wilder @brwilder

7 months ago

I gave talks at MIT and Harvard this week about "Science with synthetic data". How can generative models help us learn about the world (e.g., social systems) in a principled way? Lots of interesting conversations; more convinced than ever that there's nuanced issues to navigate

brwilder's tweet photo. I gave talks at MIT and Harvard this week about "Science with synthetic data". How can generative models help us learn about the world (e.g., social systems) in a principled way? Lots of interesting conversations; more convinced than ever that there's nuanced issues to navigate https://t.co/hY6ufAfWOG

1

9

2

4

624

dylanjsam retweeted

7 months ago

📢 Multi-token prediction has long struggled with defining the right “auxiliary target,” leading to tons of heuristics. We show a core limitation of these and propose a simple & sweet idea: future summary prediction. Introducing what I call 🚀TL;DR token pretraining🚀

goyalsachin007's tweet photo. 📢 Multi-token prediction has long struggled with defining the right “auxiliary target,” leading to tons of heuristics. We show a core limitation of these and propose a simple & sweet idea: future summary prediction.

Introducing what I call
🚀TL;DR token pretraining🚀 https://t.co/eVsTiRizD9

4

241

36

159

29K

dylanjsam retweeted

Yuda Song @yus167

8 months ago

🤖 Robots rarely see the true world's state—they operate on partial, noisy visual observations. How should we design algorithms under this partial observability? Should we decide (end-to-end RL) or distill (from a privileged expert)? We study this trade-off in locomotion. 🧵(1/n)

yus167's tweet photo. 🤖 Robots rarely see the true world's state—they operate on partial, noisy visual observations.
How should we design algorithms under this partial observability?
Should we decide (end-to-end RL) or distill (from a privileged expert)?
We study this trade-off in locomotion. 🧵(1/n) https://t.co/IEWVGrPsOx

2

142

40

66

31K

dylanjsam retweeted

Bryan Wilder @brwilder

8 months ago

How can synthetic data from LLMs be used, e.g. for social science, in a principled way? Check out Emily's thread on our NeurIPS paper. The key is to generate each synthetic sample by prompting with a real example -- enables debiased estimates that wouldn't be possible otherwise!

1

10

2

1K

dylanjsam retweeted

8 months ago

14/ I’ll be giving a talk on our work at the #COLM2025 Social Simulations workshop tomorrow (Friday 10/10) at 10AM. Come by Room 523AB!🙂 Paper Link: https://t.co/XH2wk06MRA Code: https://t.co/5NZyzT25p0

0

7

3

1

778

dylanjsam retweeted

8 months ago

13/ I really enjoyed working on this project with the brilliant and kindest @shantanug7 and great mentors @zacharylipton, @DonskerClass and @brwilder

1

5

1

0

638

dylanjsam retweeted

8 months ago

💡Can we trust synthetic data for statistical inference? We show that synthetic data (e.g. LLM simulations) can significantly improve the performance of inference tasks. The key intuition lies in the interactions between the moments of synthetic data and those of real data

yewonbyun_'s tweet photo. 💡Can we trust synthetic data for statistical inference?

We show that synthetic data (e.g. LLM simulations) can significantly improve the performance of inference tasks. The key intuition lies in the interactions between the moments of synthetic data and those of real data https://t.co/WiHkcR0GmO

2

144

36

84

31K

8 months ago

Very interesting insights into understanding when and why synthetic data (although imperfect and biased) can boost the performance of statistical inference!! 📈📈