Sean Hendryx @SeanHendryx - Twitter Profile

Pinned Tweet

about 1 year ago

What will the learning environments of the future look like that train artificial super intelligence? In recent work at @scale_AI , we show that training systems that combine verifiable rewards with multi-agent interaction accelerate learning.

SeanHendryx's tweet photo. What will the learning environments of the future look like that train artificial super intelligence? In recent work at @scale_AI , we show that training systems that combine verifiable rewards with multi-agent interaction accelerate learning. https://t.co/WwFjjOJS5P

12

130

28

99

24K

SeanHendryx retweeted

Anisha Gunjal @anisha_gunjal

6 months ago

Great to see our work, Rubrics as Rewards, featured in the latest RLHF Book update 📘🚀 Rubric-based RLVR is emerging as a practical tool for modern training and evaluation. See §13.4 at https://t.co/uuhUIUBvNE. 📖

0

9

2

760

SeanHendryx retweeted

Manasi Sharma @ ICLR 2026 @ManasiSharma_

7 months ago

🚀New @scale_AI paper: 𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵𝗥𝘂𝗯𝗿𝗶𝗰𝘀, a benchmark for evaluating Deep Research (DR) agents. Even top agents like Gemini & OpenAI DR achieve <𝟲𝟴% 𝗿𝘂𝗯𝗿𝗶𝗰 𝗰𝗼𝗺𝗽𝗹𝗶𝗮𝗻𝗰𝗲. We built 𝟮.𝟱𝗞+ expert rubrics with 𝟮.𝟴𝗞+ hrs of human labor to measure why.

ManasiSharma_'s tweet photo. 🚀New @scale_AI paper: 𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵𝗥𝘂𝗯𝗿𝗶𝗰𝘀, a benchmark for evaluating Deep Research (DR) agents. Even top agents like Gemini & OpenAI DR achieve <𝟲𝟴% 𝗿𝘂𝗯𝗿𝗶𝗰 𝗰𝗼𝗺𝗽𝗹𝗶𝗮𝗻𝗰𝗲. We built 𝟮.𝟱𝗞+ expert rubrics with 𝟮.𝟴𝗞+ hrs of human labor to measure why. https://t.co/aPYN3WZBhW

12

222

33

129

32K

SeanHendryx retweeted

Bing Liu

@vbingliu

9 months ago

🚀 Introducing SWE-Bench Pro — a new benchmark to evaluate LLM coding agents on real, enterprise-grade software engineering tasks. This is the next step beyond SWE-Bench: harder, contamination-resistant, and closer to real-world repos.

54

1K

110

351

569K

Who to follow

Monica

@monicaeyebee

Roast me: https://t.co/wQztw0pGNN

Blake Resnick

@Blake_Resnick_

Founder and CEO of @BrincDrones. Forbes 30 Under 30. Thiel Fellow. Building technology in the service of public safety.

Adham

@adhamelarabawy

Research @GoogleDeepMind. Opinions mine.

SeanHendryx retweeted

Anisha Gunjal @anisha_gunjal

11 months ago

🤔 How do we train LLMs on real-world tasks where it’s hard to define a single verifiable answer? Our work at @scale_AI introduces Rubrics as Rewards (RaR) — a framework for on-policy post-training that uses structured, checklist-style rubrics as interpretable reward signals. 🧵

anisha_gunjal's tweet photo. 🤔 How do we train LLMs on real-world tasks where it’s hard to define a single verifiable answer?

Our work at @scale_AI introduces Rubrics as Rewards (RaR) — a framework for on-policy post-training that uses structured, checklist-style rubrics as interpretable reward signals. 🧵 https://t.co/hN4QjqnS8k

6

252

41

201

57K

Sean Hendryx

@SeanHendryx

11 months ago

@karpathy a neat quality specific to language models is that you can just tell them what to do differently when they fail. And if you use importance sampling, gradients are aligned with the unguided context and it gets into the weights directly. No sleep needed https://t.co/qJ2Qv43rYp

Sean Hendryx

@SeanHendryx

about 1 year ago

For online RL, we introduce Guide, a class of algorithms which incorporate guidance into the model’s context when all rollouts fail and adjusts the importance sampling ratio in order to optimize the policy for contexts in which guidance is no longer present.

SeanHendryx's tweet photo. For online RL, we introduce Guide, a class of algorithms which incorporate guidance into the model’s context when all rollouts fail and adjusts the importance sampling ratio in order to optimize the policy for contexts in which guidance is no longer present. https://t.co/nCda4RdUdB

1

2

1

0

1K

0

5

1

2

574

SeanHendryx retweeted

Miles Turpin @milesaturpin

12 months ago

New @Scale_AI paper! 🌟 LLMs trained with RL can exploit reward hacks but not mention this in their CoT. We introduce verbalization fine-tuning (VFT)—teaching models to say when they're reward hacking—dramatically reducing the rate of undetected hacks (6% vs. baseline of 88%).

milesaturpin's tweet photo. New @Scale_AI paper! 🌟

LLMs trained with RL can exploit reward hacks but not mention this in their CoT. We introduce verbalization fine-tuning (VFT)—teaching models to say when they're reward hacking—dramatically reducing the rate of undetected hacks (6% vs. baseline of 88%). https://t.co/kV0Bb0niIY

9

270

61

132

27K

Sean Hendryx

@SeanHendryx

about 1 year ago

Thanks to coauthors @_jeffda @vaskar_n @anisha_gunjal @Elaine_Lau99 @ManasiSharma_ @XiangDeng1 @clintonjwang @nikhilbarhate99 @TommyMa9 and to @alexandr_wang and @summeryue0 for making this work possible

0

2

1

0

529

Sean Hendryx

@SeanHendryx

about 1 year ago

What will the learning environments of the future look like that train artificial super intelligence? In recent work at @scale_AI , we show that training systems that combine verifiable rewards with multi-agent interaction accelerate learning.

12

130

28

99

24K

Sean Hendryx

@SeanHendryx

about 1 year ago

You can read more about this work in our blog: https://t.co/cze6TcT438 and papers: https://t.co/dIKeDIkdbm https://t.co/RtCUq0KOYd

1

4

1

0

525

SeanHendryx retweeted

Jacob Phillips

@jacob_dphillips

about 1 year ago

We’re entering a new era in robotics where generalized systems are starting to work in the real world, but researchers still don’t have good tools for understanding their data. That’s why I built ARES, an open-source platform for ingesting, annotating, and curating robotics data.

jacob_dphillips's tweet photo. We’re entering a new era in robotics where generalized systems are starting to work in the real world, but researchers still don’t have good tools for understanding their data. That’s why I built ARES, an open-source platform for ingesting, annotating, and curating robotics data. https://t.co/XQ2D2hFTVI

14

156

30

73

61K

Sean Hendryx

@SeanHendryx

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users