Changyu Chen

@Cameron_Chann

reward hacking for human-ai collaboration @StanfordNLP

Palo Alto, CA

Joined May 2020

346 Following

596 Followers

234 Posts

Pinned Tweet

Changyu Chen

@Cameron_Chann

about 1 year ago

(1/3) My favorite figure from the paper. Nearly all open-source RL frameworks introduce an unintentional bias when computing the masked mean 😮. The fix? Just replace mask.sum with a constant. https://t.co/nODAcPa9bf
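A minimal sketch of one way to read the masked-mean point, not the paper's code. The helper `token_weights`, the toy mask, and the constant 4 (standing in for something like a maximum response length) are all assumptions made for illustration. It shows that dividing by each row's own mask sum gives tokens in short responses more weight than tokens in long ones, while dividing by a fixed constant weights every token equally.

```python
def token_weights(mask, normalizer=None):
    """Return the weight each token gets in its row's masked mean.

    normalizer=None divides by the row's own mask sum (length-dependent);
    a number divides by that fixed constant instead (hypothetical choice).
    """
    weights = []
    for row in mask:
        denom = sum(row) if normalizer is None else normalizer
        weights.append([m / denom for m in row])
    return weights


# Two toy responses padded to length 4: the first has 2 real tokens, the second 4.
mask = [[1, 1, 0, 0],
        [1, 1, 1, 1]]

by_mask_sum = token_weights(mask)                # short row: 0.5 per token, long row: 0.25
by_constant = token_weights(mask, normalizer=4)  # every real token: 0.25
```

Which constant is appropriate in a real training setup is not stated in the post, so the value here is only a placeholder.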

178

165

41K

Cameron_Chann retweeted

Vishakh Padmakumar

@vishakh_pk

about 15 hours ago

People are increasingly worried that AI tools make us overreliant. But how do we actually measure this? We introduce Offloading Score, a measure of reliance based on the fraction of cognitive effort offloaded to AI while completing a task. In a controlled user study, Offloading Score detects increased reliance under time pressure, while several common alternatives do not. (1/9)

102

14K

Changyu Chen

@Cameron_Chann

7 days ago

@rronak_ @MichaelElabd @QuantumArjun this is amazing! Trajectory is instantiating one way I’ve been thinking about learning from experience. big congrats on the launch @rronak_ @MichaelElabd @QuantumArjun !!

218

Cameron_Chann retweeted

Yijia Shao

@EchoShao8899

9 days ago

🔴 LIVE this Thursday, May 28th | 6–7PM PST @augmind_fm goes live with @cjziems @dorazhao9, and @Diyi_Yang to discuss their recent paper and the classroom experiment behind it. → Does AI make us happier? → What do we need from LLMs? → How do we reinvent the classroom? Live paper discussion + Q&A as well! Live stream link: https://t.co/SyZOwBfFM6 Use this link to mark your calendar: https://t.co/yFzYoAfXsn?

Cameron_Chann retweeted

Diyi Yang

@Diyi_Yang

15 days ago

The next frontier of AI is not only more capable models; it is an AI that *humans* can meaningfully live and work with :) With all students in my cs329x Human-Centered LLM class, we present 60+ pages of insights for developing Human-Centered LLMs (HCLLMs), from design & data sourcing to training, eval & deployment 🧵

289

180

53K

Cameron_Chann retweeted

Dimitris Papailiopoulos

@DimitrisPapail

17 days ago

https://t.co/n10GwfKYuY

974

125

848K

Changyu Chen

@Cameron_Chann

19 days ago

@KushaSareen @rish2k1 @agarwl_ @Devvrit_Khatri @LakshyAAAgrawal @inderjit_ml @profjoeyg @KurtKeutzer Thank you! it makes sense. looking forward to the code and will dive into the details with it

Changyu Chen

@Cameron_Chann

20 days ago

This is super cool! I like the way the teacher's behavior is being steered. A quick question: I feel the same spike-aware reward design and surprisal gate would apply to the opsd setting, by replacing the vanilla reward function with your proposal. How do these two approaches compare?

743

Cameron_Chann retweeted

Lujain Ibrahim

@lujainmibrahim

21 days ago

New preprint! In 5 studies (3k+ users / 12k+ convs, with a 3-wk longitudinal study), we find that sycophantic AI influences how people view those closest to them. It affects how effortful human interaction seems, how satisfying it is, & who people want to turn to for advice 🧵 https://t.co/tNR1wv7Fpj

172

58K

Changyu Chen

@Cameron_Chann

21 days ago

@Haonan_Wang_ Thanks Haonan!

Changyu Chen

@Cameron_Chann

21 days ago

@ChengleiSi @tydsh @CaimingXiong Congrats chenglei!!

123

Changyu Chen

@Cameron_Chann

22 days ago

@web3nomad @Stanford @Diyi_Yang yeah, highly agree! The reward function (more broadly, the env) has been a critical component of RL research. One question that excites me is which reward design benefits collaboration in the long run.

Changyu Chen

@Cameron_Chann

22 days ago

Life update: I'm super excited to join @Stanford as a postdoc working with @Diyi_Yang ! I’ll continue my research on RL, and recently I’ve become especially interested in how RL can contribute to human-AI collaboration and collaborative agents. A new chapter begins, from the sunny island to the sunny state ☀️🏝️

199

16K