Anirudh Vemula @vvanirudh - Twitter Profile

almost 2 years ago

Now that I have started using twitter somewhat regularly, let me take a minute to advertise the RL theory lecture notes I have been developing with Sasha Rakhlin: https://t.co/x16aGvE4tr

canondetortugas's tweet photo. Now that I have started using twitter somewhat regularly, let me take a minute to advertise the RL theory lecture notes I have been developing with Sasha Rakhlin: https://t.co/x16aGvE4tr https://t.co/6phDbj6gJx

6

647

90

444

70K

vvanirudh retweeted

Vaishnavh Nagarajan @_vaishnavh

over 2 years ago

🗣️ “Next-token predictors can’t plan!” ⚔️ “False! Every distribution is expressible as product of next-token probabilities!” 🗣️ In work w/ @GregorBachmann1 , we carefully flesh out this emerging, fragmented debate & articulate a key new failure. 🔴 https://t.co/fLgLAjvIUf

13

395

77

348

55K

Anirudh Vemula @vvanirudh

over 2 years ago

@Wenxuan_Zhou @yufei_ye Congrats Wenxuan!!

0

1

0

129

vvanirudh retweeted

AK

@_akhaliq

almost 3 years ago

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback paper page: https://t.co/vJHkg8dce0 Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used to finetune state-of-the-art large language models (LLMs). Despite this popularity, there has been relatively little public work systematizing its flaws. In this paper, we (1) survey open problems and fundamental limitations of RLHF and related methods; (2) overview techniques to understand, improve, and complement RLHF in practice; and (3) propose auditing and disclosure standards to improve societal oversight of RLHF systems. Our work emphasizes the limitations of RLHF and highlights the importance of a multi-faceted approach to the development of safer AI systems.

_akhaliq's tweet photo. Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

paper page: https://t.co/vJHkg8dce0

Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used to finetune state-of-the-art large language models (LLMs). Despite this popularity, there has been relatively little public work systematizing its flaws. In this paper, we (1) survey open problems and fundamental limitations of RLHF and related methods; (2) overview techniques to understand, improve, and complement RLHF in practice; and (3) propose auditing and disclosure standards to improve societal oversight of RLHF systems. Our work emphasizes the limitations of RLHF and highlights the importance of a multi-faceted approach to the development of safer AI systems.

4

648

144

356

133K

Who to follow

David Held

@davheld

Associate Professor at Carnegie Mellon University | he/him

Adithya Murali

@Adithya_Murali_

Research Scientist at @NVIDIAAI. I work on robots 🤖 | MIT TR35 | Previously PhD at @CMU_Robotics | @berkeley_ai, @AIatMeta

Tabitha Edith Lee

@TabulaRobot

Postdoc at @UMontreal & @Mila_Quebec in causal learning for robots and embodied AI. Prior stops at @CMU_Robotics, @nvidia, LM Space ATC, & Uber ATG.

Anirudh Vemula @vvanirudh

almost 3 years ago

@aravindr93 @philippswu @arjunmajum @kevinleestone @yixin_lin_ @IMordatch @pabbeel @ncklashansen @haosu_twitr @HarryXu12 @xiaolonw Also at ICML this week. We should catch up :)

0

1

0

172

Anirudh Vemula @vvanirudh

almost 3 years ago

@chinganc_rl Also at ICML this week. We should catch up :)

1

0

90

Anirudh Vemula @vvanirudh

almost 3 years ago

On my way to Honolulu to present this work! Hit me up if you want to hike and check out cool beaches B) https://t.co/ENKppPA6Bk

Anirudh Vemula @vvanirudh

about 3 years ago

If this has been a long thread, this can be the only tweet to pay attention to the example figure to understand awesomeness of PDAM. MBPO: O(2^H) computation per iteration, and converges to bad model LAMPS-MM: O(H) computation per iteration and converges to good model

vvanirudh's tweet photo. If this has been a long thread, this can be the only tweet to pay attention to the example figure to understand awesomeness of PDAM.

MBPO: O(2^H) computation per iteration, and converges to bad model

LAMPS-MM: O(H) computation per iteration and converges to good model https://t.co/5uxvjpUJLJ

1

0

862

0

3

0

573

Anirudh Vemula @vvanirudh

almost 3 years ago

@debidatta Why did you tweet this??

0

19

vvanirudh retweeted

Gokul Swamy @g_k_swamy

almost 3 years ago

I'm rarely as excited about a paper as our #ICML2023 paper: we develop an algorithm for doing inverse reinforcement w/o an expensive RL inner loop, providing an *exponential* speedup. Works *extremely* well in practice. Joint work w/ @sanjibac, @zstevenwu, and Drew Bagnell. [1/n]

g_k_swamy's tweet photo. I'm rarely as excited about a paper as our #ICML2023 paper: we develop an algorithm for doing inverse reinforcement w/o an expensive RL inner loop, providing an *exponential* speedup. Works *extremely* well in practice. Joint work w/ @sanjibac, @zstevenwu, and Drew Bagnell. [1/n] https://t.co/WiDkLpWRHL

6

586

93

330

77K

vvanirudh retweeted

Micah Corah @CorahMicah

about 3 years ago

I am delighted to say that I will be joining the Colorado School of Mines @CSatMines 💻🤖 as an Assistant Professor 👨‍🏫 this January! #academia #AcademicTwitter

12

95

5

2

11K

vvanirudh retweeted

Sanjiban Choudhury @sanjibac

about 3 years ago

Why is being laziness a fundamental virtue in both model based RL and IRL? Excited to share our new ICML'23 papers https://t.co/nuqnxMuyes and https://t.co/79pP9SVOn5 that gets at the heart of this question. Check out my talk at my CMU to learn more! https://t.co/ltOQXEFkFT

1

10

1

3

1K

Anirudh Vemula @vvanirudh

about 3 years ago

@rkbanoth Nuvvu unnavu kada Mari atluntadi

1

0

103

Anirudh Vemula @vvanirudh

about 3 years ago

@bremen79 Yes please

0

1

0

26

Anirudh Vemula @vvanirudh

about 3 years ago

@nanjiang_cs @g_k_swamy @yus167 Thanks for pointing out that connection @g_k_swamy Loved it! And thanks @nanjiang_cs def looking forward to catching up again after a long time at ICML

0

1

0

225

Anirudh Vemula @vvanirudh

about 3 years ago

Joint work with my awesome collaborators @yus167, @sanjibac, Aarti Singh, and Drew Bagnell

0

1

0

179

Anirudh Vemula @vvanirudh

about 3 years ago

Our paper on a new (lazy) approach to model-based RL that is both computationally efficient and avoids the objective mismatch problem has been accepted for ICML! Excited to present it at Honolulu this summer! https://t.co/ENKppPA6Bk

1

55

11

17

10K

Anirudh Vemula @vvanirudh

about 3 years ago

If this has been a long thread, this can be the only tweet to pay attention to the example figure to understand awesomeness of PDAM. MBPO: O(2^H) computation per iteration, and converges to bad model LAMPS-MM: O(H) computation per iteration and converges to good model

1

0

862

Anirudh Vemula

@vvanirudh

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users