Junhyuk Oh @junh_oh - Twitter Profile

7 months ago

Work done with the amazing team: @iurii_kemaev [tech lead], @greg_far, @dancalian, @luisa_zintgraf, @matteohessel, Satinder Singh, @hado, David Silver Code & weights: https://t.co/Zm3kem2SOl Paper: https://t.co/3V4TmPTWm4

1

21

0

8

1K

Junhyuk Oh @junh_oh

7 months ago

Excited to announce that our work on “Discovering state-of-the-art RL algorithms” is finally published in @Nature! In this work, we meta-learned RL algorithms at scale. Paper: https://t.co/3V4TmPTWm4 Blog: https://t.co/G65ReK2iMs See thread 👇

14

475

85

297

73K

Junhyuk Oh @junh_oh

7 months ago

What did it discover? DiscoRL discovered entirely new prediction semantics. While humans rely on concepts like "value functions," DiscoRL learned to predict salient future events - like high rewards or changes in policy entropy - that complement traditional RL concepts.

junh_oh's tweet photo. What did it discover? DiscoRL discovered entirely new prediction semantics.

While humans rely on concepts like "value functions," DiscoRL learned to predict salient future events - like high rewards or changes in policy entropy - that complement traditional RL concepts. https://t.co/mzy9tg2Bim

1

27

0

6

2K

junh_oh retweeted

Luisa Zintgraf @luisa_zintgraf

8 months ago

Excited to share our new paper, "DataRater: Meta-Learned Dataset Curation"! We explore a fundamental question: How can we *automatically* learn which data is most valuable for training foundation models? Paper: https://t.co/N2ozU2RXWb to appear @NeurIPSConf Thread 👇

11

327

55

237

131K

Who to follow

Sarath Chandar

@apsarathchandar

Associate Professor @polymtl and @Mila_Quebec; Canada CIFAR AI Chair; Machine Learning Researcher. Pro-bono office hours: https://t.co/tK69DKRf9N?amp=1

Devendra Chaplot

@dchaplot

Building superintelligence @xai

Nan Jiang

@nanjiang_cs

machine learning researcher, with focus on reinforcement learning. assoc prof @ uiuc cs. Course on RL theory (w/ videos): https://t.co/vqVKwY4RJE

Junhyuk Oh @junh_oh

over 5 years ago

I will be briefly talking about how I used JAX to implement my recent #NeurIPS2020 work on Discovering RL Algorithms (https://t.co/Pc74kYMkVo). Stop by our livestream if you are interested. :)

Google DeepMind @GoogleDeepMind

over 5 years ago

Join our team at the #NeurIPS2020 JAX Ecosystem meet up to learn more about JAX and why it's effective for research in reinforcement learning, GANs, meta-gradients and more. Today at 11am PST / 2pm ET/ 7pm GMT https://t.co/ORApnkq7pC (calendar invite)

6

107

29

15

0

16

1

0

junh_oh retweeted

Marc G. Bellemare @marcgbellemare

over 5 years ago

Our most recent work is out in Nature! We're reporting on (reinforcement) learning to navigate Loon stratospheric balloons and minimizing the sim2real gap. Results from a 39-day Pacific Ocean experiment show RL keeps its strong lead in real conditions. https://t.co/jBbCABc3pP

21

729

95

74

0

junh_oh retweeted

Google DeepMind @GoogleDeepMind

over 5 years ago

In a major scientific breakthrough, the latest version of #AlphaFold has been recognised as a solution to one of biology's grand challenges - the “protein folding problem”. It was validated today at #CASP14, the biennial Critical Assessment of protein Structure Prediction (1/3)

119

10K

2K

270

0

junh_oh retweeted

Berkeley AI Research

@berkeley_ai

over 7 years ago

CfP for the @iclr2019 workshop on structure and priors in reinforcement learning (SPiRL), deadline 3/7! https://t.co/KAl7Lp1tMy

1

26

4

2

0

junh_oh retweeted

Pablo Samuel Castro @pcastr

over 7 years ago

really happy to announce the next version of our #RL framework: Dopamine 2.0! beyond atari: now we support general discrete-domain gym environments. we've been using this internally for our research and it allows us to test out new ideas very quickly. try it out!

4

127

29

10

0

junh_oh retweeted

Google DeepMind @GoogleDeepMind

over 7 years ago

Join us and @Blizzard_Ent this Thursday at 6:00pm GMT for an exciting #StarCraft demonstration, hosted by @Artosis and @RotterdaM08! Livestream on YouTube: https://t.co/lQytLEsT0o Read more about #StarCraft2 as an environment for AI research: https://t.co/TSUdS9vttG

GoogleDeepMind's tweet photo. Join us and @Blizzard_Ent this Thursday at 6:00pm GMT for an exciting #StarCraft demonstration, hosted by @Artosis and @RotterdaM08!

Livestream on YouTube: https://t.co/lQytLEsT0o

Read more about #StarCraft2 as an environment for AI research: https://t.co/TSUdS9vttG https://t.co/Eztc5Bro5Y

62

2K

881

43

0

junh_oh retweeted

Demis Hassabis

@demishassabis

over 7 years ago

Delighted to welcome reinforcement learning pioneer Satinder Singh to @DeepMindAI. He’ll bring some incredible experience to the team and I'm really looking forward to working with him!

4

535

57

4

0

junh_oh retweeted

Vincent François-Lavet @VinFL

over 7 years ago

Excited to share a quite extensive introduction to deep reinforcement learning! With @astro_pyotr @riashatislam @marcgbellemare and Joelle Pineau, we hope it will be useful to the community. Print version available at #NeurIPS! https://t.co/IbGAbNX8do

3

250

87

34

0

junh_oh retweeted

Miles Brundage

@Miles_Brundage

about 8 years ago

"Self-Imitation Learning," Oh and Guo et al.: https://t.co/ryPW9Qf0A2 Imitating past good experiences in the replay buffer leads to big improvements over A2C, PPO, inc. good Montezuma performance in fewer frames than prior approaches

2

57

19

5

0

junh_oh retweeted

Pieter Abbeel

@pabbeel

over 8 years ago

NIPS Deep RL Symposium Schedule now available: https://t.co/a2UEG3v7lf includes over 70 contributed papers/posters, and invited talks by David Silver, Joelle Pineau, Ruslan @rsalakhu , Ben Van Roy, Michael Bowling. Thursday 12/7 @NipsConference

2

166

54

0