Greg Farquhar

23 days ago

@pfau Member of Theological Staff?


7 months ago

This was a great project to work on. Happy to have it published now in @Nature! Meta-learning is important.

Junhyuk Oh @junh_oh

7 months ago

Excited to announce that our work on “Discovering state-of-the-art RL algorithms” is finally published in @Nature! In this work, we meta-learned RL algorithms at scale. Paper: https://t.co/3V4TmPTWm4 Blog: https://t.co/G65ReK2iMs See thread 👇



over 1 year ago

@j_foerst In case it's not where you got this, fyi https://t.co/NT8kAJVvlh


almost 5 years ago

There are a bunch of ideas in this paper, but it all fits together really neatly! Great work from @filangelos and team 👏


Angelos Filos @filangelos

almost 5 years ago

There’s huge potential in using ‘demonstrations’ from other agents with different goals: to understand which features & dynamics of the environment *might* be important to you; and to borrow from others' behaviours only where they are useful for you.

almost 5 years ago

👽 PsiPhi-learning 👽 (long talk #ICML) https://t.co/TA7gDtEHak shows how an agent can use data from the behavior of other agents with diverse goals: to infer their intentions and fulfill its own! 🧵


over 5 years ago

@risi1979 Combining Deep Reinforcement Learning and Search for Imperfect-Information Games https://t.co/RB2ptFmwab from @polynoamial @anton_bakhtin et al. kinda has it all -- clarity, insights, theory, great empirical results, code available 👏


almost 6 years ago

@NandoDF And if you want to have children while awaiting your ILR, no recourse for them that I'm aware of :(


almost 6 years ago

@NandoDF Yes, acquiring ILR (settled status is, I think, a similar scheme for EU citizens) takes many years (up to 10 in some cases) and is very expensive. I went through the absurd process myself (German citizen, lived here 15 years), and it would be very hard for less privileged folks.


almost 6 years ago

Permanent damage to generalisation from early updates in non-stationary training -- really enjoyed looking into this intriguing problem and trying to solve it for deep RL agents!

Maximilian Igl @MaxiIgl

almost 6 years ago

Really excited about our new work: In deep RL, we typically collect new data using a non-stationary policy that gets updated as we learn and improve. We show this can impact the learning dynamics of our deep policy and lead to worse generalization https://t.co/1YTfpzDZOd (1/7)


about 6 years ago

This is awesome, but I'm a little scared of how much time I might spend playing it myself...

about 6 years ago

I am proud to announce the release of the NetHack Learning Environment (NLE)! NetHack is an extremely difficult procedurally-generated grid-world dungeon-crawl game that strikes a great balance between complexity and speed for single-agent reinforcement learning research. 1/


about 6 years ago

I particularly enjoyed visualising & analysing the learned mixing functions that combine per-agent utilities into joint values!


Mikayel Samvelyan

@_samvelyan

about 6 years ago

Happy to share the extended version of our #QMIX paper “Monotonic Value Function Factorisation for Deep Multi-Agent RL” We include further analysis and ablation studies that investigate how monotonic factorisation of joint Q-val helps QMIX outperform VDN https://t.co/AGGADZgumu


over 6 years ago

Potential for cool applications in meta-learning, multi-agent learning, etc. If you have ideas or want to chat, let me know or find me at NeurIPS 😀


over 6 years ago

A much-improved 🎲Loaded DiCE🎲 objective lets you easily compute low-variance estimators of any-order derivatives for RL. Paper https://t.co/dllhrHuzwD and code https://t.co/NqZsdZy3iT online, nice working with @shimon8282 and @j_foerst! #NeurIPS2019


greg_far retweeted

Noam Brown

@polynoamial

almost 7 years ago

Tuomas Sandholm and I are doing a Reddit AMA now on the #Pluribus poker AI! https://t.co/qOnCXFSJwe


almost 7 years ago

AI accelerates by 10x in the hour it takes to repost from r/machinelearning to r/singularityisnear... just how near is it at that rate?? 😱


almost 7 years ago

Progressively growing the action space creates a great curriculum for learning agents -- check out our paper: https://t.co/YoKe9ZIjhk + code: https://t.co/BdZjplNNEg. Great working with Laura Gustafson @ebetica @shimon8282 Nicolas Usunier @syhw


greg_far retweeted

about 7 years ago

How can RL agents exploit the compositional, relational and hierarchical structure of the world? A growing number of authors propose learning from natural language. We are excited to share our @IJCAIconf survey of this emerging field! https://t.co/XLHnXMQbVY TL;DR:🤖+📖=📈🎯🏆🥳


greg_far retweeted