Roy Fox (@royf@sigmoid.social) @roydfox - Twitter Profile

roydfox retweeted

over 2 years ago

Excited to share our work, "Skill Set Optimization", a continual learning method for LLM actors that: - Automatically extracts modular subgoals to use as skills - Reinforces skills using environment reward - Facilitates skill retrieval based on state https://t.co/vaSYjVzlB2 🧵

kolbytn's tweet photo. Excited to share our work, "Skill Set Optimization", a continual learning method for LLM actors that:
- Automatically extracts modular subgoals to use as skills
- Reinforces skills using environment reward
- Facilitates skill retrieval based on state
https://t.co/vaSYjVzlB2
🧵 https://t.co/Q2OBToSgCo

1

74

23

42

16K

Roy Fox (@[email protected]) @roydfox

over 3 years ago

it's like how politicians “apologize if anyone misunderstood what I meant”; ChatGPT is a little weasel

1

0

286

Roy Fox (@[email protected]) @roydfox

over 3 years ago

I absolutely love how ChatGPT never actually admits any of its mistakes, instead “apologizing for the confusion”. It's like: “look, one of us seems very confused, but let's not get hung up on who. here's the plan: I won't deny partial responsibility, and you still tip me, k?”

2

7

0

871

roydfox retweeted

Uri Shalit @ShalitUri

over 3 years ago

Are you interested in causality, machine learning and healthcare? Come work with Mihaela van der Schaar (@MihaelaVDS) and me in a joint PhD or postdoc at Cambridge University, UK and the Technion, Israel Contact via email: shalit-lab AT technion ac il

2

142

36

25

35K

Who to follow

Tengyu Ma

@tengyuma

Assistant prof. @ Stanford; Chief AI Scientist @ MongoDB; Former Co-founder/CEO of Voyage AI Working on ML, DL, RL, LLMs, and their theory.

Marc G. Bellemare

@marcgbellemare

Modelling @ Cohere. Ex RL research lead at Google Brain, DeepMind. Textbook author. Co-founder, Reliant AI.

Nan Jiang

@nanjiang_cs

machine learning researcher, with focus on reinforcement learning. assoc prof @ uiuc cs. Course on RL theory (w/ videos): https://t.co/vqVKwY4RJE

Roy Fox (@[email protected]) @roydfox

over 3 years ago

@yoavgo @beenwrekt I think of expertise as having better processed info than most. But in both senses of the word, you need non-experts to convey context, and chat can enable that. Not current magic-chat-systems, for sure – those mostly enable the “imagining” part that makes demos impressive.

1

0

47

Roy Fox (@[email protected]) @roydfox

over 3 years ago

@yoavgo @beenwrekt the value is not in the expertise, it could even be widely known info. it's about retrieving that info in the context of my personal current situation. chat is the best interface to convey this context.

1

0

34

Roy Fox (@[email protected]) @roydfox

over 3 years ago

@yoavgo @beenwrekt medical advice, financial advice, legal advice, to name a few

1

0

39

Roy Fox (@[email protected]) @roydfox

over 3 years ago

@yoavgo @beenwrekt @beenwrekt doesn't think the problem with driving is sensing, but I can't image what sensing looks like when it has nothing to do with remaining issues. Does it embed perceptions informatively and usably? Then the entire problem is solved, no? 3/3

0

23

Roy Fox (@[email protected]) @roydfox

over 3 years ago

@yoavgo @beenwrekt Another answer to the original question: 2022 demos feel potentially more disruptive. It's easier to imagine an infinite list of killer chat-facilitated applications than driving-facilitated ones. Unless the “killer” part is literal. 2/3

2

0

53

Roy Fox (@[email protected]) @roydfox

over 3 years ago

https://t.co/CGyKfVwPIf

0

roydfox retweeted

Noga Zaslavsky @NogaZaslavsky

over 3 years ago

The deadline for our #NeurIPS2022 InfoCog workshop is approaching soon [updated 🗓️: Sep 22]. We expect to have some funding to support a few selected presenters of accepted papers, and a special issue of Open Mind associated with the workshop! More info 👇 https://t.co/oWibOkkXbv

1

6

3

0

roydfox retweeted

Pieter Abbeel

@pabbeel

almost 4 years ago

Very excited for the 2022 edition of the #neurips Deep RL workshop! A few fun changes, see below. Also, submission deadline: Sep 22

0

32

3

5

0

roydfox retweeted

Hugo Larochelle

@hugo_larochelle

almost 4 years ago

We (@BeEngelhardt, @NailaMurray and I) are proud to announce the creation of a Journal-to-Conference track, in collaboration with JMLR and conferences NeurIPS 2022, ICLR 2023 and ICML 2023! https://t.co/DYcKRBDOw1 https://t.co/Er3jLzPXf4 https://t.co/DMfKbMukpd

18

1K

193

73

0

roydfox retweeted

Noga Zaslavsky @NogaZaslavsky

almost 4 years ago

📣 Very excited to announce our in-person #NeurIPS2022 workshop on Information-Theoretic Principles in Cognitive Systems! Check out our lineup of invited speakers and CFP, submit short papers by September 19 https://t.co/oWibOkkplX #InfoCog2022 @NeurIPSConf

1

100

28

18

0

roydfox retweeted

Yevgeni Berzak @whylikethis_

almost 4 years ago

המעבדה שלי בטכניון מחפשת עוזרי.ות מחקר! לפרטים נוספים: https://t.co/yBYSbjzAQI כמו כן, מגייסים סטודנטים.ות לתארים מתקדמים. אל תהססו ליצור קשר! https://t.co/V8Am4R4H2k

0

6

2

0

roydfox retweeted

Stephen McAleer

@McaleerStephen

almost 4 years ago

Policy Space Response Oracles (PSRO) mixes over a population of deep RL policies to approximate a Nash equilibrium, but exploitability can increase from one iteration to the next. We introduce Anytime PSRO which does not increase exploitability. Arxiv: https://t.co/yC88UhDcMH

McaleerStephen's tweet photo. Policy Space Response Oracles (PSRO) mixes over a population of deep RL policies to approximate a Nash equilibrium, but exploitability can increase from one iteration to the next. We introduce Anytime PSRO which does not increase exploitability.

Arxiv: https://t.co/yC88UhDcMH https://t.co/dulSSGJFXh

1

32

7

5

0

roydfox retweeted

Noga Zaslavsky @NogaZaslavsky

over 4 years ago

Come join us tomorrow (Mon) for the #NeurIPS2021 Meaning in Context workshop! We aim to advance human-machine communication by understanding how pragmatic reasoning works in humans and how it can inform AI website: https://t.co/UAQrQlOWvP NeurIPS link: https://t.co/i8ursERsbR

1

34

8

2

0

Roy Fox (@[email protected]) @roydfox

over 5 years ago

This. Our top conferences are "captive regulators" of noteworthy research, and Big ML has subtly shifted their focus to experiments that have low scientific value, high environmental footprint, and — significantly — only they can run. #demeritNeurIPS

Ben Recht @beenwrekt

over 5 years ago

However, the chairs didn’t address the issue of corporate influence on the conference.

1

21

0

3

1

0

Roy Fox (@[email protected]) @roydfox

over 5 years ago

@jackclarkSF Agree that more gov investment in academic AI research would be great, but it won't address root issue. Gov money is free of incentive to profit, but cannot compete w/ industry money while there's huge incentive to profit off AI => must have regulation to stop profitable AI abuse

0

Roy Fox (@[email protected])

@roydfox

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users