James

@jmac_ai

Ask me about #ReinforcementLearning #AI research @SonyAI_global RL for games, robotics, and other real-world applications Views and tweets are my own.

Joined March 2012

578 Following

779 Followers

5.5K Posts

Pinned Tweet

over 1 year ago

Reinforcement learning in #AI is hard, so I’ve made a website to collect answers I’ve given to common RL questions. It's named Decisions & Dragons. It’s launching with 8 questions and answers, but I will add to it in the future. A 🧵to give a preview with the link below.

jmac_ai's tweet photo. Reinforcement learning in #AI is hard, so I’ve made a website to collect answers I’ve given to common RL questions.

It's named Decisions & Dragons. It’s launching with 8 questions and answers, but I will add to it in the future.

A 🧵to give a preview with the link below. https://t.co/wtralbGySg

5

251

33

299

24K

jmac_ai retweeted

Governor Tim Walz @GovTimWalz

5 months ago

I’ve seen the video. Don’t believe this propaganda machine. The state will ensure there is a full, fair, and expeditious investigation to ensure accountability and justice.

33K

228K

27K

5K

12M

11 months ago

@seohong_park Link to GT Sophy work we did at Sony AI: https://t.co/1XbpIxXibE

0

0

0

0

80

11 months ago

@seohong_park Great post! I'll note that our work on GT is a real-world off-policy RL success. (It's in the game!) However, I share a similar conclusion. Off-policy Q-learning is brittle. I think MBRL where you learn from off-policy data but optimize on-policy in the model is more promising.

2

3

0

1

569

Who to follow

Joseph Suarez 🐡

Verified account

I build sane open-source RL tools. MIT PhD, creator of Neural MMO and founder of PufferAI. DM for business: non-LLM sim engineering, RL R&D, infra & support.

Stefano V. Albrecht

Research in AI and machine learning for autonomous systems. MIT Press textbook: https://t.co/TlgjB3qF5U Cambridge Press book: https://t.co/KP3KAU8VAZ

Verified account

Australian living in the US | building parallel @p0

about 1 year ago

@bimald @coolkoon @GaryMarcus I’m not going to tell you LLMs will have no impact on coding. For simple things it can give non-programmers more flexibility. However, it’s not like chess, because the limitations if English in specifying behavior is an inherent bottleneck, where no such thing exists for chess

0

1

0

0

88

about 1 year ago

@bimald @coolkoon @GaryMarcus I think you misunderstand the nature of code. Code isn’t for computers, it’s for people. English is *bad* at specifying clear behavior. In fact, regularly in design discussions, we move to writing bits of code because it’s more clear than words.

2

1

0

0

159

about 1 year ago

@kevinroose AI Expert here. First, a great many experts do not believe in the extreme hype/doom & the loudest hypers/doomers are usually not AI *scientists*. Second, unlike climate change, the hype/doom is not based on a scientific model. It is speculative rhetoric. They are not the same

0

1

0

0

68

over 1 year ago

I was *not* expecting vocals for this song to go so hard. Absolutely incredible vocal power and control. Pernelle needs to be much more popular.

Pernelle. 🦚 Suteki Da Ne @PernelleMusic

over 1 year ago

BOW WOW WOW, had the pleasure to try my voice on rock music with @huskybythegeek and I love it! Happy late PC release to Final Fantasy VII Rebirth 🐶 #FF7Rebirth

8

183

37

8

10K

0

2

0

0

309

jmac_ai retweeted

Pernelle. 🦚 Suteki Da Ne @PernelleMusic

over 1 year ago

BOW WOW WOW, had the pleasure to try my voice on rock music with @huskybythegeek and I love it! Happy late PC release to Final Fantasy VII Rebirth 🐶 #FF7Rebirth

8

183

37

8

10K

over 1 year ago

@chrisprucha @bradneuberg The bigger limiter has always been that you always need to do experiments in the real world to advance, and that means both resources and contending with the speed of reality. Science is necessary, not a kludge.

1

4

1

0

267

over 1 year ago

@SenMastriano Resign. You are unfit for a booster seat.

0

0

0

0

21

over 1 year ago

@agrimgupta92 Very impressive. Although it feels like this person is moments away from cutting off their finger and it’s giving me an anxiety attack :p

0

1

0

0

1K

over 1 year ago

@rao2z @natolambert I think this may be a misunderstanding of the bitter lesson. Rich Sutton is very focused on online learning and not pretraining, to a fault.

0

0

0

0

62

over 1 year ago

@Intrinsic29 I don't think I've ever seen a non-doomer say "Some things are impossible, therefore AI cannot be a threat." It's always been a reaction to magical thinking where a doomer will literally be equating a future AI with a "god."

1

2

0

0

35

over 1 year ago

@Intrinsic29 Often, if a non-doomer brings up limits, its because a doomer presented an argument assuming no limits. Doomers regularly hand wave "of course the AI finds a way to do x because it's superintelligent." You can't automatically conclude that without making magical assumptions.

1

2

0

0

38

over 1 year ago

@scrumtuous @clattner_llvm @tsoding @Modular Lot's of reasons, but here's a simple one if you're actually skeptical: Modular's twitter account retweets that account.

0

0

0

0

10

over 1 year ago

@scrumtuous @clattner_llvm @tsoding @Modular You know you're responding to Chris Lattner, right? :p

1

1

0

0

69

over 1 year ago

@Miles_Brundage I've been all aboard the "we can solve the AI problem" train my entire adult life. But claiming it's not hard shows a lack of respect for the challenging problem we've tackled and is falling victim to hype instead of science.

0

2

0

0

54

over 1 year ago

@Miles_Brundage What unwillingness? The timeless pattern is AI researchers thinking it will be solved soon and being wrong every time. The founders of the field thought they'd solve much of the problems in a few months with a small team. There is far too much willingness to believe it s easy.

1

4

1

0

197

over 1 year ago

@ErbunnNinja @Intrinsic29 An an expert in decision-making agents, you are entirely incorrect.

1

0

0

0

33

over 1 year ago

@ErbunnNinja @Intrinsic29 There will be a world of difference. People who use the word god end up confusing themselves precisely because they're assuming the difference. This is a scientific and technical issue. The word "god" has no value in discourse of it and can -- and does -- confuse matters.

1

1

0

0

39

Last Seen Users on Sotwe

Trends for you

Most Popular Users