Reinforcement learning in #AI is hard, so I’ve made a website to collect answers I’ve given to common RL questions.
It's named Decisions & Dragons. It’s launching with 8 questions and answers, but I will add to it in the future.
A 🧵to give a preview with the link below.
I’ve seen the video.
Don’t believe this propaganda machine.
The state will ensure there is a full, fair, and expeditious investigation to ensure accountability and justice.
@seohong_park Great post! I'll note that our work on GT is a real-world off-policy RL success. (It's in the game!)
However, I share a similar conclusion. Off-policy Q-learning is brittle. I think MBRL where you learn from off-policy data but optimize on-policy in the model is more promising.
@bimald@coolkoon@GaryMarcus I’m not going to tell you LLMs will have no impact on coding. For simple things it can give non-programmers more flexibility.
However, it’s not like chess, because the limitations if English in specifying behavior is an inherent bottleneck, where no such thing exists for chess
@bimald@coolkoon@GaryMarcus I think you misunderstand the nature of code. Code isn’t for computers, it’s for people. English is *bad* at specifying clear behavior. In fact, regularly in design discussions, we move to writing bits of code because it’s more clear than words.
@kevinroose AI Expert here.
First, a great many experts do not believe in the extreme hype/doom & the loudest hypers/doomers are usually not AI *scientists*.
Second, unlike climate change, the hype/doom is not based on a scientific model. It is speculative rhetoric.
They are not the same
BOW WOW WOW, had the pleasure to try my voice on rock music with @huskybythegeek and I love it!
Happy late PC release to Final Fantasy VII Rebirth 🐶
#FF7Rebirth
BOW WOW WOW, had the pleasure to try my voice on rock music with @huskybythegeek and I love it!
Happy late PC release to Final Fantasy VII Rebirth 🐶
#FF7Rebirth
@chrisprucha@bradneuberg The bigger limiter has always been that you always need to do experiments in the real world to advance, and that means both resources and contending with the speed of reality.
Science is necessary, not a kludge.
@agrimgupta92 Very impressive. Although it feels like this person is moments away from cutting off their finger and it’s giving me an anxiety attack :p
@rao2z@natolambert I think this may be a misunderstanding of the bitter lesson. Rich Sutton is very focused on online learning and not pretraining, to a fault.
@Intrinsic29 I don't think I've ever seen a non-doomer say "Some things are impossible, therefore AI cannot be a threat."
It's always been a reaction to magical thinking where a doomer will literally be equating a future AI with a "god."
@Intrinsic29 Often, if a non-doomer brings up limits, its because a doomer presented an argument assuming no limits.
Doomers regularly hand wave "of course the AI finds a way to do x because it's superintelligent." You can't automatically conclude that without making magical assumptions.
@Miles_Brundage I've been all aboard the "we can solve the AI problem" train my entire adult life. But claiming it's not hard shows a lack of respect for the challenging problem we've tackled and is falling victim to hype instead of science.
@Miles_Brundage What unwillingness? The timeless pattern is AI researchers thinking it will be solved soon and being wrong every time.
The founders of the field thought they'd solve much of the problems in a few months with a small team.
There is far too much willingness to believe it s easy.
@ErbunnNinja@Intrinsic29 There will be a world of difference. People who use the word god end up confusing themselves precisely because they're assuming the difference.
This is a scientific and technical issue. The word "god" has no value in discourse of it and can -- and does -- confuse matters.