๐งต Deli AutoResearch SKILL is now officially open source! ๐
https://t.co/V3lwwdyQm8
Alongside it, weโre dropping our 4th survey paper โ this time on Self-play.
https://t.co/SEb2qoKCI6
Inspired by AlphaZero, we got a powerful insight: prior knowledge doesnโt always lift the ceiling.
Models can discover more globally optimal solutions just by playing against themselves.
The biggest change in this paper?
For the first time, the AutoResearch Agent autonomously planned GPU experiments โ and submitted actual RL runs on the DeepSeek 285B model.
The entire RL pipeline โ experiment design, code writing, running, debugging, and conclusion summarization โ was 100% automated, with zero human intervention from me.
This was incredibly difficult, but an incredibly important step.
https://t.co/kuZZNux5RH
GRPO is the tool being called by the AutoResearch Agent here.
We see this as the beginning of our Continual Learning research journey. ๐
As always, this is my personal research project, unaffiliated with any organization. All views are my own.
#AI #ReinforcementLearning #SelfPlay #OpenSource #AutoML #ContinualLearning #DeepSeek
My deep learning course @unige_en is available on-line. 1000+ slides, ~20h of screen-casts. Full of examples in @PyTorch.
https://t.co/6OVyjPdwrC
And my "Little Book of Deep Learning" is available as a phone-formatted pdf (400k downloads!)
https://t.co/qXni5GZOMT
Earn $25 per Day by completing tasks๐จ
You Just Connect Your X Account to the site and Post about the project at the end of the Day You get upto $25
Group has been created to onboard and Guide you through
If youโre Interested
Retweet this! Tag a fren!
Drop proofs of following Us with notis turned on in the comment,
We'll Send you the link