Simple deep RL was thought to fail in imperfect-info games like poker.
A new ICLR 2026 paper shows that with proper tuning, generic methods like PPO match or beat specialized approaches like fictitious play and counterfactual regret minimization.
https://t.co/RvkZ6XxR6J
1/2
Two BIG updates for the RLBrew Workshop at #RLC2026! 📣
1️⃣ Dual submissions are welcome
2️⃣ We’ll be awarding a Best Paper RLBrew Award 🏆
You have 2 DAYS LEFT to submit — deadline: May 29!
Details: https://t.co/segLTne6Tp
Reminder! RLBRew deadline in coming up in 7 days! Submit your works soon👩💻
Reminder that we accept under review papers! This is a good place to discuss your ideas and get feedback from the community
Come say hi at my @iclr_conf poster (Friday 10:30am, Pavilion 3 #316) to chat about prediction sets or why @PlantNetProject is my favorite plant identification app!
We ran thousands of sweeps to compare RL algos for imperfect information games and found preliminary evidence for the Policy Gradient Hypothesis:
With proper tuning, generic PG (PPO, etc.) methods are highly competitive in IIGS.
Check out the full paper: https://t.co/Cw4krtwVWe
🔥 Thrilled to announce the Continual Reinforcement Learning (CRL) Workshop @continual_learn, at
@RL_Conference 2026 in Montreal, Canada!
📣 We welcome submissions on broad topics of continual RL. Interested in submitting or reviewing? Check out our website for more details!
@RL_Conference is one of my favorite conferences to submit, go, and review. I suspect it will be my favorite to host a workshop too!
Great speaker lineup, submit your work!
Scalar, well-defined, easy-to-optimize rewards aren’t always available in real-world interaction data–yet that data is crucial for scaling general-purpose agents. Excited to announce the 3rd edition of RLBRew: Towards Scalable General-Purpose Agents at @RL_Conference 2026!
Have seen the most thoughtful reviews so far in RLC. Watching the review quality go ⬇️ and conference size ⬆️at ICLR, Neurips past years they seem like the wrong venue to facilitate discussions on RL. Consider submitting!
My RL team at @amazon NYC is looking for summer 2026 PhD interns! We apply RL to Amazon's supply chain and do publishable, open-ended research in meta learning, multi-agent RL, constrained RL, exploration, and LLMs+RL. Interested? Email your CV to [email protected] by Nov 30th!
Announcing RLZero for Generalist Agents at #NeurIPS2025. To our knowledge, the first to enable all of:
💬 Language → behavior (zero-shot)
🎥 Video → behavior (zero-shot, cross-embodiment)
🧠 One Behavioral Foundation Model for many tasks
From instructions & demos to actions—no fine-tuning/test-time learning. 🚀 Below are examples what RLZero can do for humanoids:
Intelligent humanoids should have the ability to quickly adapt to new tasks by observing humans
Why is such adaptability important?
🌍 Real-world diversity is hard to fully capture in advance
🧠 Adaptability is central to natural intelligence
We present MimicDroid 👇
🌐 https://t.co/J8XpND9j1j
@danfei_xu to your point, I think what has made the Jet Propulsion Lab so good at space robotics is their heavy and meticulous focus on systems engineering.