🧵 Everyone is chasing new diffusion models—but what about the representations they model from?
We introduce Discrete Latent Codes (DLCs):
- Discrete representation for diffusion models
- Uncond. gen. SOTA FID (1.59 on ImageNet)
- Compositional generation
- Integrates with LLM
🧱
I rambled somewhat-coherently on a podcast about discrete representations for diffusion models
(https://t.co/wzHBfX2eUK) and recent MARL work solving AI social dilemmas
(https://t.co/hgiqkUFHp9)
Thank you @joycech3n for the great convo and laughing at my one mediocre joke
Olmo 3.1: even more RL = even more RL-Zero!
@saurabh_shah2 and I tweaked some hyperparams and prompts, @hamishivi and @finbarrtimbers improved the code and boom!
New Olmo 3.1 RL-Zero 👾 An updated, solid baseline for your RL and reasoning research
New preprint! Learning Robust Social Strategies with Large Language Models. We apply multi-agent RL finetuning to train LLMs that achieve cooperative and non-exploitable behavior in social dilemmas for the first time.
📄 https://t.co/lMKxJ4XoBx
🧵 ⬇️
(1/8)
Zero rewards after tons of RL training? 😞 Before using dense rewards or incentivizing exploration, try changing the data. Adding easier instances of the task can unlock RL training. 🔓📈To know more checkout our blog post here: https://t.co/BPErVcLmP8. Keep reading 🧵(1/n)
🧵 Everyone is chasing new diffusion models—but what about the representations they model from?
We introduce Discrete Latent Codes (DLCs):
- Discrete representation for diffusion models
- Uncond. gen. SOTA FID (1.59 on ImageNet)
- Compositional generation
- Integrates with LLM
🧱
I'm looking for reviewers for TMLR around geometry-aware molecular generation that focuses on order-agnostic autoregressive modeling, if you have expertise in geometric deep learning or generative AI for drug discovery, please let me know! 🧬
Proud to have contributed, as an independent researcher, to the great work achieved by our research team.
Many thanks to @johanobandoc , @lavoiems , Scott Fujimoto, @AaronCourville and @pcastr for making it happen.
See Pablo’s thread for more details.
1/3 🥳Excited to share our new paper ‘Simplicial Embeddings Improve Sample Efficiency in Actor–Critic Agents’!
Project your features onto a product of simplices → sparse, stable reps, stronger grads, faster learning.
🧵For more details, check out Pablo’s thread 👇
🔊Simplicial Embeddings (SEMs) Improve Sample Efficiency in Actor-Critic Agents🔊
In our recent preprint we demonstrate that the use of well-structured representations (SEMs) can dramatically improve sample efficiency in RL agents.
1/X
🔊Simplicial Embeddings (SEMs) Improve Sample Efficiency in Actor-Critic Agents🔊
In our recent preprint we demonstrate that the use of well-structured representations (SEMs) can dramatically improve sample efficiency in RL agents.
1/X