📢 Introducing TorchRL Data Hub, the most comprehensive single-format data repository for offline RL. With its unified format, swapping or merging datasets is a breeze. It includes: OpenX Embodiment, Roboset, Minari, (V-)D4RL, GenDGRL, AtariDQN, OpenML. https://t.co/XdLBLFVzvN
torchrl 0.13 is out!
It ships a lot of deep optimizations, including scan- and Triton-based RNNs that support intermediate resets (=> padding-free), plus CUDA-based prioritized replay buffers
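For anyone wondering why intermediate resets make padding unnecessary, here is a minimal sketch. It is not TorchRL's implementation: it uses a toy scalar recurrence in plain Python, and `run_rnn`, `w_in`, `w_rec` and the `is_init` flag list are hypothetical names chosen for illustration. The idea is that trajectories of different lengths can be packed back to back, as long as the hidden state is zeroed wherever a new trajectory starts.

```python
import math


def run_rnn(xs, is_init, w_in=0.5, w_rec=0.9):
    """Toy scalar RNN over a flat sequence (hypothetical helper).

    is_init[t] is True when step t is the first step of a trajectory,
    in which case the hidden state is reset before the update.
    """
    h = 0.0
    outputs = []
    for x, reset in zip(xs, is_init):
        if reset:
            h = 0.0  # intermediate reset: forget the previous trajectory
        h = math.tanh(w_in * x + w_rec * h)
        outputs.append(h)
    return outputs


# Two trajectories of different lengths, packed back to back with no padding.
traj_a = [1.0, 2.0, 3.0]
traj_b = [4.0, 5.0]
packed = traj_a + traj_b
flags = [True, False, False, True, False]

packed_out = run_rnn(packed, flags)

# Reference: run each trajectory on its own and concatenate the results.
separate_out = run_rnn(traj_a, [True, False, False]) + run_rnn(traj_b, [True, False])

print(packed_out == separate_out)
```

The packed run and the per-trajectory runs perform the same operations in the same order, so no padded batch is needed to keep trajectories from leaking state into each other.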
🚀 TorchRL 0.9 is here and it's LLM ready!
This major release brings comprehensive LLM fine-tuning capabilities to our RL ecosystem. We're enabling LLM post-training with your favorite RL tools like replay buffers and data collectors.
🧵 Thread below 👇
📚 What's included in 0.9:
-> Unified wrappers for Hugging Face and vLLM models
-> Complete GRPO implementation with multi-GPU support
-> Specialized objectives: GRPO and SFT loss functions optimized for LLMs
-> Comprehensive docs and tutorials
Hi imitation learning friends, I am excited to introduce CleanIL (https://t.co/jacC3COKXZ), a repo of high-quality single-file implementations of imitation learning and inverse RL algos, inspired by CleanRL and built on @torchrl1.
We're looking for brave OSS contributors to help us scale this up! We need:
- an asyncio API
- Ray support
- support for transformed async environments
Please reach out if you're willing to help!
A few months ago we quietly open-sourced a @PyTorch video decoding library called torchcodec -- small, nimble, fast.
The @LeRobotHF folks had some nice things to say about it.