1/ 🚀 We’re excited to share World Model Self-Distillation (WMSD).
WMSD trains pretrained video generators to solve general tasks from an image + short instruction, without curated task-execution videos.
It combines self-distillation with VLM-feedback RL, letting the Executor learn from a detailed-solution Demonstrator and improve beyond it.
🌐 Project: https://t.co/WneVN1zQ5B
#WorldModels #VideoGeneration #ReinforcementLearning #SelfDistillation #AI #Robotics
8/ Results: WMSD improves task-solving, agent grounding, and consistency across base models.
Even without task-specific supervision, WMSD performs competitively with supervised fine-tuning baselines.
That is important because robotics data is expensive, and scalable task learning needs better alternatives.