The ChatGPT of robotics needs both a model (GPT) AND an interaction method (“chat”) to let users steer robot policies online, post-training. The multimodality in diffusion models is great for capturing diverse behaviors, but users need control at inference time!
@felixwyw & team’s work at #ICRA2025 presents a method to steer pre-trained policies during generation. Check out https://t.co/5oZDhS7Vxs for all the results!
Want your robot to clean the kitchen your way? 🧹✨
🔗https://t.co/5ym8VIu4hS"
Introducing Inference-Time Policy Steering: a training-free method that lets you specify where and how to manipulate objects, so you can guide non-interactive policies to align with your preferences!
GEN-1 blog has a passage on alignment: pre-trained models often seem super eager, almost restless, to try many different motions. Steering them toward user preferences is what turns raw intelligence into useful execution.
Prediction: As we enter the pre-training era for robotics, we’ll see a lot more alignment and interpretability research on these mysterious models.
🧠Reasoning VLAs generate textual plans 💬 via CoT before predicting actions 🦾. But do the predicted actions actually follow the plans? Introducing SEAL: a training-free runtime steering method that ensures VLAs "do what they say". Website: https://t.co/MzU2mRvWMv (1/9)
Excited to invite you to our #RSS2025 Workshop at RTH 109, where we’ll explore the frontier of Generative Models × Human–Robot Interaction! 🤖✨ Organizing with @robominyoung@sammy_j_c@haroldsoh@andreea7b@julie_a_shah
https://t.co/oD7PPTmqEJ
9:00 AM – Workshop Intro • @robominyoung
9:10 AM – Visual Embodied Planning • @RoozbehMottaghi
9:40 AM – Simulating People with Language Models—Good Idea? Bad Idea? • @ybisk
10:10 AM – Steerable & Interactive Robots in the Era of Large Models • @DorsaSadigh
10:40 AM – ☕ Coffee Break & Poster Session
11:30 AM – All Generative Models Are Wrong, But Some Are Useful • @robo_kween
12:00 PM – Panel 1: Are LLMs, VLMs, VLAs Robust Enough for Simulation & Interaction? Yonatan Bisk, Nadia Figueroa, Roozbeh Mottaghi, moderated by @CPDArobotics
12:30 PM – Lunch
1:30 PM – Stakeholder-Informed Physical Assistance: Leveraging Generative AI towards Real-World Caregiving Robots • @TapoBhat
2:00 PM – Individualized Models of Human Control in Assistive Robotics • @brenna_argall
2:30 PM – Spotlight Talks
3:00 PM – Demonstrating Arena 5.0 • @kastner_linh
3:30 PM – ☕ Coffee Break
4:10 PM – Panel 2: Foundation Models for Shared Autonomy & Assistive Robots. @V_Vanhoucke , Brenna Argall, Tapomayukh Bhattacharjee, moderated by @haroldsoh
5:00 PM – Best Paper Award & Wrap-Up
The #ICRA2025 DEBATE “Data will Solve Robotics and Automation: True or False?” was especially fun and insightful—watch and listen carefully! Aude Billard, @animesh_garg, Leslie Kaelbling, Frank Park, Daniela Rus, @RussTedrake moderated by @Ken_Goldberg
Robot social navigation in the wild, as seen in the airport on my way to our Social Navigation Workshop @ #ICRA2025 Attend on Friday, Room 313, schedule here: https://t.co/xA8vS0P49g
🚀 Join us at #ICRA2025 for an exciting workshop on Advances in Social Navigation! Share your latest projects in socially-aware robot navigation or participate in our competition. Papers due March 24th! https://t.co/h3fnuF1qPB
I love keynotes – every second is a meticulously crafted blend of cutting-edge technology and the art of presenting it, with everything computed and simulated on GPUs. Hear about the new models and the future of AI:
Watch Jensen Huang unveil the latest breakthroughs in AI, robotics, accelerated computing, and more in the #GTC25 keynote.
Now live ⬇️ https://t.co/NGOFFQFYSe
New research could enable users to correct a robot’s behavior with simple interactions, like pointing to an item or nudging a robot’s arm. This intuitive approach would allow a layperson to guide a robot right out of the box, no machine-learning required. https://t.co/LEke2dnpuj
In this example, a user provides a 3D sketch (green) as input. Unconditional rollouts first miss the left block, whereas policy steering helps the model explore its learned manifold to generate a successful trajectory towards the user’s goal.
The ChatGPT of robotics needs both a model (GPT) AND an interaction method (“chat”) to let users steer robot policies online, post-training. The multimodality in diffusion models is great for capturing diverse behaviors, but users need control at inference time!
@felixwyw & team’s work at #ICRA2025 presents a method to steer pre-trained policies during generation. Check out https://t.co/5oZDhS7Vxs for all the results!
Want your robot to clean the kitchen your way? 🧹✨
🔗https://t.co/5ym8VIu4hS"
Introducing Inference-Time Policy Steering: a training-free method that lets you specify where and how to manipulate objects, so you can guide non-interactive policies to align with your preferences!