Our new benchmark and dataset for interactive agents. The AI assistants are expected to interrupt when a mistake is made and provide corrective feedback! The benchmark is high-quality collected by our own research team! @apratimbh@RolandMemisevic
🚨Introducing: Ego-MC-Bench (Mistake Corrections) benchmark and Ego-CoMist (Counterfactual Mistakes) dataset.
🎯Ego-MC-Bench: Where AI assistants need to intervene at the right time (when) and with the right feedback (what) to prevent mistakes.
👉https://t.co/zFNEZWAMpt
1/4
🚨Introducing: Ego-MC-Bench (Mistake Corrections) benchmark and Ego-CoMist (Counterfactual Mistakes) dataset.
🎯Ego-MC-Bench: Where AI assistants need to intervene at the right time (when) and with the right feedback (what) to prevent mistakes.
👉https://t.co/zFNEZWAMpt
1/4
📢Excited to be presenting our work on memory + VLAs at ICRA'26 this Thursday morning (poster 224).
We found that a super simple language-based scratchpad with spatial and temporal grounding goes a long way in imparting memory to VLAs.
1/n
📢Current VLAs are stateless - this is not ideal for long horizon tasks - which requires models to remember key past information, e.g., positions, orientations, task progress.
🎯So we just let the VLA take notes to help remember the past a.k.a "Notes-to-Self"!
👉ICRA 2026
CogVL Workshop at #CVPR2026 is less than a week away!
We have an exciting lineup of keynote speakers across Vision, NLP and Cog Sci, and orals/posters on reasoning methods for VLM models.
🕐 June 3, 1 PM
📍 Rooms 610/612
🔗 Schedule: https://t.co/kNSQmKZuGJ
Attending #CVPR2026, Denver?
We have a stellar speaker lineup for “GeoFreeNVS: Geometry-Free Novel View Synthesis and Controllable Video Models.”
Come early. We anticipate seats will fill up FAST!
1/2