Introducing Unified Motion-Action (UMA) Model, a robot foundation model that uses 3D object motion as a shared interface for heterogeneous robot learning. UMA treats motion and action as co-evolving variables, enabling knowledge transfer across data sources and versatile inference. 🧵 1/n
Introducing KITE 🪁, a paradigm that transfers the interaction intent instead of joint-space actions across structurally different manipulators. Deploying on a new robot needs only its kinematics model (URDF), no new task demonstrations. 🧵1/8
Teleoperation is slow, expensive, and difficult to scale. So how can we train our robots instead?
Introducing X-Sim: a real-to-sim-to-real framework that trains image-based policies 1) learned entirely in simulation 2) using rewards from human videos.
https://t.co/5yt2iTFYF4
Check out our paper and project page for more information.
Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins
📄 https://t.co/THRpinMhZb
🌐 https://t.co/VrGeMMiyUC
💻 https://t.co/s8gHqp5G5l
Huge thanks to my advisors: @KuanFang and @weichiuma
How can robots solve tasks that demand both semantic and physical reasoning, like playing real-world Angry Birds, without tons of data?
We introduce Prompting with the Future: an MPC framework that fuses a pretrained VLM with an interactive digital twin for grounded, open-world motion planning.
🌐 https://t.co/VrGeMMiyUC
By explicitly modeling dynamics with an interactive digital twin, our method significantly outperforms baselines that directly prompt or fine-tune the VLM to handle both semantics and physics.
🤯 GPT-4o knows H&M left Russia in 2022 but still recommends shopping at H&M in Moscow.
🤔 LLMs store conflicting facts from different times, leading to inconsistent responses. We dig into how to better update LLMs with fresh facts that contradict their prior knowledge.
🧵 1/6 Our Paper: “Memorization vs. Reasoning: Updating LLMs with New Knowledge”
Contributions:
🏗️ Knowledge Update Playground (KUP) – a dataset & pipeline to simulate real-world fact changes
🚅 Memory-Conditioned Training (MRT)– a simple continued-pretraining method to inject and recall new facts