Thanks! The MoE actor learns its routing end-to-end, so there's no separate classifier or distillation step. On scaling, the experts specialize into shared sub-skills rather than whole tasks, so adding tasks doesn't mean adding experts. The critic is the bottleneck. We use one per task to isolate the signal, so critic count scales with the number of tasks. We're currently extending the framework to MTRL on loco-manipulation. Stay tuned!
How do you get perceptive locomotion over rough terrain without brittle terrain classifiers?
Excited to share CTS-MoE, a framework for implicit terrain adaptation via Mixture-of-Experts for perceptive locomotion. No selectors or per-task policies; the policy adapts end-to-end straight from vision.
TL;DR:
β Perception-driven routing handles diverse, discontinuous terrain implicitly; no high-level task selector or per-task policy distillation.
β Big gains on hard tasks (climbing, gaps) under MTRL, with smooth transitions on both seen and unseen terrain.
π§΅Thread:
[6/6] Huge thanks to my collaborators: @mpangarola, @aditya_potnis, Ana Luiza Mineiro, Prof. Marcelo Becker and Prof. Girish Chowdhary.
π https://t.co/8tEG4vMOGd
π https://t.co/O8IZLbv46J
[5/6] Hierarchical pipelines are bottlenecked by discrete high-level selection. Input-dependent routing lets the robot flow through transition states across a continuous 60m obstacle course.