Stanford MSL

@StanfordMSL

Stanford Multi-robot Systems Laboratory. Endowing groups of robots with the intelligence to collaborate safely and effectively with humans and each other.

Joined December 2020

9 Following

766 Followers

52 Posts

StanfordMSL retweeted

about 2 months ago

🤖Low-data post-training can teach a VLA policy a new robot skill. But it also makes it too attached to the training demos. We call this lock-in🔒: the policy can execute the post-training task, yet fails to respond to seemingly obvious prompt changes. DeLock preserves steerability using only the policy’s own pretrained knowledge. No extra supervision needed!🚀🚀🚀 #Robotics #AI #EmbodiedAI #VLA

5

178

43

98

31K

Stanford MSL @StanfordMSL

3 months ago

@SwannAiden @allenzren @JiankaiSun @QuanVng @MacSchwager To find out more about this work check out the paper and website:📄 https://t.co/synRHChFHs 🌐 https://t.co/z2BIcBepGs

0

14

2

7

1K

Stanford MSL @StanfordMSL

3 months ago

π, But Make It Fly ✈️ We fine-tuned π0, a VLA model pretrained entirely on manipulators, to fly a drone that picks up objects, navigates through gates, and composes both skills from language commands.

14

363

43

154

101K

Stanford MSL @StanfordMSL

3 months ago

This work was done in collaboration with Johnathan Tucker, Denis Liu, @SwannAiden, @allenzren, Javier Yu, @JiankaiSun, Brandon Kim, Lachlain McGranahan, @QuanVng, and @MacSchwager Stay tuned for the dataset and code!

1

10

0

0

2K

Who to follow

Verified account

Prof @Stanford, Distinguished Research Scientist and AV research lead @nvidia. PhD from @MITAeroAstro. Robotics, autonomous systems, AI. Opinions are my own.

We develop perception, control, & planning algorithms for robot autonomy | @CMU_Robotics | https://t.co/gWjGiUaBeP | https://t.co/9wO6amxfFc

Anirudha Majumdar

Verified account

Associate Professor @Princeton | Co-Director, Princeton Robotics | 20% Research Scientist @GoogleDeepMind.

Stanford MSL @StanfordMSL

7 months ago

Check out our new paper accepted by RA-L at: Check out our new paper accepted by RA-L at: https://t.co/GG6AQ29Mbi #Robotics #drone #VLM #VLA #RL #3DGS

Qianzhong Chen @QianzhongChen

7 months ago

🧵 Thread — GRaD-Nav++ 1/9 Do you ever wish you could throw away the controller and just tell your drone what to do? Like: “Go through that gate, then stop over the ladder.” or during midway “Actually switch tasks — fly to the monitor on the right.”

1

8

1

4

5K

1

3

0

2

603

Stanford MSL @StanfordMSL

about 1 year ago

[2/2] 📄 arXiv preprint: https://t.co/A92aF1jkip 🌐 Project website: https://t.co/IxKQl84OUt 💻 Code on GitHub: https://t.co/5hSXgQnrce 🎥 Demo video: https://t.co/8ReAR94NKi

0

1

1

0

321

Stanford MSL @StanfordMSL

about 1 year ago

[1/2] Excited to announce GRaD-Nav! We propose a new framework that integrates 3DGS and Differentiable RL to train vision-based drone navigation policies. Our method achieves efficient end2end training, zero-shot sim2real transfer, and strong in-task adaptability.

1

6

3

4

2K

Stanford MSL @StanfordMSL

about 1 year ago

[5/5] We show in hardware experiments that LatentToM solves tasks with two decentralized arms as well as a fully centralized bi-manual policy. Paper: https://t.co/gsUQC9dAuq Project: https://t.co/BFATfun1zX

0

1

0

0

415

Stanford MSL @StanfordMSL

about 1 year ago

[1/5] Humans collaborate with each other by simulating the state of mind of their teammates, a concept called Theory of Mind (ToM). We propose LatentToM, a method to endow robots with a theory of mind in latent space for cooperative manipulation.

1

4

2

4

14K

Stanford MSL @StanfordMSL

about 1 year ago

[4/5] LatentToM is comms flexible. Without comms, the robots rely completely on Theory of Mind for coordination. With comms, they use a single communication round to align their consensus embeddings at each policy inference.

1

1

0

0

443

Stanford MSL @StanfordMSL

about 1 year ago

[5/5] We embrace these findings by proposing an Action Lookup Table (ALT) policy, which equals the diffusion policy's reactivity and dexterity with a fraction of the memory footprint and inference time. And no diffusion denoising steps!

0

3

0

0

439

Stanford MSL @StanfordMSL

about 1 year ago

[1/5] What happens when you prompt a robot diffusion policy with an image of a cat? Website: https://t.co/9rD2SVsbkY Paper: https://t.co/2xJhBHbaYZ

2

27

2

18

4K

Stanford MSL @StanfordMSL

about 1 year ago

[4/5] A visual hash function indexing a memorized action lookup table gives closed-loop visual reactivity without the need for action generalization, which seems to be a powerful recipe for imitation learning with few demonstrations.

1

1

0

0

498

Stanford MSL @StanfordMSL

about 1 year ago

[3/3] This learned policy is designed to adapt at runtime to variations in drone dynamics. It outputs thrust and body rate commands and runs at 20hz on a commodity drone with only onboard compute and perception.

0

0

0

0

202

Stanford MSL @StanfordMSL

about 1 year ago

[1/3] We're excited to announce SOUS VIDE, a method to train visuomotor navigation policies for autonomous drones without data from real-world flights, using only Gaussian Splat reconstructions of the scene. 📄 Paper: https://t.co/8J7SV3w3ef 🌐 Project: https://t.co/EzEAY4jkOg

1

6

0

1

1K

Stanford MSL @StanfordMSL

about 1 year ago

[2/3] To do this, we train a Gaussian Splatting model of a scene and virtually "fly" the drone within it with a massive volume of motion and dynamics perturbations. This produces 100k+ image-action pairs, which then supervise the training of a our learned policy.

1

0

0

0

251

Last Seen Users on Sotwe

Trends for you

Most Popular Users