Shuo Cheng @ShuoCheng94 - Twitter Profile

3 months ago

Introducing EgoVerse: an ecosystem for robot learning from egocentric human data. Built and tested by 4 research labs + 3 industry partners, EgoVerse enables both science and scaling. 1300+ hrs, 240 scenes, 2000+ tasks, and growing. Dataset design, findings, and ecosystem 🧵

34

883

163

458

269K

ShuoCheng94 retweeted

Danfei Xu

@danfei_xu

4 months ago

New essay on robot learning from human data. I like @karpathy’s idea that LLMs are “ghosts” distilled from human knowledge. In robotics, we are attempting something similar: to summon a sensorimotor ghost. Our current ritual is teleoperation. It produces data, but strips away the reflexes, priors, and social interactions that make human behavior rich. My bet: robot learning will scale less with more robots, and more with better models of humans. Right now we lack both the systems and algorithms to model humans well. If we succeed, the result won’t just be better robots. It may be the first learned theory of how humans act in the physical world. Robots would simply be the first place we deploy it.

6

150

13

119

28K

Shuo Cheng @ShuoCheng94

7 months ago

🤖 Curious how Sim-and-Real Co-Training with OT works? Read the full paper here: https://t.co/diEOC9u6WB Website: https://t.co/Zsq7ppQyGb (n/n)

0

4

1

452

Shuo Cheng @ShuoCheng94

7 months ago

Can large-scale sim data enable real-world generalization?🤔 In our new work, we introduce a generalizable domain adaptation setting, where policies must handle real-world situations never presented in the real training data. (1/n)

1

40

14

33

16K

Shuo Cheng @ShuoCheng94

7 months ago

In evaluation, our method delivers up to 30% higher success rates than the co-training baseline and generalizes to scenarios seen only in simulation, marking a step toward scalable robot learning without large real-world datasets. (4/n)

1

3

1

0

543

ShuoCheng94 retweeted

Yangcen Liu @Randle_Liu

9 months ago

What if one unified method helps robots learn from human videos across many tasks, many robots? Meet ImMimic: Cross-Domain Imitation from Human Videos via Mapping and Interpolation (CoRL 2025 Oral Presentation🏆) @ICatGT Check it here https://t.co/mrBAjewrlg!

6

80

24

42

27K

ShuoCheng94 retweeted

Simar Kareer @simar_kareer

over 1 year ago

Introducing EgoMimic - just wear a pair of Project Aria @meta_aria smart glasses 👓 to scale up your imitation learning datasets! Check out what our robot can do. A thread below👇

10

238

54

81

49K

Shuo Cheng @ShuoCheng94

almost 2 years ago

Visit our website for the paper and more details: https://t.co/thqTCGcDz0. Joint work with @CaelanGarrett, @AjayMandlekar and @danfei_xu (N/N)

1

7

0

2

1K

Shuo Cheng @ShuoCheng94

almost 2 years ago

With a large-scale simulation study, we show NOD-TAMP can solve challenging tasks with a handful of demos (4 vs. 500 demos compared to BC) and achieves strong generalization across diverse shapes, spatial layouts, and task goals. (5/N)

1

22

1

8

3K

Shuo Cheng @ShuoCheng94

almost 2 years ago

Together, NOD-TAMP flexibly integrates the adaptation of recorded trajectories with traditional motion planning to generalize across drastically different scene layouts. Here we show the full process of skill planning and adaptation for the mug sorting task. (5/N)

1

2

0

1K

Shuo Cheng @ShuoCheng94

almost 2 years ago

NOD-TAMP reasons about the pre- and post-conditions of each skill in NOD space and plans skill sequences to reach different goals. For instance, it can decide whether to pick a mug by the rim or the handle to hang it on a rack, and use tools to manipulate hard-to-reach objects. (4/N)

1

2

0

1K

Shuo Cheng @ShuoCheng94

almost 2 years ago

For skill adaptation, our key insight is to use learned neural object descriptors (NOD) to transform skill trajectories from one task instance to others, thus being able to apply the demoed skills to manipulate unseen object shapes at novel poses. (3/N)

1

3

0

1K

Shuo Cheng @ShuoCheng94

almost 2 years ago

NOD-TAMP is a bi-level planner that reasons about (1) what skills to use given a high-level task goal and (2) how to co-adapt each skill and compose them to form a long-horizon trajectory plan. (2/N)

1

3

0

1K

Shuo Cheng @ShuoCheng94

almost 2 years ago

Can we teach a robot hundreds of tasks with only dozens of demos? Introducing NOD-TAMP: A framework that chains together manipulation skills from as few as one demo per skill to compositionally generalize across long-horizon tasks with unseen objects and scenes. (1/N)

4

164

24

112

32K

Shuo Cheng @ShuoCheng94

about 2 years ago

@danfei_xu Thanks Danfei for the great advising!

0

10

0

336

ShuoCheng94 retweeted

Danfei Xu

@danfei_xu

over 2 years ago

Since we are entering the "BC is all you need" phase of Robot Learning😜 --- Robomimic (https://t.co/jm2STNoHLu) allows you to play with SOTA algorithms (BC-Transformer, DiffusionPolicy, etc.) on challenging tasks. Also easy to integrate with physical robots!

2

98

19

42

13K

ShuoCheng94 retweeted

Shangjie Xue @ CVPR @ShangjieXue

over 2 years ago

How to represent granular materials for robot manipulation? Introducing our #CoRL2023 project: Neural Field Dynamics Model for Granular Object Piles Manipulation, a field-based dynamics model for granular object piles manipulation. 🌐 https://t.co/6KPwV32iqO 👇 Thread

1

19

7

8

10K

ShuoCheng94 retweeted

Vaibhav Saxena @saxenavaibhav11

about 3 years ago

If you're at #ICRA2023 come chat with us about our poster on "Generalizable Pose Estimation using Implicit Scene Representations!" Pod 11 at 3pm BST. Read more about our paper: https://t.co/hLuRAnBY53

0

22

5

1

3K
