Shuo Yang

Verified account

@ShuoYangAIR

CTO & Co-founder @ Mondo Robotics/ Ex Tesla | CMU PhD | Ex DJI

Palo Alto

Joined August 2022

62 Following

2.4K Followers

67 Posts

Pinned Tweet

about 1 year ago

I’ve recently left the Tesla Optimus team to co-found Mondo Tech with my longtime friend and former DJI colleague, Soren. I’m incredibly appreciative of the opportunity to contribute to Elon’s vision for general-purpose humanoid robots. Optimus has pushed both technical frontiers and society’s imagination of what it might look like for humans and robots to coexist more closely. I’m still a firm believer, but I’m chasing a different path. We’d like to test the hypothesis that smaller, more accessible robots, designed with consumer applications in mind, can be both meaningful products today and foundational platforms tomorrow. These systems will be compact, safe, user-friendly, and useful, capable of running modern machine learning algorithms on a fully self-developed embedded and mechatronic stack. We’re building teams in both Palo Alto and Shenzhen. If you’re passionate about embedded systems, ML/RL, and you want to build mini robots that can serve as little friends in people’s lives, we’d love to hear from you. That said, what truly cemented this decision is something more personal: Our son is almost three. He loves stories. Every night, I tell him adventures about a boy named Pike and his robot companion Nick. Nick helps fix trains using his own parts, scouts mountains (adapting into a drone), and stays up all night when Pike is sick. These stories came from a place of love and hope that he might grow up in a world where robots aren’t just tools, but friendly companions. My son never met Optimus, though he used to watch Optimus videos with me. But when I had to leave him crying to return to the lab, he gradually lost interest. Most nights, after storytime, I’d drive back to work, tuning Optimus algorithms in the Tesla lab, while watching him sleep on the baby monitor, wondering what future I was building for him. One night, I told him: “Daddy’s not working on that robot anymore. I’m going to build you one, like Nick. One that can be with you every day.” He looked at me, smiled, and said, “Okay.” And that was all I needed.

ShuoYangAIR's tweet photo. I’ve recently left the Tesla Optimus team to co-found Mondo Tech with my longtime friend and former DJI colleague, Soren. I’m incredibly appreciative of the opportunity to contribute to Elon’s vision for general-purpose humanoid robots. Optimus has pushed both technical frontiers and society’s imagination of what it might look like for humans and robots to coexist more closely. I’m still a firm believer, but I’m chasing a different path.

We’d like to test the hypothesis that smaller, more accessible robots, designed with consumer applications in mind, can be both meaningful products today and foundational platforms tomorrow. These systems will be compact, safe, user-friendly, and useful, capable of running modern machine learning algorithms on a fully self-developed embedded and mechatronic stack.

We’re building teams in both Palo Alto and Shenzhen. If you’re passionate about embedded systems, ML/RL, and you want to build mini robots that can serve as little friends in people’s lives, we’d love to hear from you.

That said, what truly cemented this decision is something more personal:

Our son is almost three. He loves stories. Every night, I tell him adventures about a boy named Pike and his robot companion Nick. Nick helps fix trains using his own parts, scouts mountains (adapting into a drone), and stays up all night when Pike is sick. These stories came from a place of love and hope that he might grow up in a world where robots aren’t just tools, but friendly companions. My son never met Optimus, though he used to watch Optimus videos with me. But when I had to leave him crying to return to the lab, he gradually lost interest. Most nights, after storytime, I’d drive back to work, tuning Optimus algorithms in the Tesla lab, while watching him sleep on the baby monitor, wondering what future I was building for him.

One night, I told him:
“Daddy’s not working on that robot anymore. I’m going to build you one, like Nick. One that can be with you every day.”

He looked at me, smiled, and said, “Okay.”

And that was all I needed.

42

652

32

111

55K

about 8 hours ago

@TheHumanoidHub Thank you very much!

2

6

0

0

323

12 days ago

@charles_rqi @mondorobotics Thank you very much Charles!

0

2

0

0

479

18 days ago

@SpaceX Could you tell us how was the booster during water landing?

1

0

0

0

462

26 days ago

@evanbeard You need RoboMaster, dude. The best way to train qualified robotics engineers is hosting college level robotics competitions for students.

0

0

0

0

26

28 days ago

@TairanHe99 Congratulations!

0

3

0

0

376

about 1 month ago

This is an impressive work

about 1 month ago

We are back. After one year of quiet building. Introducing GENE-26.5, our first robotic brain that takes a major step toward human-level capability. For years, robotics has struggled to learn from the world’s largest and valuable data source: Humans. Solving it means rethinking the whole stack from the ground up: - A robotics-native foundation model. - A 1:1 human-like robotic hand. - A noninvasive data collection glove for motion, force, and touch. - A simulator that turns weeks of experiments into minutes. GENE-26.5 is trained across language, vision, proprioception, tactile, and action. We designed a set of tasks to test how far we can go with this new paradigm. Fully autonomous, 1x speed, one model, same weights. (Enjoy with sound on) We are approaching the endgame for robotics. And this is just a beginning.

281

6K

1K

3K

3M

0

12

0

1

1K

about 1 month ago

Chao is one of the best roboticists I know

about 1 month ago

Our first demo debuted on Jensen Huang's GTC keynote, and today we’re launching @SanchoRobotics 🚀 GTC keynote demo with @MultiplyLabs. Extended cut below.

7

109

26

36

16K

1

59

8

28

8K

about 2 months ago

DiT4DiT is now open source! As the first humanoid-deployable Video-Action Model built on a world model, DiT4DiT continues to surprise us. In our paper last month, we showed its strong data efficiency. Now, with only slight modifications, it enables real-time whole-body autonomous pick-and-place. Paper: https://t.co/tjQipiYpBS Code: https://t.co/7pacr0mJk3 Website: https://t.co/YFZUTCTuUr

6

180

31

111

14K

2 months ago

@chris_j_paxton Please let me know Chris.

0

1

0

0

135

2 months ago

@ericjang11 Thank you Eric for visiting Mondo Robotics. The world will be a better place for sure.

0

21

0

0

2K

3 months ago

@adrianmacneil Will do!

0

1

0

0

42

3 months ago

Additional to a team of people who can deploy world model based manipulation model on humanoids (Please read https://t.co/hWAkwMmsFp), Mondo also has a team of people who can build the best consumer robot in the world. You will get this robot at a low price soon.

🐧 Daniel Garcia | therobotbay.com - Robot market

3 months ago

Mondo Robotics should seriously sell these - like now!

16

411

54

127

36K

3

43

2

22

6K

3 months ago

@HyperLogistix @dannybuntu Me and a team of good engineers, bro

2

1

0

1

51

3 months ago

@Majumdar_Ani Hi Anirudha, nice article! Please checkout our recent world model work https://t.co/VDa82Tdde8. Looking forward to hearing your comment!

0

4

0

0

632

3 months ago

@AndreTI it’s not a bug. It is because the VAM is very slow to infer so the robot receives discontinuous trajectories despite of smoothing mechanism. We need to improve inference time by compressing model

2

5

0

0

406

3 months ago

We’re excited to share DiT4DiT, an end-to-end Video-Action Model for robot learning that unifies a video Diffusion Transformer and an action Diffusion Transformer in a single cascaded framework. By leveraging the rich spatiotemporal and physical dynamics learned through video generation, rather than static image-text priors, DiT4DiT achieves state-of-the-art results on LIBERO (98.6%) and RoboCasa GR1 (50.8%) with far less training data, delivering over 10× better sample efficiency and up to 7× faster convergence. Real-world deployment on a humanoid robot further shows robust generalization. We believe this is a step toward making video generation a powerful backbone for robot policy learning. This work builds upon the brilliant foundations laid by Nvidia's GR00T and Cosmos. Project: https://t.co/YFZUTCTuUr Paper: https://t.co/tjQipiYpBS Code: Coming soon. In the meantime, you can ask your coding agent to reproduce the method based on GR00T/Cosmos.

6

228

36

156

31K

3 months ago

@elvisnavah Thank you Elvis. Your pioneering work is really inspiring

0

1

0

0

115

3 months ago

@chris_j_paxton Thank you Chris!

1

2

0

0

174

4 months ago

@HSlifelearner @TairanHe99 @zhengyiluo Hey Harsh, would love to catch up with you soon

0

0

0

0

140

4 months ago

Had a pleasant meeting with brilliant roboticists @TairanHe99 and @zhengyiluo. I am sure Nvidia will win 2026’s robot foundation model race!

ShuoYangAIR's tweet photo. Had a pleasant meeting with brilliant roboticists @TairanHe99 and @zhengyiluo. I am sure Nvidia will win 2026’s robot foundation model race! https://t.co/PTa8fBfan5

4

90

3

11

11K

Last Seen Users on Sotwe

Trends for you

Most Popular Users