Leonardo Perez @leoperzz - Twitter Profile

Pinned Tweet

Leonardo Perez @leoperzz

13 days ago

Amazing experience that we had at #ICRA2026. My favorite photo

Aryan Mangla

@Aryan_Mangla_

13 days ago

We won 1st place in Logistics Picking track at the #ICRA2026 Vienna Site of What Bimanuals Can Do 2026 @WBCDCompetition It focused on whole-body humanoid logistics picking task The journey and experience was just amazing! @leoperzz @autobrik @raulb4s @mbrq_13 @ubillus83797

Aryan_Mangla_'s tweet photo. We won 1st place in Logistics Picking track at the #ICRA2026 Vienna Site of What Bimanuals Can Do 2026 @WBCDCompetition

It focused on whole-body humanoid logistics picking task

The journey and experience was just amazing!

@leoperzz @autobrik @raulb4s @mbrq_13 @ubillus83797 https://t.co/6EbmExopGJ

6

42

9

3K

0

4

0

267

Leonardo Perez @leoperzz

about 18 hours ago

@aryanmadhaverma looks really good! 👀

1

0

80

leoperzz retweeted

Artificio

@Artificio_Org

2 days ago

Debugging

1

9

4

0

324

Leonardo Perez @leoperzz

3 days ago

@iyanmoonyang I think vision is mainly used for trajectory planning. Your brain can still adjust your movements using only sensory feedback from the ground

0

1

0

177

Who to follow

Leonardo Perez @leoperzz

4 days ago

@nanliuuu yes, in fact, pi07 only predicts the next state as an image https://t.co/GFBlHiZm1I

0

1

498

leoperzz retweeted

Lester Li

@sizhe_lester_li

5 days ago

Robot learning is moving beyond policies built for one robot, one scene, one task. At MIT, we’re exploring a different path: turning video world models into embodiment-agnostic robot policies. Introducing VERA: a 14B video-to-action system that controls robots across embodiments, skills, and environments. From zero-shot pick-and-place on a real Panda arm to contact-rich cube reorientation with a 16-DoF robotic hand. Different robots. Different environments. Different tasks. Same video planner. Same weights. We’re open-sourcing everything so you can fine-tune VERA for your own robot setup too. Deep dive in the thread: 🔗 https://t.co/hzuYZ2m5lS 🧵 (1/7)

14

439

60

371

159K

leoperzz retweeted

SAGE @SAGE_1125

6 days ago

We recently wrote a short blog on the mathematical essence behind three common World Model paradigms in Robot Learning. It looks at Future-conditioned / IDM-style, Single-backbone, and MoT-style models from the lens of probabilistic modeling and structured optimization.

SAGE_1125's tweet photo. We recently wrote a short blog on the mathematical essence behind three common World Model paradigms in Robot Learning.

It looks at Future-conditioned / IDM-style, Single-backbone, and MoT-style models from the lens of probabilistic modeling and structured optimization. https://t.co/J6O2v1USlb

6

42

7

61

14K

Leonardo Perez @leoperzz

5 days ago

@iyanmoonyang https://t.co/HWYnt2yNqa this is really good

1

3

0

20

1K

Leonardo Perez @leoperzz

6 days ago

@iyanmoonyang https://t.co/fFlMQC4JrF :)

0

1

0

22

leoperzz retweeted

T.Yamazaki @ZappyZappy7

9 days ago

『ロボットに「人間の動きを真似させる」だけでなく、ロボット自身の体格・重さ・関節・バランスに合わせて、現実に動ける形へ変換するための技術』押す・蹴る・運ぶ全身作業の学習を加速 https://t.co/BhDvmLmhqy #HumanoidRobot #PhysicalAI #RobotLearning #EmbodiedAI #DynaRetarget #ATARI_LAB

4

156

25

81

11K

leoperzz retweeted

Yunsong Zhou

@Yunsong_Zhou

10 days ago

Excited to share that SIM1 has been accepted to ECCV 2026! 🎉 Huge thanks to our amazing team for making this possible. See you in Sweden! 🇸🇪 #ECCV2026

1

52

5

24

9K

leoperzz retweeted

Ethan Clark

@ethanmclark1

11 days ago

Working in robotics right now is what I imagine working with language models felt like in 2023. Everyone throwing things at the wall to see what sticks Pixel prediction (Cosmos), action prediction (VLA), reward prediction (TD-MPC), and representation prediction (JEPA). Different paths for the same problem The recipe that won in language was self-supervised pretraining at internet scale then light finetune on top. Only representation prediction runs that playbook. It learns from action-free video data so you can pretrain on YouTube and egocentric data then add a control layer. Everything else needs action-labeled data that doesn't scale As an RL maximalist, I used to hate LeCun's cake. Turns out he was right all along which is how I ended up a JEPA truther

19

493

34

324

66K

leoperzz retweeted

Seungjae (Jay) LEE @JayLEE_0301

13 days ago

Can a model trained purely on video — with zero action labels — match VLAs trained on massive action-labeled datasets? Meet µ0 (Mew-Zero): a world model that learns a "physical language" for robots. Here's why we're excited 🧵

2

179

25

139

12K

leoperzz retweeted

NVIDIA Robotics

@NVIDIARobotics

13 days ago

@Aryan_Mangla_ @WBCDCompetition @leoperzz @autobrik @raulb4s @mbrq_13 @ubillus83797 @0xnonhuman @XYZRoboticsInc @NVIDIAAI @DrJimFan Congrats!

2

12

2

0

2K

Leonardo Perez @leoperzz

13 days ago

@NVIDIARobotics @Aryan_Mangla_ @WBCDCompetition @autobrik @raulb4s @mbrq_13 @ubillus83797 @0xnonhuman @XYZRoboticsInc @NVIDIAAI @DrJimFan Thank you for SONIC. It's amazing🫡

0

1

0

96

Leonardo Perez @leoperzz

14 days ago

Is it possible to use the same model to do this and do laundry, for example? One of the main problems, I think, is how we can achieve really high-frequency policies

Space and Technology

@spaceandtech_

15 days ago

Researchers from The University of Hong Kong and Kinetix AI have developed a humanoid robot system called SMASH that can play real table tennis using only onboard cameras. The robot tracks the ball in real time without using external cameras or motion-capture systems. It can perform powerful smashes, quick side movements, and low crouching saves using full-body coordination.

6

130

40

23

12K

0

1

0

70

leoperzz retweeted

Lucky Robots @luckyrobots

17 days ago

We are super excited to share with you our initial release of Lucky Engine. We are building a robotics engine from the ground up to be what we wished we could find in a simulator before

28

802

104

265

9M

leoperzz retweeted

Jason Liu

@JasonJZLiu

17 days ago

💥Introducing FACTR 2, learning external force sensing on commodity robot arms without needing dedicated sensors. We show that learned force signals enable force-feedback teleop on low-cost arms and improve BC policies. FACTR 2 consists of: 1. Neural External Torque (NEXT): learns external forces without needing dedicated force sensors. 2. Force-Informed Re-Sampling Training (FIRST): uses the learned force signal to identify task-critical regions and upsample them during training. w/ @StevenOh_ @_tonytao_ 🧵(1/N)

17

286

59

167

106K

Leonardo Perez @leoperzz

18 days ago

The most impressive demo I’ve seen at ICRA! Congrats guys and thank you for the socks too

Flexion Robotics

@FlexionRobotics

19 days ago

We ran 300 fully autonomous live demonstrations over 3 days at ICRA 2026. The task: a humanoid navigating stairs, picking up a box from the floor and placing it on a table. Simple to describe, but hard to execute reliably when your robot is making every decision on its own at a conference with new surroundings and a crowd watching live. This is just a glimpse. We've been pushing our stack much further and we'll be sharing more very soon. More information in the thread. #HumanoidRobots #ICRA2026 #Flexion

5

86

14

13

17K

0

52

leoperzz retweeted

Junzhe (JJ) He @JayHe748646

26 days ago

Excited to share our recent work on whole-body humanoid locomotion for challenging terrain traversal! Diffusion-based planner + RL WBC = general purpose locomotion controller Led by @ctki49 @mxu_cg @KehanWen170077 at @leggedrobotics and @xbpeng4.

4

177

26

76

16K

Leonardo Perez

@leoperzz

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users