Xuning Yang @xuningy - Twitter Profile

Pinned Tweet

about 2 months ago

When every generalist robot model scores 95%+ on a benchmark, the numbers become meaningless. What if we built a photorealistic benchmark that never saturates and can generate new scenes and tasks with AI Workflows in minutes? We introduce RoboLab! 🧵(1/6)

xuningy's tweet photo. When every generalist robot model scores 95%+ on a benchmark, the numbers become meaningless.

What if we built a photorealistic benchmark that never saturates and can generate new scenes and tasks with AI Workflows in minutes?

We introduce RoboLab! 🧵(1/6) https://t.co/GxFIivVmKa

10

147

27

106

28K

Xuning Yang @xuningy

18 minutes ago

@chris_j_paxton We built RoboLab to answer the question "How should we *objectively* evaluate *real-world* generalist policies at scale?" which is exactly where we differentiate. The noise, though, is a problem of statistics and quantifying the sim2real gap. Unfortunately this is not solved.

0

5

xuningy retweeted

Kaichun Mo @ CVPR

@KaichunMo

4 days ago

Cosmos3 (post-trained on DROID) surpassed strong VLA & WAM baselines to rank #1 on RoboLab All the compute FLOPs invested during the massive Cosmos3 pre-training and mid-training contribute to unlocking a better robot foundation model.😄

0

59

9

13

6K

Xuning Yang @xuningy

4 days ago

🎉 We added 2 SOTA WAMs to the RoboLab Leaderboard 🎉 Current leaders on RoboLab-120 (specific instr.): 🥇Cosmos3-Nano-Policy (39.7%) 🥈π0.5 (28.1%) 🥉DreamZero (28.1%) → See full results at: https://t.co/Le8jykn5jo → All policy clients available at: https://t.co/wQH4Py6zJ8

xuningy's tweet photo. 🎉 We added 2 SOTA WAMs to the RoboLab Leaderboard 🎉

Current leaders on RoboLab-120 (specific instr.):
🥇Cosmos3-Nano-Policy (39.7%)
🥈π0.5 (28.1%)
🥉DreamZero (28.1%)

→ See full results at: https://t.co/Le8jykn5jo

→ All policy clients available at: https://t.co/wQH4Py6zJ8 https://t.co/PMg9l74zBU

7

126

21

65

30K

Who to follow

Jacky's B2B

@JackysBiz

Twitter account for Jacky's Business Solutions. We talk robotics, 3D & large format printing, b2b tech. Celebrating 50 years.

रघुपति राघव राजाराम,पतित पावन सीताराम। सुंदर विग्रह मेघश्याम,गंगा तुलसी शालग्राम॥ भद्रगिरीश्वर सीताराम,भगत-जनप्रिय सीताराम। जानकीरमणा सीताराम,जयजय राघव सीताराम॥

Xuning Yang @xuningy

about 1 month ago

@wang_siyin Results coming soon, will be updated on the website

0

30

Xuning Yang @xuningy

about 2 months ago

When every generalist robot model scores 95%+ on a benchmark, the numbers become meaningless. What if we built a photorealistic benchmark that never saturates and can generate new scenes and tasks with AI Workflows in minutes? We introduce RoboLab! 🧵(1/6)

10

147

27

106

28K

xuningy retweeted

rishit dagli @rishit_dagli

about 1 month ago

a new form of greeting has dropped: see you at icml🇰🇷 1/1 accepted to ICML. more details soon

3

25

2

0

2K

xuningy retweeted

Yu Xiang

@YuXiang_IRVL

about 1 month ago

We are still far from zero-shot policy deployment on new tasks

4

55

10

29

9K

Xuning Yang @xuningy

about 1 month ago

@YuXiang_IRVL Far from it, but the field is moving fast. We have results from DreamZero coming very soon.

0

1

0

98

xuningy retweeted

NVIDIA Robotics

@NVIDIARobotics

about 2 months ago

Generalist robot policies need a benchmark that works across any robot and any policy. 🦾 Introducing RoboLab, a high‑fidelity simulation benchmark built on NVIDIA Isaac and Omniverse to evaluate generalist robot policies in diverse, photoreal, physics‑based environments. Coming soon to the NVIDIA Isaac Lab‑Arena roadmap for large‑scale, robotic policy evaluation. 📖 https://t.co/wW472SHXPz #NationalRoboticsWeek

8

259

40

99

24K

Xuning Yang @xuningy

about 2 months ago

RoboLab comes with RoboLab-120 — a curated, diverse benchmark of 120 tasks to get started. Set up and run in <20 min. (6/6) Try it out 👇 🌐 https://t.co/pNMITqaCus 📄 https://t.co/CDS0tpFnZ0 💻 https://t.co/bnJmhPMXa5

0

21

3

7

2K

Xuning Yang @xuningy

about 2 months ago

→ Customization: Comes with 200+ objects, 100+ backgrounds, lighting, camera poses… don’t like it? No problem, add your own → Diagnostics: motion quality, failure events, + sensitivity analysis for failure attribution (5/6)

1

7

0

941

Xuning Yang

@xuningy

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users