Ryan Hoque

6 months ago

Thrilled to see EgoDex continue to unlock research breakthroughs! Check this one out from colleagues at FAIR.

Amir Bar

@_amirbar

6 months ago

A year ago, I took a big bet and shifted my research to world models. We started with navigation, but the vision was broader: simulate any interaction with the environment, including fine-grained manipulation. Today we introduce DexWM, a world model for dexterous manipulation. Trained on 900+ hours of human and robot video, DexWM lets us imagine, plan, and execute dexterous actions on a real robot.

ryan_hoque retweeted

Amir Bar

@_amirbar

6 months ago

Very interesting question, and so much to unpack. It started in the micro-kitchen. @DavidJFan suggested we talk to @JimmyTYYang1 to brainstorm how to deploy a WM on a Franka arm. We knew we had a model architecture ("CDiT") that was likely to work, but we lacked the right human video training data and a roboticist with the capacity to lead. Then: 1) EgoDex, a new large-scale dataset from Apple, dropped; 2) @raktimgg joined our team and brought the expertise we didn't have. He immediately saw the potential.

8 months ago

Big update: I've left Apple. It’s been a blast pushing the frontier of learning dexterity from human video. I've now joined Meta’s new Robotics Studio as a Research Scientist, where I'll be building some exciting new products with super talented people. Stay tuned!

12 months ago

Exciting news from RSS this weekend! 🚀

12 months ago

Very happy that EgoDex received Best Paper Awards at the 1st EgoAct workshop at #RSS2025! Huge thanks to the organizing committee @SnehalJauhri @GeorgiaChal @GalassoFab10 @danfei_xu @YuXiang_IRVL for putting together this forward-looking workshop. Also kudos to my colleagues @ryan_hoque, David Yoon, Mouli Sivapurapu, and @jian_zhang_!

about 1 year ago

Some nice work on EgoDex visualization by @pablovelagomez1

Pablo Vela

@pablovelagomez1

about 1 year ago

Continued working on the EgoDex dataset: I ported the entire test set to @rerundotio and created a @Gradio app to view it! Links below VVV This allows for a straightforward way to explore each episode of the (test) dataset and better understand how the hand-tracking and SLAM systems performed. Sadly, I had to re-encode the videos to AV1, which took a ton of time (nearly 2 hours of wall time for just the test dataset). Next up is taking this representative dataset and making it amenable to training. I'll start with something easy, such as pose estimation, as it's what I'm most familiar with, but the goal is to allow RRD <-> WebDataset standard conversion.

about 1 year ago

@Ritwik_G @UofMaryland Great news! Congrats!

about 1 year ago

@xiaolonw Well deserved, congrats!!

about 1 year ago

@Michael_J_Black Thank you @Michael_J_Black! It’s in Appendix A.4, since it isn’t one unified link but rather a few.

about 1 year ago

Imitation learning has a data scarcity problem. Introducing EgoDex from Apple, the largest and most diverse dataset of dexterous human manipulation to date — 829 hours of egocentric video + paired 3D hand poses across 194 tasks. Now on arxiv: https://t.co/bJBPER8GTC (1/4)

about 1 year ago

@pablovelagomez1 See Appendix A.4. Make sure to replace [filename] appropriately (try test)

about 1 year ago

@mihdalal Thank you Murtaza! Very exciting to see what you've been cooking @Tesla_Optimus !

ryan_hoque retweeted

about 1 year ago

🚨Introducing EgoDex, the largest egocentric video dataset to date that focuses on human dexterous manipulation, with structured annotations including 3D upper-body and hand tracking🤲, camera pose📷, and language annotation💬. Kudos to the team, and looking forward to what the community can cook up with it. Check out our preprint on arXiv; the data is available for download NOW. I am in Atlanta attending ICRA. DMs are open and happy to chat in person. 📄Preprint: https://t.co/pbUA9CoTId #ICRA #robotics #imitationlearning #dexterousmanipulation

about 1 year ago

The full dataset is now publicly available to the community; access details are in the paper. Sample code for data loading is coming soon. Enjoy!

about 1 year ago

We also propose new benchmarks and train imitation learning policies for dexterous trajectory prediction. Below are 30 Hz wrist and fingertip trajectories on the test set, where blue = ground truth, red = model predictions, and points get lighter up to 2 seconds in the future.

[Image: wrist and fingertip trajectory predictions on the test set] https://t.co/eKuktIStCd

over 1 year ago

@allenzren Congrats Dr. Ren!

ryan_hoque retweeted

over 1 year ago

🚀 New Research on Human-Robot Interaction! 🤖 How can humanoid robots communicate beyond words? Our framework, EMOTION, leverages Large Language Models (LLMs) to dynamically generate expressive gestures, enhancing non-verbal communication in robots.
🤯 Our experiments show that EMOTION can generate various expressive gestures from only TWO examples and match human-generated gestures in understandability & naturalness!
🔍 What’s inside?
✅ LLM-powered motion generation
✅ Human feedback to refine gestures (EMOTION++)
✅ 10 expressive gestures generated and evaluated (thumbs-up, stop, jazz-hands & more!)
📜 Read the full paper: https://t.co/UOYItwsEe0
🎬 Watch the video: https://t.co/O2VkbezW2o
Let’s bring robots closer to human-like interactions! What gestures would you like to see next? 👇
Huge kudos to the amazing team at Apple that made this work @Yuhan_Hu_, Nataliya Nechyporenko, @talking_kim, @waltertalbott, @jian_zhang_. #Robotics #HRI #LLMs #HumanRobotInteraction #GestureGeneration #SocialRobots

over 1 year ago

@eshear Goodhart's Law, one of my favorites

over 1 year ago

@oier_mees Thanks @oier_mees! Latency is an issue, but there are ways to improve it. For example, a faster IK solver (Genesis? ;)) could help, as we are running IK for each new hand pose.

ryan_hoque retweeted