Mohamad H. Danesh

Verified account

@mo_danesh

CS PhD @McGillU and @mila_quebec, working on 🍒 and 🤖 stuff / ex- @LetsUnifyAI, @NUSComputing, @EngineeringOSU

Montréal, Québec

Joined February 2017

800 Following

253 Followers

605 Posts

Pinned Tweet

Mohamad H. Danesh

about 1 month ago

📢 New paper out! We introduce QWM: a single locomotion world model trained across 8 quadrupeds and deployed zero-shot on robots it had never seen by conditioning on their morphology specs: ANYmal-D and Unitree Go1 🦾 No fine-tuning, no warm-up, no retraining from scratch. The key insight: robot morphology isn't a latent variable to infer from motion history, it's a known engineering spec sitting in the USD (or URDF) file. So we just use it directly.

1

21

3

14

1K

mo_danesh retweeted

Aviv Tamar @AvivTamar1

about 23 hours ago

The absolute peak of doing science is witnessing, for the first time, something that the world hasn't seen before. Happened to me today. Can't wait to tell you more about it.

3

39

3

1

2K

Mohamad H. Danesh

1 day ago

@ChongZzZhang @KyleStachowicz Oh cool. Haven't checked the latest versions yet. Wondering how RSL RL's SAC will be compared against FlashSAC

1

0

0

0

42

Mohamad H. Danesh

1 day ago

@KyleStachowicz @ChongZzZhang True, except for RSL RL's PPO

1

0

0

0

69

Who to follow

طراح بازی

Mamad Offline🇮🇷

How can I be homophobic? my bitch is gay!

Verified account

@craftWithPooria

Life · Freedom · Happiness | What more could we ask for? Iranian Builder/Developer/Teacher 5 Startups 2 Exits 3 Failures, Since 2016

Mohamad H. Danesh

1 day ago

@KyleStachowicz @ChongZzZhang In other words, theoretically, if the underlying MDP has bounded actions, parameterizing PPO with an unbounded Gaussian creates a support mismatch. Wouldn't a natively bounded distribution be strictly cleaner?

1

0

0

0

130

Mohamad H. Danesh

1 day ago

@KyleStachowicz @ChongZzZhang Hmmm so why not having bounds for PPO as well? For the sake of unification

1

0

0

0

133

Mohamad H. Danesh

3 days ago

Our method adds as little as ~10 lines on top of TD3+BC: pull generated actions toward the data, push them apart from each other. Go and check it out!

mo_danesh's tweet photo. Our method adds as little as ~10 lines on top of TD3+BC: pull generated actions toward the data, push them apart from each other.

Go and check it out! https://t.co/HJjlFvXkrc

Mohamad H. Danesh

7 days ago

The code is now available! 🚀 DriftQL learns a one-step Q-guided actor corrected by a learned drift field. We beat baselines with no denoising, solvers, auxiliary actors, or distillation. 💻 Code: https://t.co/oPOysnlYFh

0

3

1

0

237

0

2

0

0

49

Mohamad H. Danesh

7 days ago

The code is now available! 🚀 DriftQL learns a one-step Q-guided actor corrected by a learned drift field. We beat baselines with no denoising, solvers, auxiliary actors, or distillation. 💻 Code: https://t.co/oPOysnlYFh

13 days ago

Excited to share DriftQL☄️, a new paradigm for offline RL. Instead of fitting a behavior prior, DriftQL learns a one-step Q-guided actor whose samples are corrected by a drift field. Simple. SOTA on OGBench/D4RL. No denoising. No solvers. No auxiliary actor. No distillation. With my co-authors @mo_danesh, Amin Abyaneh, Scott Fujimoto, Hsiu-Chin Lin, David Meger 🌐 https://t.co/RFfvUiAlW9 🧵

2

13

2

6

1K

0

3

1

0

237

Mohamad H. Danesh

9 days ago

@breadli428 @SFU @leggedrobotics @ETH_en @mxu_cg @JayHe748646 @ki_ki_ki1 @ChongZzZhang @xbpeng4 Nice work as always!

0

2

0

0

123

Mohamad H. Danesh

10 days ago

AutoEval appears to be paused and may potentially be discontinued. For my research, I've trained on the BridgeData V2 and need a remote setup for real-world evaluation. Are there any alternative remote evaluation platforms, shared testbeds, or labs that support Bridge-style setups and allow external researchers to deploy policies remotely?

0

1

0

0

33

Mohamad H. Danesh

11 days ago

@GlenBerseth Congratulations!

0

1

0

0

94

mo_danesh retweeted

13 days ago

Excited to share DriftQL☄️, a new paradigm for offline RL. Instead of fitting a behavior prior, DriftQL learns a one-step Q-guided actor whose samples are corrected by a drift field. Simple. SOTA on OGBench/D4RL. No denoising. No solvers. No auxiliary actor. No distillation. With my co-authors @mo_danesh, Amin Abyaneh, Scott Fujimoto, Hsiu-Chin Lin, David Meger 🌐 https://t.co/RFfvUiAlW9 🧵

2

13

2

6

1K

Mohamad H. Danesh

26 days ago

Excited to announce Michael Rabbat (Co-Founder & VP World Models - AMI labs @amilabs) as a new speaker joining our stellar lineup ✨ 🌐 https://t.co/wMmrTZdfB0

mo_danesh's tweet photo. Excited to announce Michael Rabbat (Co-Founder & VP World Models - AMI labs @amilabs) as a new speaker joining our stellar lineup ✨

🌐 https://t.co/wMmrTZdfB0 https://t.co/LLlrk6H98D

Chenhao Li @breadli428

about 2 months ago

📢 Call for Papers! #RLC2026 🇨🇦 🌎 We are now inviting contributions to the Workshop on Model-based RL in the Era of Generative World Models at @RL_Conference in Montreal, Canada! 🇨🇦 🔗 Webpage https://t.co/y5rdTdnJ07 📄 Submit paper now! https://t.co/NDYAXYdjyd 🧵Format

breadli428's tweet photo. 📢 Call for Papers! #RLC2026 🇨🇦

🌎 We are now inviting contributions to the Workshop on Model-based RL in the Era of Generative World Models at @RL_Conference in Montreal, Canada! 🇨🇦

🔗 Webpage
https://t.co/y5rdTdnJ07

📄 Submit paper now!
https://t.co/NDYAXYdjyd

🧵Format https://t.co/ndiNz3kVsQ

2

46

6

19

10K

0

1

0

0

136

mo_danesh retweeted

Glen Berseth @GlenBerseth

29 days ago

World models are becoming a powerful approach for making the most of available data, but how do we create them to help build better agents? Come check out this workshop at @RL_Conference and submit related ideas!

0

26

5

8

4K

Mohamad H. Danesh

about 1 month ago

This is what made QWM possible 🏗️ Training 8 different quadrupeds simultaneously in one sim was a prerequisite for learning a policy (or a world model if you will) that generalizes across morphologies. Full blog post: https://t.co/3cBFOWp58K

Mohamad H. Danesh

2 months ago

I trained a single PPO policy across 8 quadrupeds simultaneously: Spot, ANYmal (B, C, D), Unitree (Go1, Go2, A1, B2). 🤖 Same weights. Same compute as training on 1 robot. No core Isaac Lab changes. Here's how we broke Isaac Lab's homogeneity assumption to make it work. 🧵👇 https://t.co/5IgcOWbhaD

2

2

1

0

291

0

1

0

1

90

Mohamad H. Danesh

about 1 month ago

Cleaning at Montreal airport is going autonomous. Robots taking over quietly

mo_danesh's tweet photo. Cleaning at Montreal airport is going autonomous. Robots taking over quietly https://t.co/zLPApwn1Tp

0

6

3

1

207

Mohamad H. Danesh

about 1 month ago

For some reason the video got deleted, so here I'm posting it again:

0

0

0

0

56

Mohamad H. Danesh

2 months ago

I trained a single PPO policy across 8 quadrupeds simultaneously: Spot, ANYmal (B, C, D), Unitree (Go1, Go2, A1, B2). 🤖 Same weights. Same compute as training on 1 robot. No core Isaac Lab changes. Here's how we broke Isaac Lab's homogeneity assumption to make it work. 🧵👇 https://t.co/5IgcOWbhaD

2

2

1

0

291

Mohamad H. Danesh

about 1 month ago

This was a fun collaboration with an amazing team 🙌: @breadli428, Amin Abyaneh, @aahoussaini, Kirsty Ellis, @GlenBerseth, Marco Hutter, and Hsiu-Chin Lin from McGill, Mila, ETH Zürich & UdeM. @mcgillu @Mila_Quebec @ETH_AI_Center @leggedrobotics @UMontreal 📄 arXiv: https://t.co/KhdenOW0ey 🌐 Project page (w/ videos): https://t.co/9g5dVomeR5

0

3

0

2

175

Mohamad H. Danesh

about 1 month ago

📢 New paper out! We introduce QWM: a single locomotion world model trained across 8 quadrupeds and deployed zero-shot on robots it had never seen by conditioning on their morphology specs: ANYmal-D and Unitree Go1 🦾 No fine-tuning, no warm-up, no retraining from scratch. The key insight: robot morphology isn't a latent variable to infer from motion history, it's a known engineering spec sitting in the USD (or URDF) file. So we just use it directly.

1

21

3

14

1K

Mohamad H. Danesh

about 1 month ago

The trick: stop treating morphology as a mystery to infer, and start treating it as what it actually is a known engineering spec 📐 We read the robot's USD file, encode its kinematics, mass & actuation, and inject that into the world model's dynamics at every step. No adaptation lag. No warm-up. No dangerous trial-and-error on a real robot 🤖

mo_danesh's tweet photo. The trick: stop treating morphology as a mystery to infer, and start treating it as what it actually is a known engineering spec 📐

We read the robot's USD file, encode its kinematics, mass & actuation, and inject that into the world model's dynamics at every step. No adaptation lag. No warm-up. No dangerous trial-and-error on a real robot 🤖

1

1

0

0

120

Last Seen Users on Sotwe

Trends for you

Most Popular Users