Joseph Amigo

@Jsphamigo

PhD Candidate at NYU & LAAS-CNRS - Robotics, Reinforcement Learning, Deep Learning

New York

Joined February 2018

78 Following

19 Followers

12 Posts

Joseph Amigo @Jsphamigo

7 months ago

@xing_rui12683 Meanwhile ICRA forbids using AI to reformulate your reviews…

601

Joseph Amigo @Jsphamigo

10 months ago

@Rk4342R We’re in the process of cleaning up the code and pushing it to GitHub. I believe it should be available by the end of next week.

134

Joseph Amigo @Jsphamigo

10 months ago

Introducing our new work DMO: Decoupled Model-based policy Optimization! First-order gradient RL that unrolls trajectories with high-fidelity sims & computes gradients via learned models. Paper & demos: https://t.co/SrLCn1mdVA #CoRL2025 w/ @Rk4342R

Joseph Amigo @Jsphamigo

10 months ago

@KyleMorgenstein @Rk4342R Reducing the size of the replay buffer generally impeded learning (however, for the Go2 walking exp, it was beneficial to have 1e5 instead of 1e6).

Joseph Amigo @Jsphamigo

10 months ago

@KyleMorgenstein @Rk4342R I see, very interesting!

Joseph Amigo @Jsphamigo

10 months ago

@KyleMorgenstein @Rk4342R We generally used 4 mini/batches.

Joseph Amigo @Jsphamigo

10 months ago

@KyleMorgenstein @Rk4342R "coming from PPO it’s shocking to see so few get such good performance" -> the price, however, for now, is the need for a differentiable reward function.

Joseph Amigo @Jsphamigo

10 months ago

@KyleMorgenstein @Rk4342R If my speculation is correct, I believe so!

Joseph Amigo @Jsphamigo

10 months ago

@KyleMorgenstein @Rk4342R Thank you! For the value function, we use regular TD-lambda. For the dynamics model, the std for each feature in the obs is different. Depending on the task, I also recommend using design choices I to V of the "4.2 Design Choices" section in https://t.co/PHy5vUVp6g.

Jsphamigo retweeted

C's Robotics Paper Notes @RoboReading

10 months ago

First Order Model-Based RL through Decoupled Backpropagation (DMO) https://t.co/1o4BjruhZc Simulation rollouts, learned model for first order optimization

RoboReading's tweet photo. First Order Model-Based RL through Decoupled Backpropagation (DMO)

https://t.co/1o4BjruhZc

Simulation rollouts, learned model for first order optimization https://t.co/JaQXGp6aWp

109

Joseph Amigo

@Jsphamigo

Last Seen Users on Sotwe

Trends for you

Most Popular Users