Introducing our first technical blog post:
Automatic Dense Rewards for Autonomous Robot Learning
In this post, we utilize VLM chunking techniques to densely reward trajectories based on video and telemetry data without human supervision.
We propose a reward shaping mechanism which aims to reward positive parts of rollouts while adding time step penalty to discourage idleness or inefficiency.
More in the blog post below
https://t.co/WZgUXRvbBZ
Introducing our first technical blog post:
Automatic Dense Rewards for Autonomous Robot Learning
In this post, we utilize VLM chunking techniques to densely reward trajectories based on video and telemetry data without human supervision.