🚀 Excited to share PointAction:
A new Video-Point-Action Model that uses dynamic 3D pointmaps as a universal, geometry-grounded action representation for robot control.
VLA → VAM → ? Lift RGB to RGB+XYZ, then decode robot-specific actions.
https://t.co/z6sB8aUQaS
[1/6]