Can my robot cook my food, rearrange my dresser, tidy my messy table and do so much more without ANY demos or real-world training data?
Introducing ManipGen: a generalist agent for manipulation that can solve long-horizon robotics tasks entirely zero-shot, from text input!
1/N
Latest preprint from @Apple MLR - we use conditional diffusion models + Perceiver I/O to learn the policy's state visitation and the value function on hard offline robotic tasks https://t.co/9oRtsxKTag. Work with @waltertalbott, @itsbautistam, Devon, Alex and @jsusskin.
New paper TRACT - Faster diffusion model sampling
- Single-step diffusion SotA for CIFAR10 and ImageNet64 with L2 loss without architecture changes
- Up to 2.4x FID improvement
https://t.co/UOMlLb0plG
Excited for this to be out! Introducing GAUDI: a generative model for 3D indoor scenes. We tackle the problem of learning a generative model of 3D scenes parametrized as radiance fields. This has been a great collaboration across multiple teams at @Apple.
https://t.co/aJOqtzA2CI
Check out our work on model-based RL in the presence of severe visual distractions at the #NeurIPS2021 Deep RL workshop on Monday, Dec 13, 12:30 PST.
arXiv: https://t.co/VttKf5YwXZ
Code: https://t.co/GZ8accrFPu