Excited to share our new paper on large-angle monocular dynamic novel view synthesis! Given a single RGB video, we propose a method that can imagine what that scene would look like from any other viewpoint.
Website: https://t.co/uhY9NdWAPt
Paper: https://t.co/beb3W8ojOr
π§΅(1/5)
π π¨ππππππ«ππππ accepted to #ICRA2026!
Video diffusion often "hallucinates" robot motion. We ground diffusion in kinematics to synthesize high-fidelity, embodiment-consistent training data.
π Paper: https://t.co/d7YJO2nWEE
π Project: https://t.co/BmDO1uq1jx
We are grateful to be awarded an oral presentation -- please come by Wed 10/2 at 1:30pm (I believe we are the first talk in the oral session) as well as the poster session afterward (number 156) at 4:30pm! #ECCV2024 π
Excited to share our new paper on large-angle monocular dynamic novel view synthesis! Given a single RGB video, we propose a method that can imagine what that scene would look like from any other viewpoint.
Website: https://t.co/uhY9NdWAPt
Paper: https://t.co/beb3W8ojOr
π§΅(1/5)
Excited to share our new paper on large-angle monocular dynamic novel view synthesis! Given a single RGB video, we propose a method that can imagine what that scene would look like from any other viewpoint.
Website: https://t.co/uhY9NdWAPt
Paper: https://t.co/beb3W8ojOr
π§΅(1/5)
Apart from robotics and related scenes, it also works quite well on driving scenarios! In general, we believe our framework can help unlock powerful applications in rich dynamic scene understanding, perception for embodied AI, and interactive 3D video viewing.
π§΅(4/5)
Happy to share our #ICCV2023 paper on 3D reconstruction from a single image!
In Zero-1-to-3, we teach diffusion models to control the camera viewpoint, which enables novel view synthesis applications.
Website: https://t.co/qiJmB2Kfrl
Paper: https://t.co/CaDXVloP1v
π§΅(1/n)
Happy to share our #ICCV2023 paper on 3D reconstruction from a single image!
In Zero-1-to-3, we teach diffusion models to control the camera viewpoint, which enables novel view synthesis applications.
Website: https://t.co/qiJmB2Kfrl
Paper: https://t.co/CaDXVloP1v
π§΅(1/n)
Specifically, we finetune Stable Diffusion, which already has useful 2D image priors thanks to being trained on billion-scale data.
This pipeline allows us to successfully achieve strong zero-shot performance on objects with complex geometry and artistic styles.
π§΅(3/n)
Excited to share our #CVPR2023 paper on tracking with object permanence in video!
In TCOW, we propose both a model and a dataset for localizing objects regardless of their visibility.
Website: https://t.co/C7rrrSQlBN
Paper: https://t.co/GZOYsjijlM
π§΅ (1/n)
Excited to share our #CVPR2023 paper on tracking with object permanence in video!
In TCOW, we propose both a model and a dataset for localizing objects regardless of their visibility.
Website: https://t.co/C7rrrSQlBN
Paper: https://t.co/GZOYsjijlM
π§΅ (1/n)
P.S. Also check out our earlier related work on Revealing Occlusions with 4D Neural Fields (https://t.co/Y4RothutYA)! This paper is essentially about video-to-4D generation, but requires depth input. On the other hand, we demonstrate that TCOW works in the wild too.
π§΅ (7/7)
Excited to share our #CVPR2023 paper on tracking with object permanence in video!
In TCOW, we propose both a model and a dataset for localizing objects regardless of their visibility.
Website: https://t.co/C7rrrSQlBN
Paper: https://t.co/GZOYsjijlM
π§΅ (1/n)
Visit our project webpage at https://t.co/C7rrrSQlBN for many more results, as well as links to the datasets, code, and pretrained models!
Joint work with @ptokmakov, Simon Stent, Jie Li, and @cvondrick.
π§΅ (6/n)