Friday 10:30AM Spotlight Poster#132
SAP3D is a system for 3D perception which improves as the number of views increases. Unposed few-view NVS/3D by test-time adaptation of diffusion NVS models.
[Poster alert - Friday morning #132]
Most of the 3D reconstruction and novel view synthesis systems take a fixed set of images as input. They can not improve when more unposed images are provided, even if they reveal new information about the 3D structure.
Thrilled to be presenting "Humans in 4D" at #ICCV.
We reconstruct humans in the wild with remarkable robustness and accuracy! That too, with a very simple transformerized architecture.
Thursday afternoon *Poster 76* in the South Foyer!
🌎 https://t.co/zvftvJoGj0
Humans in 4D, w/ @goelshbhm@geopavlakos@jathushan@JitendraMalikCV
✨HMR2.0💃🏻 + better PHALP, it's super robust! Give it a try:
🌎https://t.co/0YD6g5RfGA
🤗https://t.co/KiovKjE8WZ
*Thursday afternoon poster*
@soalber00590884@akanazawa@JitendraMalikCV@Avataar Our demo script (https://t.co/RhEOpM2XH0) for running HMR2.0 on an image folder supports saving meshes. For an example, see this cell of the collab: https://t.co/luBVMKEClT
Happy to share that I’m PhD-done! Berkeley was such an enriching and cherishable experience, special thanks to my advisors @akanazawa and @JitendraMalikCV for supporting me throughout.
Next, super excited to join @Avataar, where we’ll continue pushing the boundaries of 3D tech.
Big questions discussed and debated:
- When will the Midjourney moment for 3D happen?
- How will AI NPC's transform gameplay?
- What will AI Native games look like?
- How should studio's think about copyright?
- What new gaming business models fall out from AI?
Releasing 🌟nerfstudio🌟, a plug-and-play python library to easily create your own NeRFs!
@nerfstudioteam is a contributor friendly open-source repo with a realtime web viewer that makes it easy to make cool videos 📽️
https://t.co/zw0ZPqIZ4c
https://t.co/NZM8GPjNrF #nerfstudio 1/
Happy to announce DreamFusion, our new method for Text-to-3D!
https://t.co/4xI2VHcoQW
We optimize a NeRF from scratch using a pretrained text-to-image diffusion model. No 3D data needed!
Joint work w/ the incredible team of @BenMildenhall@ajayj_@jon_barron#dreamfusion
🎊 New paper!
We train loss-conditional diffusion models of *neural network checkpoints* that learn to optimize.
w/ Radosavovic, Brooks, Efros, Malik
proj: https://t.co/R58P8soLpy
arxiv: https://t.co/9AzqV0koIz
code: https://t.co/1nanm8v2WI
For recognition to graduate from 2D to 3D we need new tasks.
Omni3D is a large benchmark for 3D object detection in the wild with over 230k images & 3 million instances and draws from existing datasets.
Our model trained on Omni3D generalizes to new domains like AR captures
Vision datasets (i.e. ImageNet) are usually collected once for a fixed task. But how do we know the choice of camera intrinsics, tasks, etc. is a good one?
Our ICCV paper on “steerable datasets” addresses this problem and gets 'human-level' surface normal preds along the way (⅓)