Very excited to get this out. "CAT3D: Create Anything in 3D with Multi-View Diffusion Models"
Text->3D, image->3D, and few-view->3D, all in one package. SOTA few-view results, beautiful text results, trains in 1 minute, and renders at 60fps in a browser. https://t.co/d8oEXaBZn5
Spatial Intelligence is a critical piece of the AI puzzle. This is my 2024 TED talk about the journey from evolution to AI, on how we build Spatial Intelligence. "Sight turned into insight; Seeing became understanding; Understanding led to action. All these gave rise to intelligence." 1/
https://t.co/qGiVCrJUxJ?