High-resolution image and video generation is hitting a wall because attention in DiTs scales quadratically with token count. But does every pixel need to be in full resolution?
Introducing Foveated Diffusion: a new approach for efficient diffusion-based generation that allocates compute where it matters most.
1/7🧵
Excited to share Generated Reality! We propose a world model for XR – transforming your hands and head movement into an interactive, generative visual experience 🫳🫴 👀
🌍🥽 See more about our work below:
📢Introducing Generated Reality📢
A world model for XR that turns your tracked hand and head poses into an interactive, generative video experience. Take world models to the next level by interacting with the world using your own body!
🔗https://t.co/bgDDO8Laix
1/4