Even the SOTA VideoLLMs see videos in 1 fps, and you CANNOT perceive fine-grained motion 💃 with this frequency 🥲
📣 Presenting Video Parallel Scaling (VPS), an inference-time strategy that lets VideoLLMs see more frames by scaling compute in the parallel-axis 🤩
🚨Introducing VideoRFSplat📽️, a feed-forward text-to-3DGS generative model with high-quality scene-level results without post-optimization (e.g. SDS)
Led by collaborators at EverEx AI - @gohyojun3, @bypark___, @namhyelin99, Byung-Hoon
https://t.co/jfzlfejO8A
A 🧵 👇
1/n
3D consistent videos are hard to generate 🙁
What if we could steer them to be consistent during generation?
Introducing SteerX🛞, a plug-and-play sampling method that works with *any* video diffusion to make videos physically plausible🤩
w/ @bypark___@gohyojun3@namhyelin99