Our recent finding on Diffusion Alignment: a reward model in pixel space can be easily transferred to score noisy diffusion latents directly — at small finetuning cost, via stitching.
This makes Faster & Better for both Training & Inference Alignment.
Meet StitchVM👇
1/
Want to leverage the power of SOTA 3D models like VGGT & Video LDMs for 3D generation? Now you can! 🚀
Introducing VIST3A — we stitch pretrained video generators to 3D foundation models and align them via reward finetuning.
📄 https://t.co/MctMyuDev4
🌐 https://t.co/XQMW4mfjWI
🚀 Just released: FLAIR – a new training-free approach to solving inverse problems using flow-matching models!
🎯 Try it live: https://t.co/yKYafCQa76
📚 Learn more: https://t.co/7jwB0gfUxm
We present Thera🔥: The new SOTA arbitrary-scale super-resolution method with built-in anti-aliasing. Our approach introduces Neural Heat Fields, which guarantee exact Gaussian filtering at any scale, enabling continuous image reconstruction without extra computational cost.
A Variational Perspective on Generative Protein Fitness Optimization
Uses an approach to optimize protein fitness in latent space. Famous AAV dataset by Bryant used.
P: https://t.co/wEwER8goZt
Introducing 🛹 RollingDepth 🛹 — a universal monocular depth estimator for arbitrarily long videos! Our paper, “Video Depth without Video Models,” delivers exactly that, setting new standards in temporal consistency. Check out more details in the thread 🧵