I'll be presenting some of our recent works at the AI for Visual Arts (Room 4AB, 9am), Spatial Intelligence for Cultural Heritage (Room 708, 2:15pm) and the What is Next in Multimodal Foundation Models? (Room 3A, 4pm) workshops at @CVPR tomorrow. Please join if you're around!
Cycle consistency is a powerful and cool idea.
Here we use it to learn image decomposition: decompose an image into components, recombine them, and enforce consistency both ways.
A cool training principle with surprisingly strong results.
🚀 #CVPR2026 Accepted!🚀
Thrilled to share that my first-authored undergraduate paper, “Emergent Extreme-View Geometry in 3D Foundation Models,” has been accepted to CVPR 2026! 🎉
Looking forward to seeing many of you in Denver! ✈️
Project page: https://t.co/taqYaLWKfR
This is a great program for folks interested in postdocs @Cornell. For vision/graphics folks interested in NYC, @ElorHadar, @andrewhowens, and I are potentially recruiting a joint postdoc. Please apply!
.@Cornell is recruiting for multiple postdoctoral positions in AI as part of two programs: Empire AI Fellows and Foundational AI Fellows. Positions are available in NYC and Ithaca.
Deadline for full consideration is Nov 20, 2025!
https://t.co/HHzyB7vNCB
Very excited to introduce our 🌺#ICCV2025 highlight🌺 paper “BlendedPC”!
This work sets a new standard in localised semantic editing of point clouds, using purely text as guidance.
Project page: https://t.co/7waIFnDhVM
Wanna hear more? 👇🧵
💥New preprint! WildCAT3D uses tourist photos in-the-wild as supervision to learn to generate novel, consistent views of scenes like the one shown below. Work from Meta AI internship, with @davnov123 @filippos_kok@ElorHadar@t_monnier (1/5)
I’m looking for PhD students for the 2026 cycle! If you’re @CVPR and think we might be a good fit, come say hi or send me an email with [CVPR2025] in the subject line so that I don’t miss it. #CVPR2025
Excited to share that our latest paper: "InstanceGen: Image Generation with Instance Level Instructions" was recently accepted to #SIGGRAPH2025!
InstanceGen tackles the problem of generating images for complex multi-object prompts
https://t.co/6V5ZdTaEza
👇🧵[1/7]
That's a wrap for #ICLR2025! See you all next year in Brazil! Please all welcome @BharathHarihar3 as the new Senior Program Chair! (With @cvondrick continuing on as General Chair.)
Let it Snow! Animating Static Gaussian Scenes With Dynamic Weather Effects
Contributions:
• A novel framework for incorporating physically-based global dynamic effects into static 3D Gaussian scenes.
• Ensuring realistic scene interaction and collision effects by producing effect-specific appearance parameters and introducing collision handling techniques.
• State-of-the-art results for various dynamic weather effects, demonstrating significant improvements over existing approaches.
I am looking for a postdoc. A serious-looking call coming soon, but this is to get it going. Topics include (but not limited to): LLMs (🫢!), multimodal LLMs, interaction+learning, RL, intersection with cogsci, ... see our work to get an idea:
https://t.co/JKBbzfrGQU
Plz RT 🙏
Cuneiform at #ICLR2025! ProtoSnap finds the configuration of wedges in scanned cuneiform signs for downstream applications like OCR. A new tool for understanding the ancient world!
https://t.co/cfFvTu9U0G
h/t Rachel Mikulinsky @ShGordin@ElorHadar and all collaborators.
🧵👇
'Mitigating Open-Vocabulary Caption Hallucinations' is accepted to EMNLP 2024! 🎉 https://t.co/LeEFLyKI6Y
TL;DR
OpenCHAIR: an open-vocabulary hallucination benchmark
MOCHa: RLAIF framework for hallucination reduction
A great coop with @moranynk@MorrisAlper@RGiryes@ElorHadar