Agneet Chatterjee @agneet42 - Twitter Profile

Pinned Tweet

9 months ago

As video generative models reshape how we consume visual media, we ask: what controls are essential for them to meaningfully support professional workflows? Stable Cinemetrics will be presented at NeurIPS 2025. Please visit our (interactive) webpage for detailed findings, insights and more : https://t.co/3oJSBlYQ3z In this project, we had the unique opportunity to receive feedback from an Academy Award winner. While projects may come and go, that is one experience I will always cherish!

Varun Jampani @jampani_varun

9 months ago

🎬 Introducing Stable Cinemetrics, to be presented at NeurIPS 2025. We present the first taxonomy of professional controls to systematically study and control video generative models through the lens of filmmaking. Interactive webpage with paper link: https://t.co/Eh4Hw3hBZl 🧵


agneet42 retweeted

Zhiqiu Lin

@ZhiqiuLin

about 2 months ago

Before AI can generate professional videos, it needs to see like a professional. We spent a year with 100+ content creators teaching AI to describe video like a filmmaker would. Introducing CHAI: Critique-based Human-AI Oversight for Building a Precise Video Language [CVPR'26 Highlight, Top 3%]. Try prompting a video generator for a dolly zoom, dutch angle, point of view, or camera roll. Most fall back to the same bland defaults: a push-in, a level shot, a third-person view. Why? These techniques require a language of cinema that current models rarely speak. We built that language: 1️⃣ Precise specification: 5-aspect structured captions co-designed with professional cinematographers covering subject, scene, motion, spatial, and camera dynamics 2️⃣ Scalable oversight: LLMs draft captions, humans critique what's wrong and how to fix it 3️⃣ Post-training recipes: Qwen3-VL-8B surpasses Gemini-3.1 and GPT-5 4️⃣ Video generation: fine-tuned Wan follows 400-word cinematic prompts with precise control Here's how each works 🧵 Work led by CMU and Harvard with @chancharikm, @du_yilun, and @RamananDeva. 📄 Paper: https://t.co/wCwEtvrntM 🌐 Site: https://t.co/oAAQklGrfF


Agneet Chatterjee @agneet42

7 months ago

Amidst today's tsunami of papers, it's quietly validating as an early career researcher to see your past work cited by multiple reviewers in their @iclr_conf reviews!


agneet42 retweeted

Varun Jampani @jampani_varun

8 months ago

Looking forward to ICCV’25! 3 workshop talks, 1 panel and 7 papers. Keynote talk on “Crafting Video Diffusion: Precise Inputs and Rich Outputs” in 3 workshops: https://t.co/LekqWM8Xil https://t.co/jdNDbjVSxY https://t.co/vMJqiph7M3 Panel discussion at https://t.co/WYyrnUMCR0



Agneet Chatterjee @agneet42

over 1 year ago

@RisingSayak Not an expert in preference datasets: But is there also merit in scoring images? Chosen/Rejected is a form of hard scoring whereas a soft score (e.g. between 1-10) might also be beneficial to the community (and for models) ?


Agneet Chatterjee @agneet42

over 1 year ago

@Dazitu_616 Very cool work! Is there an ETA on the code release?


Agneet Chatterjee @agneet42

over 1 year ago

Feels good to be recognized!

EMNLP 2026 @emnlpmeeting

over 1 year ago

We're kicking off the awards session at #EMNLP2024 by announcing our (many) **Outstanding Reviewers**!


Agneet Chatterjee @agneet42

over 1 year ago

@sourajitCS @trgokhale @wacv_official Congratulations to both you and @trgokhale !


agneet42 retweeted

Sayak Paul

@RisingSayak

over 1 year ago

Looking forward to our SPRIGHT poster session at #ECCV2024 today with @agneet42. It's #213 and we will be there from 4:30 PM to 6:30 PM (CEST). But I am here at ECCV today almost for the day. Looking forward to chats 🇮🇹 https://t.co/DY4DPnEkVZ


Agneet Chatterjee @agneet42

over 1 year ago

Looking forward to @eccvconf and Milan! I'll be presenting 2 papers: REVISION (https://t.co/rr6i97qiZB): 10/1, 10.30-12.30, #140 SPRIGHT (https://t.co/jFt26Eu4Hm) w @RisingSayak: 10/2, 16.30-18.30, #213 Let's talk generative models -- or even better, risotto recommendations!


agneet42 retweeted

'YZ' Yezhou Yang (杨叶舟) @prof_yz

almost 2 years ago

Dear friends, from Sun Sep 29th, @ApgAsu ers will present 👇papers, DC poster, organize tutorial and workshop paper @eccvconf with a focus on semantically precise #T2I, secure #GenAI and a survey of Recent Event Camera Innovations. Please chat with our talented members! 🙏🤠


Agneet Chatterjee @agneet42

almost 2 years ago

Joint work with @FPSLuozi @trgokhale @prof_yz @cbaral Both REVISION and SPRIGHT (https://t.co/yIiP6grOth) will be presented at @eccvconf and are efforts towards improving the 3D understanding of current vision-language models.


Agneet Chatterjee @agneet42

almost 2 years ago

While we conduct most of our experiments on Stable Diffusion 1.4 and 1.5, our pipeline can be extended to incorporate larger models, for increased control over backgrounds, colors or style.


Agneet Chatterjee @agneet42

almost 2 years ago

We also develop the RevQA benchmark to evaluate the spatial reasoning abilities of multimodal LLMs. RevQA is a question-answering benchmark with 16 diverse question types and their adversarial variations, which consist of negations, conjunctions, and disjunctions.


Agneet Chatterjee @agneet42

almost 2 years ago

REVISION elevates the ability of existing T2I models to generate spatially accurate images. Given an input prompt, we generate a synthetic image using REVISION, which is used as additional guidance during image generation.


Agneet Chatterjee @agneet42

almost 2 years ago

REVISION parses a prompt into assets (objects) along with the spatial relationship between them and synthesizes a symbolic image in Blender, placing the respective object assets at coordinates corresponding to the parsed spatial relationship.
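
The parse-then-place idea described in this tweet can be illustrated with a small sketch. This is not the REVISION implementation: the relation vocabulary, the unit offsets, the regular expression, and the names `RELATION_OFFSETS`, `Layout`, and `parse_prompt` are all assumptions made for illustration, and the Blender rendering step is left out.

```python
import re
from typing import NamedTuple, Tuple

Vec3 = Tuple[float, float, float]

# Hypothetical offsets (x, y, z) of the subject relative to the reference object.
# The axis convention and unit distances are assumptions, not values from REVISION.
RELATION_OFFSETS = {
    "to the left of": (-1.0, 0.0, 0.0),
    "to the right of": (1.0, 0.0, 0.0),
    "above": (0.0, 0.0, 1.0),
    "below": (0.0, 0.0, -1.0),
    "in front of": (0.0, -1.0, 0.0),
    "behind": (0.0, 1.0, 0.0),
}


class Layout(NamedTuple):
    subject: str
    relation: str
    reference: str
    subject_xyz: Vec3
    reference_xyz: Vec3


def parse_prompt(prompt: str) -> Layout:
    """Split a two-object prompt into assets, a relation, and coordinates."""
    text = prompt.lower().strip().rstrip(".")
    article = r"(?:(?:a|an|the) )?"
    for relation, offset in RELATION_OFFSETS.items():
        pattern = rf"{article}(.+?) {re.escape(relation)} {article}(.+)"
        match = re.fullmatch(pattern, text)
        if match:
            subject, reference = match.group(1), match.group(2)
            # The reference object sits at the origin; the subject is shifted
            # by the offset that corresponds to the parsed relation.
            return Layout(subject, relation, reference, offset, (0.0, 0.0, 0.0))
    raise ValueError(f"no supported spatial relation found in: {prompt!r}")


layout = parse_prompt("A cat to the left of a dog")
print(layout)
```

In a full pipeline, coordinates like these would be handed to a renderer; in Blender's Python API, for instance, they could be assigned to each object's `location` before rendering the symbolic guidance image.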


Agneet Chatterjee @agneet42

almost 2 years ago

Happy to share our latest work, REVISION, which will be presented at #ECCV2024. Project Page : https://t.co/tI9My1aBP9 With REVISION, we combine the controllability of graphics rendering engines and the photorealism of T2I models to improve spatial fidelity. 🧵


agneet42 retweeted

Brian Bartoldson

@bartoldson

almost 2 years ago

Adversarial attacks jailbreak models. Existing defenses don’t even safeguard simple models. In our ICML 2024 paper on "Adversarial Robustness Limits”, we show how scaling helps defense, up to the point where attacks start to fool humans (take quiz: https://t.co/mmLDnvTErY). 🧵1/n


agneet42 retweeted

Sayak Paul

@RisingSayak

almost 2 years ago

Very pleased to see this accepted at #ECCV2024. See you in Milan to talk about diffusion models and other adjacent areas of work. 🇮🇹 Kudos to our truly global team 😉


Agneet Chatterjee @agneet42

almost 2 years ago

@saakur3 Thanks Sathya!


Agneet Chatterjee @agneet42

almost 2 years ago

Happy to share that 2/2 papers are accepted to #ECCV2024, with 1 of them being SPRIGHT. 👇 Congratulations to all the co-authors. See you in Milan!

AK

@_akhaliq

about 2 years ago

Getting it Right Improving Spatial Consistency in Text-to-Image Models One of the key shortcomings in current text-to-image (T2I) models is their inability to consistently generate images which faithfully follow the spatial relationships specified in the text prompt. In

