Darshan Singh @ CVPR @thought2vec - Twitter Profile

about 4 hours ago

Super thrilled to share that our work, SRL-CLIP, won the Best Paper Award at the CV4Smalls #CVPR2026 workshop! 🏆 Huge thanks to my amazing co-authors @zeeshank95 and @MakarandTapaswi . Also, a big thank you to the organizers (@shaydamoezzi and team) for putting together such a thoughtful workshop. With the current trend of "scaling everything," it is incredibly refreshing to focus on data inefficiency as a design opportunity rather than a limitation. Data efficiency is one of the main pillars needed to improve real-world video understanding.

thought2vec's tweet photo. Super thrilled to share that our work, SRL-CLIP, won the Best Paper Award at the CV4Smalls #CVPR2026 workshop! 🏆 Huge thanks to my amazing co-authors @zeeshank95 and @MakarandTapaswi .

Also, a big thank you to the organizers (@shaydamoezzi and team) for putting together such a thoughtful workshop. With the current trend of "scaling everything," it is incredibly refreshing to focus on data inefficiency as a design opportunity rather than a limitation. Data efficiency is one of the main pillars needed to improve real-world video understanding.

Darshan Singh @ CVPR @thought2vec

about 22 hours ago

Hi all! Today I will present our work, SRL-CLIP, a recipe to efficiently adapt CLIP to videos at the CV4Smalls CVPR workshop in Room 102. I have an oral talk scheduled at 2 pm followed by the poster session. Please do stop by! Work done with @zeeshank95 @MakarandTapaswi #CVPR2026

thought2vec's tweet photo. Hi all! Today I will present our work, SRL-CLIP, a recipe to efficiently adapt CLIP to videos at the CV4Smalls CVPR workshop in Room 102. I have an oral talk scheduled at 2 pm followed by the poster session. Please do stop by! Work done with @zeeshank95 @MakarandTapaswi #CVPR2026 https://t.co/JG9JcOEJSB

0

11

2

934

1

13

0

408

Darshan Singh @ CVPR @thought2vec

about 13 hours ago

MINERVA-Cultural is a step toward helping the world build more equitable and diverse VLMs 🌍. Huge thanks to all my co-authors and the team!. Check out the paper here: https://t.co/b6L08Ix1t2 and the dataset here: https://t.co/W3K2me5WS6 #CVPR2026 #AI #ComputerVision (7/7)

0

1

2

0

126

Darshan Singh @ CVPR @thought2vec

about 14 hours ago

Frontier models have become excellent at understanding videos. But what happens when we test them outside the comfort zone of Western, English-centric data? In our #CVPR2026 (Highlight) work, we pushed these models to their limits to see if they can function effectively in diverse global contexts. The results? They are struggling. Work done with @NagraniArsha @skawshik11 @Harman26Singh @dinesh_tewari1 @0xtob @CordeliaSchmid Anelia Angelova @shachi_dave (1/7)

thought2vec's tweet photo. Frontier models have become excellent at understanding videos. But what happens when we test them outside the comfort zone of Western, English-centric data? In our #CVPR2026 (Highlight) work, we pushed these models to their limits to see if they can function effectively in diverse global contexts. The results? They are struggling. Work done with @NagraniArsha @skawshik11 @Harman26Singh @dinesh_tewari1 @0xtob @CordeliaSchmid Anelia Angelova @shachi_dave (1/7)

1

16

5

0

2K

Darshan Singh @ CVPR @thought2vec

about 13 hours ago

This allows us to independently assess each intermediate step of the model's thought process. 🔍 Through our graph-based error tracing, we discovered that roughly 75% of all model failures actually stem from the visual perception of cultural elements. 🤯(6/7)

thought2vec's tweet photo. This allows us to independently assess each intermediate step of the model's thought process. 🔍 Through our graph-based error tracing, we discovered that roughly 75% of all model failures actually stem from the visual perception of cultural elements. 🤯(6/7) https://t.co/p8i0nOnHzY

1

0

1

0

109

Who to follow

Makarand Tapaswi

@MakarandTapaswi

Principal ML Scientist @WadhwaniAI | Assistant Professor @IIIT_Hyderabad | Opinions my own

math ∩ biology ∩ computing, Opinions are my own, RTs are not endorsements.

Darshan Singh @ CVPR @thought2vec

about 22 hours ago

Hi all! Today I will present our work, SRL-CLIP, a recipe to efficiently adapt CLIP to videos at the CV4Smalls CVPR workshop in Room 102. I have an oral talk scheduled at 2 pm followed by the poster session. Please do stop by! Work done with @zeeshank95 @MakarandTapaswi #CVPR2026

0

11

2

934

Darshan Singh @ CVPR @thought2vec

17 days ago

Grateful to be recognized as an Outstanding reviewer at #CVPR2026. Always happy to do my part and give back to the community. This CVPR is very special for me because we also have 1 Main paper (Highlight) and 1 workshop paper (Best paper candidate). See you all in Denver! 🚀

#CVPR2026 @CVPR

17 days ago

We are grateful to all of the 17,491 reviewers who helped make #CVPR2026 possible. We are especially pleased to recognize the following Outstanding Reviewers, whose high-quality reviews (as judged by their Area Chairs) placed them among the top 5% of reviewers.

CVPR's tweet photo. We are grateful to all of the 17,491 reviewers who helped make #CVPR2026 possible. We are especially pleased to recognize the following Outstanding Reviewers, whose high-quality reviews (as judged by their Area Chairs) placed them among the top 5% of reviewers. https://t.co/YjQppx6a8K

5

221

42

29

95K

1

22

0

2K

thought2vec retweeted

Project Aria @Meta

@meta_aria

20 days ago

The Project Aria team is excited to be part of the Third Joint Egocentric Vision (EgoVis) Workshop at #CVPR2026! 👓✨ Don’t miss the latest Project Aria updates as we dive into the future of egocentric perception. Check out the full program of invited speakers below! 📅 June 3, 2026 📍 Colorado Convention Center | Room 704/706 🔗 https://t.co/g1FA6Cx4RU ⚡️@liuziwei7 , @mapo1 , @doughty_hazel

meta_aria's tweet photo. The Project Aria team is excited to be part of the Third Joint Egocentric Vision (EgoVis) Workshop at #CVPR2026! 👓✨

Don’t miss the latest Project Aria updates as we dive into the future of egocentric perception. Check out the full program of invited speakers below!

📅 June 3, 2026
📍 Colorado Convention Center | Room 704/706
🔗 https://t.co/g1FA6Cx4RU

⚡️@liuziwei7 , @mapo1 , @doughty_hazel

0

20

5

10

3K

thought2vec retweeted

Ishaan Watts

@IshaanWatts18

27 days ago

Spending billions to train the "best" base model? You might be optimizing the wrong thing! 🎯 We show that controlling sharpness during mid-training leads to over 35% less forgetting after fine-tuning / quantization... even when the base model itself gets worse. 🧵 Takeaways for pretraining: - Use SAM (Sharpness-Aware-Minimization) in the final steps (~10%) - Try much higher learning rates (yes, even ~10× larger) 1/9

IshaanWatts18's tweet photo. Spending billions to train the "best" base model? You might be optimizing the wrong thing! 🎯

We show that controlling sharpness during mid-training leads to over 35% less forgetting after fine-tuning / quantization... even when the base model itself gets worse.

🧵 Takeaways for pretraining:
- Use SAM (Sharpness-Aware-Minimization) in the final steps (~10%)
- Try much higher learning rates (yes, even ~10× larger)

1/9

31

618

91

441

590K

thought2vec retweeted

Manu Gaur

@gaur_manu

about 2 months ago

Pretrained ViTs like DINOv2 or CLIP are great, but they produce fixed, generic representations that encode the most salient visual concepts (e.g., "cat"). In human vision, prior priming with language changes how people parse an image. We believe visual encoders should do the same 🚨 Introducing Steerable Visual Representations, a new family of visual features you can steer with text towards specific visual concepts.

gaur_manu's tweet photo. Pretrained ViTs like DINOv2 or CLIP are great, but they produce fixed, generic representations that encode the most salient visual concepts (e.g., "cat").
In human vision, prior priming with language changes how people parse an image. We believe visual encoders should do the same
🚨 Introducing Steerable Visual Representations, a new family of visual features you can steer with text towards specific visual concepts.

13

894

135

666

149K

thought2vec retweeted

Harman Singh (in NYC for summer)

@Harman26Singh

3 months ago

Can LLMs Self-Verify? Much better than you'd expect. LLMs are increasingly used as parallel reasoners, sampling many solutions at once. Choosing the right answer is the real bottleneck. We show that pairwise self-verification is a powerful primitive. Introducing V1, a framework that unifies generation and self-verification: 💡 Pairwise self-verification beats pointwise scoring, improving test-time scaling 💡 V1-Infer: Efficient tournament-style ranking that improves self-verification 💡 V1-PairRL: RL training where generation and verification co-evolve for developing better self-verifiers 🧵👇

14

394

66

359

103K

thought2vec retweeted

Makarand Tapaswi @MakarandTapaswi

6 months ago

Our paper was desk rejected @NeurIPSConf! Even before the main deadline! "Non-academic title and abstract" 🙈 Thankfully, @SIGGRAPHAsia was around the corner and a perfect fit for our work on improving robustness of multi-subject multi-attribute layout-guided T2I models! 🧵1/9

5

93

11

45

27K

thought2vec retweeted

Kiana Ehsani

@ehsanik

6 months ago

Researchers consider themselves very successful if they win one test-of-time award (and one is more than enough). Ross @inkynumbers has been winning them nonstop over the past year: CVPR 2024, ICCV 2025, and now NeurIPS 2025, because winning just one was too easy for him! Having known him for many years (first as a climbing partner and then as a colleague), I can’t say I’m surprised. When he sets his mind to something, he perfects it, whether it is making the best vision model, climbing a 5.12d, or continuing the sally-up sally-down push-up challenge until the rest of the team gives up. And to all his collaborators who only worked with him remotely and didn’t get to see him in person every day: you missed out. He is fun to work with but he is even more fun in person. I'm attaching the proof below. I have some true gem videos of his goofy side that I won’t share (saving them for when I need to blackmail him), but here is a photo of Ross pretending to be a lizard under our office sun lamp. Congratulations to Ross and all his co-authors. #NeurIPS2025

ehsanik's tweet photo. Researchers consider themselves very successful if they win one test-of-time award (and one is more than enough). Ross @inkynumbers has been winning them nonstop over the past year: CVPR 2024, ICCV 2025, and now NeurIPS 2025, because winning just one was too easy for him!

Having known him for many years (first as a climbing partner and then as a colleague), I can’t say I’m surprised. When he sets his mind to something, he perfects it, whether it is making the best vision model, climbing a 5.12d, or continuing the sally-up sally-down push-up challenge until the rest of the team gives up.

And to all his collaborators who only worked with him remotely and didn’t get to see him in person every day: you missed out. He is fun to work with but he is even more fun in person. I'm attaching the proof below. I have some true gem videos of his goofy side that I won’t share (saving them for when I need to blackmail him), but here is a photo of Ross pretending to be a lizard under our office sun lamp.

Congratulations to Ross and all his co-authors.
#NeurIPS2025

1

162

12

48

29K

thought2vec retweeted

NeurIPS Conference

@NeurIPSConf

6 months ago

The NeurIPS Test of Time Award recognizes papers that have made lasting contributions to machine learning. For 2025 the award goes to…(https://t.co/HtaOs96F8R) #NeurIPS2025 #NeurIPSanDiego"

NeurIPSConf's tweet photo. The NeurIPS Test of Time Award recognizes papers that have made lasting contributions to machine learning. For 2025 the award goes to…(https://t.co/HtaOs96F8R) #NeurIPS2025 #NeurIPSanDiego" https://t.co/lVlUhTPFpk

8

820

42

72

126K

thought2vec retweeted

Harman Singh (in NYC for summer)

@Harman26Singh

6 months ago

Late life update 🚀 I started my PhD at @UCBerkeley after an incredible time at @GoogleDeepMind. It was exciting to work on Gemini over the past couple of years. These days I am interested in reasoning/improving RL, agents, and diffusion language models. Looking forward to contributing to open science. Also thrilled to be back in the Bay Area. Grateful to mentors, collaborators, and folks who supported me, @partha_p_t @PengchuanZ @nitish_gup @trevorcohn @xiangrenNLP @divy93t @ManishGuptaMG1, Parag Singla, friends, and family. Excited to be at @NeurIPSConf #NeurIPS2025 this week. Looking forward to meeting folks. Feel free to DM if you'd like to chat!

Harman26Singh's tweet photo. Late life update 🚀 I started my PhD at @UCBerkeley after an incredible time at @GoogleDeepMind. It was exciting to work on Gemini over the past couple of years.

These days I am interested in reasoning/improving RL, agents, and diffusion language models. Looking forward to contributing to open science. Also thrilled to be back in the Bay Area.

Grateful to mentors, collaborators, and folks who supported me, @partha_p_t @PengchuanZ @nitish_gup @trevorcohn @xiangrenNLP @divy93t @ManishGuptaMG1, Parag Singla, friends, and family.

Excited to be at @NeurIPSConf #NeurIPS2025 this week. Looking forward to meeting folks. Feel free to DM if you'd like to chat!

23

606

11

132

46K

Darshan Singh @ CVPR @thought2vec

6 months ago

@NithishKannen Wow. This is super cool!!

0

1

0

43

thought2vec retweeted

JB Alayrac @jalayrac

7 months ago

Really proud of what we have achieved with Gemini 3 🚀! The Gemini MM team has worked relentlessly across image 🖼️ and video 🎥 from pre-training to post-training to simply deliver the best multimodal in the world 👏! Looking forward to what you will build🫡!

jalayrac's tweet photo. Really proud of what we have achieved with Gemini 3 🚀!

The Gemini MM team has worked relentlessly across image 🖼️ and video 🎥 from pre-training to post-training to simply deliver the best multimodal in the world 👏!

Looking forward to what you will build🫡! https://t.co/RWsXZa1UkJ

8

218

17

15

33K

Darshan Singh @ CVPR

@thought2vec

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users