rodo @rodosingh23 - Twitter Profile

rodosingh23 retweeted

Hitesh Kandala @HiteshK03

4 months ago

@rodosingh23 @AIatAMD @AMDIndia @AMD

0

1

0

55

rodosingh23 retweeted

Hitesh Kandala @HiteshK03

4 months ago

Our paper DUET-VLM: Dual-stage Unified Efficient Token Reduction for VLM Training and Inference is accepted to CVPR 2026. We propose a unified framework for training-aware visual token reduction in VLMs. 📄 https://t.co/4ayO777DAt 🔗 https://t.co/O4lfNKmAHw #CVPR2026

HiteshK03's tweet photo. Our paper DUET-VLM: Dual-stage Unified Efficient Token Reduction for VLM Training and Inference is accepted to CVPR 2026.

We propose a unified framework for training-aware visual token reduction in VLMs.

📄 https://t.co/4ayO777DAt
🔗 https://t.co/O4lfNKmAHw

#CVPR2026 https://t.co/7QT3Gtm6T7

1

33

9

13

2K

rodosingh23 retweeted

Makarand Tapaswi @MakarandTapaswi

6 months ago

Our paper was desk rejected @NeurIPSConf! Even before the main deadline! "Non-academic title and abstract" 🙈 Thankfully, @SIGGRAPHAsia was around the corner and a perfect fit for our work on improving robustness of multi-subject multi-attribute layout-guided T2I models! 🧵1/9

5

93

11

45

27K

rodosingh23 retweeted

Makarand Tapaswi @MakarandTapaswi

6 months ago

This is our group's first work in generation. The @SIGGRAPHAsia team, shepherd, and reviewers helped us a lot! Thank you. Shivank @x47bsaltydog and Dhruv @sridhruv led this well and taught me a lot about diffusion models in the process! 🎉 Paper: https://t.co/ZlTThPX1H7 🏁 9/9

1

4

1

2

1K

Who to follow

Om Gupta

@om_n_gupta

3rd year @UTAstronomy grad student @UTAustin BS-MS Physics 2021 @iiserkol @cessi_iiserkol Fast Radio Burst research & ❤️ astrophysics. Learning ballroom dance.

Makarand Tapaswi

@MakarandTapaswi

Principal ML Scientist @WadhwaniAI | Assistant Professor @IIIT_Hyderabad | Opinions my own

Darshan Singh @ CVPR

@thought2vec

Research @GoogleDeepMind | @iiit_hyderabad

rodo @rodosingh23

8 months ago

@NainaChaturved8 @CVPR Ghosh this is truly exhausting😓

0

166

rodosingh23 retweeted

IIIT Hyderabad @iiit_hyderabad

12 months ago

A large contingent from IIITH’s Computer Vision Lab participated at the Conference on Vision and Pattern Recognition (CVPR) last month in Nashville. Read on about the cutting edge research that was presented and why it’s a big deal in the vision circles. https://t.co/RTFb0YEENe

iiit_hyderabad's tweet photo. A large contingent from IIITH’s Computer Vision Lab participated at the Conference on Vision and Pattern Recognition (CVPR) last month in Nashville. Read on about the cutting edge research that was presented and why it’s a big deal in the vision circles. https://t.co/RTFb0YEENe https://t.co/KQ7BKiJAU7

0

58

11

5

4K

rodosingh23 retweeted

Zeeshan khan @zeeshank95

about 1 year ago

Can text-to-image Diffusion models handle surreal compositions beyond their training distribution? 🚨 Introducing ComposeAnything — Composite object priors for diffusion models 📸 More faithful, controllable generations — no retraining required. 🔗https://t.co/Q7yzACKZjZ 1/9

zeeshank95's tweet photo. Can text-to-image Diffusion models handle surreal compositions beyond their training distribution?

🚨 Introducing ComposeAnything — Composite object priors for diffusion models
📸 More faithful, controllable generations — no retraining required. 🔗https://t.co/Q7yzACKZjZ
1/9 https://t.co/61IK8yovGh

2

24

9

5

2K

rodosingh23 retweeted

AMD

@AMD

about 1 year ago

AMD has announced the open-sourcing of Instella, a fully open 3-billion-parameter LMs trained on AMD Instinct MI300X GPUs. These models aim to promote collaboration in the AI community by providing access to model weights, configurations, and more. See more from @phoronix ⬇️

7

364

40

18

30K

rodosingh23 retweeted

Makarand Tapaswi @MakarandTapaswi

almost 2 years ago

Thanks to the organizers (@davmoltisanti +) for an opportunity to share my thoughts at the amazing @CVPR Workshop "What Is Next in Video Understanding" https://t.co/je25r8fwbQ Slides summarizing some of our work on video and open challenges here: https://t.co/KuyRJ1TpPi

MakarandTapaswi's tweet photo. Thanks to the organizers (@davmoltisanti +) for an opportunity to share my thoughts at the amazing @CVPR Workshop "What Is Next in Video Understanding" https://t.co/je25r8fwbQ

Slides summarizing some of our work on video and open challenges here: https://t.co/KuyRJ1TpPi https://t.co/G6iI76SZki

2

86

11

24

3K

rodosingh23 retweeted

Manu Gaur

@gaur_manu

over 1 year ago

🚨 Introducing Detect, Describe, Discriminate: Moving Beyond VQA for MLLM Evaluation. Given an image pair, it is easier for an MLLM to identify fine-grained visual differences during VQA evaluation than to independently detect and describe such differences 🧵(1/n):

gaur_manu's tweet photo. 🚨 Introducing Detect, Describe, Discriminate: Moving Beyond VQA for MLLM Evaluation.

Given an image pair, it is easier for an MLLM to identify fine-grained visual differences during VQA evaluation than to independently detect and describe such differences 🧵(1/n): https://t.co/JJJT3V0X1G

3

42

17

19

9K

rodo @rodosingh23

almost 2 years ago

Samsung needs to address this oversight for a better customer experience. #GalaxyBuds2Pro #CustomerService #ProductDefect

0

100

rodo @rodosingh23

almost 2 years ago

@SamsungIndia I recently faced a frustrating experience with my Galaxy Buds 2 Pro from Samsung India. Despite being told multiple times by customer service that the product is compatible with iOS, I discovered that it lacks full functionality when paired with my iPhone. #Samsung

2

0

147

rodo @rodosingh23

almost 2 years ago

The product's compatibility issues were not clearly mentioned, leading me to believe this is a defect. If the earbuds can only function as basic Bluetooth devices without access to essential features, shouldn't that be considered a flaw?

1

0

105

rodosingh23 retweeted

Makarand Tapaswi @MakarandTapaswi

about 2 years ago

Recaps are a wonderful narrative sequence that spark viewers' memories 🧠 to follow the current episode. In our latest @CVPR 2024 work with @rodosingh23 and @sridhruv, we leverage recaps to train models that generate story summaries for TV episodes. 📺 🧵1/8

MakarandTapaswi's tweet photo. Recaps are a wonderful narrative sequence that spark viewers' memories 🧠 to follow the current episode. In our latest @CVPR 2024 work with @rodosingh23 and @sridhruv, we leverage recaps to train models that generate story summaries for TV episodes. 📺
🧵1/8 https://t.co/GsLYdQ31Jb

2

53

4

5

4K

rodo @rodosingh23

about 2 years ago · Seattle

@shiba14857 @MakarandTapaswi @CVPR @sridhruv Hi Manash we open sourced the code and models, please check ✔️.

0

1

0

52

rodosingh23 retweeted

Makarand Tapaswi @MakarandTapaswi

about 2 years ago

@iiit_hyderabad @rodosingh23 @sridhruv @HRaajesh @dnaveenr @zeeshank95 Presenting both papers tomorrow morning @CVPR (Thu AM). Meet us at poster 385 and 422 if you want to see how we continue to make improvements in machine understanding of stories!

0

10

2

0

717

rodosingh23 retweeted

Makarand Tapaswi @MakarandTapaswi

about 2 years ago

Given multiple short movie clips, can models generate coherent identity-aware descriptions? 🤔 Turns out, this is a complicated task as it requires linking identities and what they are doing over time. Our @CVPR 2024 work improves this: https://t.co/K4TQFm6GYg 🧵1/7

4

59

14

6K

rodosingh23 retweeted

Hamel Husain

@HamelHusain

about 2 years ago

RAG is another example of bloated jargon. This should just be "provide relevant context"

82

721

49

82

95K

rodosingh23 retweeted

Lior Alexander

@LiorOnAI

about 2 years ago

OpenAI just announced "GPT-4o". It can reason with voice, vision, and text. The model is 2x faster, 50% cheaper, and has 5x higher rate limit than GPT-4 Turbo. It will be available for free users and via the API. The voice model can even pick up on emotion and generate emotive voice.

72

1K

258

409

485K

rodo

@rodosingh23

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users