Arnav Das @arnaved - Twitter Profile

arnaved retweeted

11 days ago

personal news: i've joined Elorian as Chief Reasoning Architect. multimodal AGI is the most critical frontier as we move from the era of chatbots to coding agents to models that reason and act over the physical world. i'm really excited to design natively visual models across thinking, agents, architectures, and the systems stack with the amazing team at Elorian. i wish the best to everyone at xAI & SpaceX — driving posttraining was a unique experience with so many memorable stories. all the best to the team, and to Elon.

49

350

14

52

81K

arnaved retweeted

Elorian AI @ElorianAI

11 days ago

We’re thrilled to welcome @dustinvtran to Elorian as our Chief Reasoning Architect. After leading post-training at xAI and contributing to Gemini at Google DeepMind, Dustin is joining Elorian to help build the next generation of visual reasoning models. Excited for what's ahead 🚀

0

23

5

0

2K

arnaved retweeted

Jeff Bilmes @jbilmes

about 1 month ago

What makes a dataset valuable? And when is "more data" not the same as "better data" in machine learning and AI? Read more to find out: https://t.co/Q0wPOtfm5d

0

6

3

230

Arnav Das @arnaved

2 months ago

Huge thanks to my amazing collaborators @BhattGantavya @Sahil1V and the whole team! Hope to see everyone at the poster! 🙏

0

1

0

101

Arnav Das @arnaved

2 months ago

Thrilled to present our paper: Matched Data, Better Models: Target Aligned Data Filtering with Sparse Autoencoders at ICLR today 🎉! We use SAEs to filter pretraining datasets for CLIP-style models. 📍 Poster Session 6, Pavilion 4 | 3:15–5:45pm today 📄 https://t.co/a7yyHn0MR7

arnaved's tweet photo. Thrilled to present our paper: Matched Data, Better Models: Target Aligned Data Filtering with Sparse Autoencoders at ICLR today 🎉! We use SAEs to filter pretraining datasets for CLIP-style models.
📍 Poster Session 6, Pavilion 4 | 3:15–5:45pm today
📄 https://t.co/a7yyHn0MR7 https://t.co/xWJUQDxppL

2

31

6

12

2K

Arnav Das @arnaved

2 months ago

Compared to all methods, our approach nearly matches the SOTA method but requires 5x fewer GPU hours!

1

0

104

arnaved retweeted

Jifan Zhang @jifan_zhang

8 months ago

New research paper with Anthropic and Thinking Machines AI companies use model specifications to define desirable behaviors during training. Are model specs clearly expressing what we want models to do? And do different frontier models have different personalities? We generated thousands of scenarios to find out. 🧵

jifan_zhang's tweet photo. New research paper with Anthropic and Thinking Machines

AI companies use model specifications to define desirable behaviors during training. Are model specs clearly expressing what we want models to do? And do different frontier models have different personalities?

We generated thousands of scenarios to find out. 🧵

62

1K

169

1K

321K

arnaved retweeted

Reubennnnnnnnnnnnn @ReubenNarad

12 months ago

Whoa... Grok 4 beats o3 on our never-released benchmark: HumorBench, a non-STEM reasoning benchmark that measures humor comprehension. The task is simple: given a New Yorker Caption Contest cartoon and caption, explain the joke.

ReubenNarad's tweet photo. Whoa... Grok 4 beats o3 on our never-released benchmark: HumorBench, a non-STEM reasoning benchmark that measures humor comprehension. The task is simple: given a New Yorker Caption Contest cartoon and caption, explain the joke. https://t.co/Hxolf8QKjB

2

11

5

2

4K

Arnav Das @arnaved

about 1 year ago

8/8 Huge thanks to @BhattGantavya, @lilly8sharma, @Sahil1V, and Jeff Bilmes

0

1

0

66

Arnav Das @arnaved

about 1 year ago

1/8 🚀 How can retrieval augmentation be made both relevant and non-redundant for few-shot adaptation? I'm excited to introduce COBRA. Catch our poster at #CVPR25 (ExHall D, Poster #450) on Sat 14 Jun, 5–7 p.m. CDT: https://t.co/dsdH6PJTHj

arnaved's tweet photo. 1/8 🚀 How can retrieval augmentation be made both relevant and non-redundant for few-shot adaptation? I'm excited to introduce COBRA. Catch our poster at #CVPR25 (ExHall D, Poster #450) on Sat 14 Jun, 5–7 p.m. CDT: https://t.co/dsdH6PJTHj https://t.co/cQ0kXTOHVQ

1

9

3

1

1K

Arnav Das @arnaved

about 1 year ago

7/8 Despite its richer objective, COBRA incurs negligible extra computation at retrieval time and scales effortlessly to pools of hundreds of millions of images.

1

0

54

arnaved retweeted

Sahil Verma @Sahil1V

about 1 year ago

🚨 New Paper! 🚨 Guard models slow, language-specific, and modality-limited? Meet OmniGuard that detects harmful prompts across multiple languages & modalities all using one approach with SOTA performance in all 3 modalities!! while being 120X faster 🚀 https://t.co/r6DGPDfwle

Sahil1V's tweet photo. 🚨 New Paper! 🚨
Guard models slow, language-specific, and modality-limited?

Meet OmniGuard that detects harmful prompts across multiple languages & modalities all using one approach with SOTA performance in all 3 modalities!! while being 120X faster 🚀

https://t.co/r6DGPDfwle https://t.co/qWk4RXb1S3

1

79

39

23

15K

Arnav Das

@arnaved

Last Seen Users on Sotwe

Trends for you

Most Popular Users