Siddartha Devic @sid_devic - Twitter Profile

Pinned Tweet

over 1 year ago

Multicalibration is a fairness notion which requires predictors to be calibrated over subgroups. In work with @dutchhansen (applying to PhDs!), @PreetumNakkiran, and Vatsal Sharan, we empirically ask: when are machine learning models multicalibrated with no additional effort?🧵

sid_devic's tweet photo. Multicalibration is a fairness notion which requires predictors to be calibrated over subgroups.

In work with @dutchhansen (applying to PhDs!), @PreetumNakkiran, and Vatsal Sharan, we empirically ask: when are machine learning models multicalibrated with no additional effort?🧵 https://t.co/lyrXjYITWy

1

45

7

12

8K

Siddartha Devic @sid_devic

5 days ago

@suchenzang In at least one (popular) stylized model, identification is actually more difficult than generation: https://t.co/wqb0nBx7xb. @AnayMehrotra is one expert in this area that I know of!

0

2

0

1

136

sid_devic retweeted

Arthur Liang

@arthliang

about 1 month ago

I’m really excited to be @iclr_conf this week presenting our work on substructure-aware protein modeling! DM me if you want to chat about protein models, baking biological priors into ML architectures, and anything bioML for experimentalists! We built Magneton, a model-agnostic framework that distills decades of curated substructure knowledge into any pretrained encoder. We find that substructure signal is complementary to both sequence and structure as well as helps models generalize to unseen substructure types. Come say hi at our poster on Saturday from 10:30AM-1:00PM @ Pavilion 3 P3-#1006! This work is near and dear to my heart because it’s my first (co)first-author main conference paper from my undergrad. A heartfelt shoutout to my mentors Robert Calef, @manoliskellis, @marinkazitnik for all their support! Website: https://t.co/bvC9WM5lWZ Paper: https://t.co/3hYEbmEc0V

arthliang's tweet photo. I’m really excited to be @iclr_conf this week presenting our work on substructure-aware protein modeling! DM me if you want to chat about protein models, baking biological priors into ML architectures, and anything bioML for experimentalists!

We built Magneton, a model-agnostic framework that distills decades of curated substructure knowledge into any pretrained encoder. We find that substructure signal is complementary to both sequence and structure as well as helps models generalize to unseen substructure types. Come say hi at our poster on Saturday from 10:30AM-1:00PM @ Pavilion 3 P3-#1006!

This work is near and dear to my heart because it’s my first (co)first-author main conference paper from my undergrad. A heartfelt shoutout to my mentors Robert Calef, @manoliskellis, @marinkazitnik for all their support!

Website: https://t.co/bvC9WM5lWZ
Paper: https://t.co/3hYEbmEc0V

1

13

3

4

2K

sid_devic retweeted

Deqing Fu

@DeqingFu

about 1 month ago

New paper: Convergent Evolution: How Different Language Models Learn Similar Number Representations. Language models, classical word embeddings, and even raw token frequencies all develop the same Fourier features for numbers. But only some develop the underlying structure. 🧵

DeqingFu's tweet photo. New paper: Convergent Evolution: How Different Language Models Learn Similar Number Representations.

Language models, classical word embeddings, and even raw token frequencies all develop the same Fourier features for numbers. But only some develop the underlying structure. 🧵

2

108

22

60

45K

Who to follow

Jesse Zhang @ ICRA 2026 ✈️

@Jesse_Y_Zhang

Robotics Postdoc @uwcse w/ A. Gupta, D. Fox. Collab @allen_ai. Prev: PhD from @csatusc. Intern @ NVIDIA, AWS.

Ting-Yun Chang

@CharlotteTYC

PhD student @CSatUSC @nlp_usc

Ameya Godbole

@ameya_godbole1

PhD student @nlp_usc working on generalization and reasoning, prev @UMassAmherst, @iitg (he/him)

sid_devic retweeted

Jesse Zhang @ ICRA 2026 ✈️

@Jesse_Y_Zhang

3 months ago

A reward model that works, zero-shot, across robots, tasks, and scenes? Introducing Robometer: Scaling general-purpose robotic reward models with 1M+ trajectories. Enables zero-shot: online/offline/model-based RL, data retrieval + IL, automatic failure detection, and more! 🧵 (1/12)

8

407

105

234

99K

sid_devic retweeted

Viggie Smalls @Viggie_Smalls93

3 months ago

Los Angeles the coldest place on planet earth right now

288

19K

3K

523

3M

Siddartha Devic @sid_devic

4 months ago

Looks like a super useful tool for practitioners looking to apply multi calibration techniques!!

Lorenzo Perini @LorenzoPerini95

4 months ago

1/6 🧵 Calibration is hard. Multicalibration—fixing errors across every possible subgroup—is usually impossible at scale. Until now. Introducing MCGrad: A production-ready multicalibration library from Meta, accepted at KDD 2026. 🚀 https://t.co/iIxOg8hBIS

1

7

1

2

1K

0

4

1

3

576

Siddartha Devic @sid_devic

6 months ago

@_ghorbani @MishaLaskin @real_ioannis Hi Behrooz, I can't DM you but here's the message I would have sent! Thank you so much for your time.

sid_devic's tweet photo. @_ghorbani @MishaLaskin @real_ioannis Hi Behrooz, I can't DM you but here's the message I would have sent! Thank you so much for your time. https://t.co/rFFK4kNlMz

0

54

Siddartha Devic @sid_devic

6 months ago

@SaleemaAmershi @adamfourney @ASwearngin77874 @bansalg_ @HsseinMzannar @HuaWenyue31539 @w_epperson @ZacharyHuang12 @MayaMurad0 @ecekamar @HosnRafa Hi Saleema, I can't DM you but I would love to chat at neurips! I am a final-year PhD student focused on trustworthy AI / ML, and think that the human-interaction aspect is incredibly important and under-explored. Looking for full-time opportunities in industry. Thank you!

0

122

Siddartha Devic @sid_devic

6 months ago

@aparandehgheibi Hi Ali, would love to chat sometime at Neurips. Am currently looking for full-time opportunities (graduating in the spring). I couldn't DM you, so commenting here!

1

0

89

sid_devic retweeted

Preetum Nakkiran @PreetumNakkiran

7 months ago

LLMs are notorious for "hallucinating": producing confident-sounding answers that are entirely wrong. But with the right definitions, we can extract a semantic notion of "confidence" from LLMs, and this confidence turns out to be calibrated out-of-the-box in many settings (!)

PreetumNakkiran's tweet photo. LLMs are notorious for "hallucinating": producing confident-sounding answers that are entirely wrong. But with the right definitions, we can extract a semantic notion of "confidence" from LLMs, and this confidence turns out to be calibrated out-of-the-box in many settings (!) https://t.co/zcDCTu1ctm

23

579

84

473

51K

Siddartha Devic @sid_devic

7 months ago

@tiancheng_hu This makes sense, I would agree with that hypothesis! Do you have any intuition for why model merging preserves instruction tuning but improves calibration? (I will take a closer look at your paper after I am done with my ICLR reviews haha!)

1

0

128

sid_devic retweeted

Johnny Tian-Zheng Wei @johntzwei

7 months ago

Announcing 🔭✨Hubble, a suite of open-source LLMs to advance the study of memorization! Pretrained models up to 8B params, with controlled insertion of texts (e.g., book passages, biographies, test sets, and more!) designed to emulate key memorization risks 🧵

johntzwei's tweet photo. Announcing 🔭✨Hubble, a suite of open-source LLMs to advance the study of memorization!

Pretrained models up to 8B params, with controlled insertion of texts (e.g., book passages, biographies, test sets, and more!) designed to emulate key memorization risks 🧵 https://t.co/07K2A2uIbv

2

131

41

52

50K

sid_devic retweeted

Aayush Karan

@aakaran31

8 months ago

We found a new way to get language models to reason. 🤯 No RL, no training, no verifiers, no prompting. ❌ With better sampling, base models can achieve single-shot reasoning on par with (or better than!) GRPO while avoiding its characteristic loss in generation diversity.

74

2K

249

2K

277K

Siddartha Devic @sid_devic

8 months ago

@iclr_conf we only get two weeks to submit five reviews? Surely this is a more accelerated timeline than usual, no?

0

4

0

153

sid_devic retweeted

Yatong Chen @YatongChen

9 months ago

We (Moritz Hardt, @walesalaudeen96,@joavanschoren) are organizing the Workshop on the Science of Benchmarking & Evaluating AI @EurIPSConf 2025 in Copenhagen! 📢 Call for Posters: https://t.co/jeXRNDexuX 📅 Deadline: Oct 10, 2025 (AoE) 🔗 More Info: https://t.co/zZmkGzGsRg

YatongChen's tweet photo. We (Moritz Hardt, @walesalaudeen96,@joavanschoren) are organizing the Workshop on the Science of Benchmarking & Evaluating AI @EurIPSConf 2025 in Copenhagen!

📢 Call for Posters: https://t.co/jeXRNDexuX
📅 Deadline: Oct 10, 2025 (AoE)
🔗 More Info: https://t.co/zZmkGzGsRg https://t.co/sdRSTYFg3L

2

51

14

7

5K

Siddartha Devic @sid_devic

10 months ago

@korolova I remember you said you wanted this at some point as well haha

0

86

Siddartha Devic @sid_devic

10 months ago

Cursor made me a chrome extension which redirects any html arxiv links that you stumble across on the internet to the pdf version of the arxiv paper instead. Could be useful for some others, but use at your own risk! https://t.co/oJGScrtNBo

1

6

0

217

Siddartha Devic @sid_devic

11 months ago

Check out our position paper on important directions in LLM uncertainty quantification!

Tejas Srinivasan @_Tejas_S_

11 months ago

🚨 Position paper alert! 🚨 LLM uncertainty quantification (UQ) has been explored with the goal of enabling better reliance on LLMs by humans. However, we argue that common LLM UQ practices are detached from this human-centric aspiration.😭😭 https://t.co/P5g7Kco0xh

_Tejas_S_'s tweet photo. 🚨 Position paper alert! 🚨
LLM uncertainty quantification (UQ) has been explored with the goal of enabling better reliance on LLMs by humans. However, we argue that common LLM UQ practices are detached from this human-centric aspiration.😭😭

https://t.co/P5g7Kco0xh https://t.co/IlyXSYR7Is

2

74

10

51

8K

0

20

1

5

723

sid_devic retweeted

Tejas Srinivasan @_Tejas_S_

11 months ago

🚨 Position paper alert! 🚨 LLM uncertainty quantification (UQ) has been explored with the goal of enabling better reliance on LLMs by humans. However, we argue that common LLM UQ practices are detached from this human-centric aspiration.😭😭 https://t.co/P5g7Kco0xh

2

74

10

51

8K

Siddartha Devic

@sid_devic

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users