Flavio Calmon @FlavioCalmon - Twitter Profile

about 2 months ago

📣 Excited to announce our oral presentation at #ICLR! LLMs capture rich semantic structure, as evidenced by their strong performance across a wide range of language and reasoning tasks. But Sparse Autoencoders (SAEs), a popular interpretability tool, mostly learn local, noisy, token-level features when applied to LLMs (e.g., hundreds of features for the word “the”). So why aren’t SAEs finding that rich semantic structure? 👉 Because they ignore the sequential nature of language. We introduce Temporal SAEs to bridge this gap. https://t.co/HLvuAV7Qek 🧵 [1/N]

hima_lakkaraju's tweet photo. 📣 Excited to announce our oral presentation at #ICLR!

LLMs capture rich semantic structure, as evidenced by their strong performance across a wide range of language and reasoning tasks.

But Sparse Autoencoders (SAEs), a popular interpretability tool, mostly learn local, noisy, token-level features when applied to LLMs (e.g., hundreds of features for the word “the”).

So why aren’t SAEs finding that rich semantic structure?

👉 Because they ignore the sequential nature of language.

We introduce Temporal SAEs to bridge this gap.

https://t.co/HLvuAV7Qek

🧵 [1/N]

5

169

26

108

23K

FlavioCalmon retweeted

Carol Long @carollong2047

9 months ago

Can GenAI agents🤖 manage a supply chain? Lessons from the classical beer game🍺 (1/n) @davidsimchilevi @FlavioCalmon @AndreCalmon

carollong2047's tweet photo. Can GenAI agents🤖 manage a supply chain? Lessons from the classical beer game🍺 (1/n) @davidsimchilevi @FlavioCalmon @AndreCalmon https://t.co/RbKi3NLs7z

1

5

3

1

473

FlavioCalmon retweeted

Alex Oesterling @alex_oesterling

11 months ago

‼️🕚New paper alert with @ushabhalla_: Leveraging the Sequential Nature of Language for Interpretability (https://t.co/VCNjWY6gtK)! 1/n

1

17

8

6

2K

FlavioCalmon retweeted

Hadi Khalaf @hskhalaf

11 months ago

How can we improve LLMs without any additional training? 🤔 The standard playbook is using Best-of-N: generate N responses ➡️ use a reward model to score them ➡️ pick the best 🏆 More responses = better results... right? Well, not exactly. You might be reward hacking! Instead, you should hedge! 🎯

1

9

1

2

1K

Who to follow

Haewon Jeong

@HaewonJeong00

Assistant Prof @UCSB ECE. Previously, Ph.D student @CMU_ECE & Post-doc @Harvard @hseas. She/her/hers. https://t.co/eukRWcPU9i

Mahdi Haghifam

@HaghifamMahdi

Foundations and Algorithm for Reliable and Responsible AI. @TTIC_Connect @KhouryCollege. PhD @UofT @VectorInst. Intern @GoogleDeepMind,@ServiceNowRSRCH

Murat Kocaoglu

@murat_kocaoglu_

Asst. Prof. at Johns Hopkins CS. Research on causal inference, causal discovery, generative AI, info theory, online learning.

FlavioCalmon retweeted

Dor Tsur @DorTsurr

12 months ago

Can we use coding-theory, heavy-tailed distributions, and optimal-transport to create 𝘇𝗲𝗿𝗼-𝗱𝗶𝘀𝘁𝗼𝗿𝘁𝗶𝗼𝗻, 𝗲𝗮𝘀𝘆 𝘁𝗼 𝘂𝘀𝗲, 𝘄𝗮𝘁𝗲𝗿𝗺𝗮𝗿𝗸𝘀 𝗳𝗼𝗿 𝗟𝗟𝗠𝘀? We show they can — and the result is pretty exciting! 🎉 🧵 (1/n)

DorTsurr's tweet photo. Can we use coding-theory, heavy-tailed distributions, and optimal-transport to create 𝘇𝗲𝗿𝗼-𝗱𝗶𝘀𝘁𝗼𝗿𝘁𝗶𝗼𝗻, 𝗲𝗮𝘀𝘆 𝘁𝗼 𝘂𝘀𝗲, 𝘄𝗮𝘁𝗲𝗿𝗺𝗮𝗿𝗸𝘀 𝗳𝗼𝗿 𝗟𝗟𝗠𝘀? We show they can — and the result is pretty exciting! 🎉 🧵 (1/n) https://t.co/UZaeAOYqd5

1

2

312

FlavioCalmon retweeted

Harvard University

@Harvard

about 1 year ago

Without its international students, Harvard is not Harvard. https://t.co/V8uvTNaL64

20K

90K

16K

4K

12M

FlavioCalmon retweeted

Hadi Khalaf @hskhalaf

about 1 year ago

Happy to share we received best paper at NENLP workshop at Yale 🥳🥳! tldr: Current alignment methods give excessive discretion to annotators in defining what good behavior means. This means we don't know what we are aligning to ‼️ We formalize discretion in alignment and propose mechanisms for data curators & model developers to monitor for it. Paper link below ⬇️

hskhalaf's tweet photo. Happy to share we received best paper at NENLP workshop at Yale 🥳🥳!

tldr: Current alignment methods give excessive discretion to annotators in defining what good behavior means. This means we don't know what we are aligning to ‼️

We formalize discretion in alignment and propose mechanisms for data curators & model developers to monitor for it.

Paper link below ⬇️

3

23

3

3K

FlavioCalmon retweeted

Hao Wang @HW_HaoWang

about 1 year ago

[1/x] 🚀 We're excited to share our latest work on improving inference-time efficiency for LLMs through KV cache quantization---a key step toward making long-context reasoning more scalable and memory-efficient.

HW_HaoWang's tweet photo. [1/x] 🚀 We're excited to share our latest work on improving inference-time efficiency for LLMs through KV cache quantization---a key step toward making long-context reasoning more scalable and memory-efficient. https://t.co/kAYPUZHUNN

9

26

8

2

4K

FlavioCalmon retweeted

Lucas Monteiro Paes @Lucas_MPaes

over 1 year ago

9/n Full paper here: 🔗 https://t.co/XCSZxIm7Na. Huge thanks to my amazing team of co-authors: @maartenbuyl, @hskhalaf, @claudiomverdun, @caiocvm, and @FlavioCalmon. Work done @hseas!

0

6

1

0

371

FlavioCalmon retweeted

Lucas Monteiro Paes @Lucas_MPaes

over 1 year ago

AI is built to “be helpful” or “avoid harm”, but which principles should it prioritize and when? We call this alignment discretion. As Asimov's stories show: balancing principles for AI behavior is tricky. In fact, we find that AI has its own set of priorities (comic @xkcd)👇

Lucas_MPaes's tweet photo. AI is built to “be helpful” or “avoid harm”, but which principles should it prioritize and when?

We call this alignment discretion. As Asimov's stories show: balancing principles for AI behavior is tricky.

In fact, we find that AI has its own set of priorities
(comic @xkcd)👇 https://t.co/FGBcGetYX4

1

10

6

0

754

FlavioCalmon retweeted

Bogdan Kulynych @hiddenmarkov

over 1 year ago

The standard practice in differential privacy of targeting ε at small δ is extremely lossy for interpreting the level of privacy protection. In practice (e.g., for DP-SGD), we can do much better! We show how in the #NeurIPS2024 paper: https://t.co/LKeW48wMx1 Short summary👇

5

10

3

793

FlavioCalmon retweeted

Maarten Buyl @maartenbuyl

over 1 year ago

Imagine an all-powerful AI with any ideology you don't agree with! Super proud of this work, where we show that every LLM reflects a different ideological worldview, which should worry everyone.

1

3

1

0

344

FlavioCalmon retweeted

Alex Oesterling @alex_oesterling

over 1 year ago

Finally, I am pleased to announce 🪢Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE)🪢 Joint work with Usha Bhalla, as well as @Suuraj, @FlavioCalmon, and @hima_lakkaraju, which was just accepted to NeurIPS 2024! Check out the paper here: https://t.co/N1dmE1mkmA

alex_oesterling's tweet photo. Finally, I am pleased to announce

🪢Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE)🪢

Joint work with Usha Bhalla, as well as @Suuraj, @FlavioCalmon, and @hima_lakkaraju, which was just accepted to NeurIPS 2024! Check out the paper here:
https://t.co/N1dmE1mkmA https://t.co/88abB0u38O

1

177

17

101

28K

FlavioCalmon retweeted

Alex Oesterling @alex_oesterling

over 1 year ago

Part 2 of my 2024 publication tweets! Please welcome Multi-group Proportional Representation, a novel metric for measuring representation in image generation and retrieval. This work was recently accepted at @NeurIPSConf 2024. (1/n)

1

17

2

1K

FlavioCalmon retweeted

Alex Oesterling @alex_oesterling

over 1 year ago

First up, how do various aspects of trustworthy machine learning interact? Can we expect a production ML system to satisfy all regulatory requirements of fairness, privacy, and interpretability simultaneously when past research generally focuses on one component at a time? (1/n)

1

8

1

0

511

Flavio Calmon @FlavioCalmon

over 1 year ago

Mario was a friend, close collaborator, and the first post-doc I hired at Harvard. This is a devastating loss to our community. Please consider reading one of Mario's papers this week. You can also learn more about his research here: https://t.co/zn8JG1tONX

0

4

0

818

Flavio Calmon @FlavioCalmon

over 1 year ago

Mario Diaz Torres, a brilliant researcher and mathematician, passed away suddenly on August 31st. @MDMarioDiaz was a rising star in the LatAm math community and was doing exceptional work in information theory, differential privacy, and related areas. https://t.co/cTeHRS2WMB

1

12

3

1

2K

Flavio Calmon @FlavioCalmon

over 1 year ago

Mario was incredibly passionate about math, information theory, and statistics. He was homeschooling his son so he could “teach him math in a principled and advanced manner.” Now his family really needs our support. Please consider donating here: https://t.co/YqX3lmEptH

4

15

9

1

5K

Flavio Calmon @FlavioCalmon

almost 2 years ago

This week, I spoke on the panel “AI, Rights, and Democracy” at the Brazilian Supreme Court. Thank you @STF_oficial for the invitation. It was an incredible experience! See my talk (in pt-br) here: https://t.co/JTz0z0YLyE

0

12

2

1

621

FlavioCalmon retweeted

Alexandra Olteanu @o_saja

almost 2 years ago

Back home from FAccT - I thankful for the work our community is doing & the values it stands for. Serving it has been a labor of love for me & I am beyond grateful to have done so this year along my truly wonderful program co-chairs & human beings @mikarv @RDBinns @FlavioCalmon

0

50

4

0

4K

Flavio Calmon

@FlavioCalmon

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users