Daniel Scalena

@daniel_sc4

Intern of TS @cohere | PhDing @unimib 🇮🇹 & @GroNlp 🇳🇱, interpretability et similia

Joined February 2015

792 Following

198 Followers

103 Posts

Pinned Tweet

Daniel Scalena @daniel_sc4

8 months ago

You can easily save up to 65% of compute while improving performance on reasoning tasks 🤯 👀 Meet EAGer: We show that monitoring token-level uncertainty lets LLMs allocate compute dynamically - spending MORE on hard problems, LESS on easy ones. 🧵👇

daniel_sc4's tweet photo. You can easily save up to 65% of compute while improving performance on reasoning tasks 🤯 👀

Meet EAGer: We show that monitoring token-level uncertainty lets LLMs allocate compute dynamically - spending MORE on hard problems, LESS on easy ones.
🧵👇 https://t.co/SlMiM2sHco

2

25

5

18

6K

Daniel Scalena @daniel_sc4

10 days ago

personal update: I'll be starting an internship at @cohere, working on code agents, one of the most interesting things happening in AI right now. feeling really grateful for this one and genuinely excited to see where it goes. here we go! 👀

1

87

0

8

6K

Daniel Scalena @daniel_sc4

about 1 month ago

Really glad to see this work accepted at ICML 2026🎉 Can’t wait to present this work in Seoul!

Daniel Scalena @daniel_sc4

8 months ago

You can easily save up to 65% of compute while improving performance on reasoning tasks 🤯 👀 Meet EAGer: We show that monitoring token-level uncertainty lets LLMs allocate compute dynamically - spending MORE on hard problems, LESS on easy ones. 🧵👇

daniel_sc4's tweet photo. You can easily save up to 65% of compute while improving performance on reasoning tasks 🤯 👀

Meet EAGer: We show that monitoring token-level uncertainty lets LLMs allocate compute dynamically - spending MORE on hard problems, LESS on easy ones.
🧵👇 https://t.co/SlMiM2sHco

2

25

5

18

6K

1

33

6

8

3K

Daniel Scalena @daniel_sc4

about 2 months ago

@glnmario My year has 11 months because one month’s salary goes to ACL

0

8

0

0

468

Who to follow

PREMIUM Domain https://t.co/SlEsZITpde https://t.co/UxoPjAJaYE https://t.co/zZwzsMz2SI https://t.co/iGlLxnbkmB https://t.co/3qvRDXO4ke https://t.co/xZYYJc2OyF https://t.co/7ZYDEbJl60 https://t.co/CikmtrqfyK

marco acorte (@[email protected])

I'm NaN, I'm a fr33 man !!!! ahahahahaha #geek and coffe #developer, abusing guitar for @drgoreband work: https://t.co/sN2RVreDqA fun: https://t.co/UJEiYITxgQ @[email protected]

Lorenzo Mangano

@LoryMangano1997

📝 Redattore (Fuori Traiettoria/Andare a Pesca con un’Audi R18) 🎙️Telecronista

Daniel Scalena @daniel_sc4

2 months ago

Presenting this tomorrow at EACL2026, poster session at 9am If you’re around come say hi, happy to chat about the work and ideas More details in the thread 👇

Daniel Scalena @daniel_sc4

about 1 year ago

📢 New paper: Applied interpretability 🤝 MT personalization! We steer LLM generations to mimic human translator styles on literary novels in 7 languages. 📚 SAE steering can beat few-shot prompting, leading to better personalization while maintaining quality. 🧵1/

daniel_sc4's tweet photo. 📢 New paper: Applied interpretability 🤝 MT personalization!

We steer LLM generations to mimic human translator styles on literary novels in 7 languages. 📚

SAE steering can beat few-shot prompting, leading to better personalization while maintaining quality.

🧵1/ https://t.co/SucqQzOs1w

2

35

6

13

5K

0

6

0

2

989

Daniel Scalena @daniel_sc4

3 months ago

@GoodfireAI Nice work! I wonder, probe trained on answer choices needs known options. What if you probe model confidence and early exit there regardless of the answer it's thinking? I feel like after some t the model already knows and the rest is just overthinking

0

0

0

0

216

Daniel Scalena @daniel_sc4

3 months ago

@paradigmainc Ok I was trying to cook something to improve model’s scientific creativity, throwing the repo into flywheel feels like the next logical step

0

4

0

0

196

Daniel Scalena @daniel_sc4

4 months ago

@gsarti_ Gemini listened to its AGI intrusive thoughts on this

0

1

0

0

31

Daniel Scalena @daniel_sc4

4 months ago

@thelokasiffers They were pioneers vibecoding it with gpt2 back then

0

1

0

0

52

daniel_sc4 retweeted

Gabriele Sarti @gsarti_

5 months ago

Happy to announce I will be mentoring a SPAR project this Spring! ✨Check out the programme and apply by Jan 14th to work with me on understanding and mitigating implicit personalization in LLMs, i.e. how models form hidden beliefs about users that shape their responses.

gsarti_'s tweet photo. Happy to announce I will be mentoring a SPAR project this Spring! ✨Check out the programme and apply by Jan 14th to work with me on understanding and mitigating implicit personalization in LLMs, i.e. how models form hidden beliefs about users that shape their responses. https://t.co/bDA4V34qRR

1

15

2

1

835

daniel_sc4 retweeted

Gabriele Sarti @gsarti_

5 months ago

Now accepted at EACL main! Check it out! ⬇️

0

15

1

2

1K

Daniel Scalena @daniel_sc4

5 months ago

Want models to translate in the style you actually like? Our paper just got accepted at EACL Main 🚀, check out our work on using interpretability for MT personalization! And, see you in Morocco! 🇲🇦

Daniel Scalena @daniel_sc4

about 1 year ago

📢 New paper: Applied interpretability 🤝 MT personalization! We steer LLM generations to mimic human translator styles on literary novels in 7 languages. 📚 SAE steering can beat few-shot prompting, leading to better personalization while maintaining quality. 🧵1/

daniel_sc4's tweet photo. 📢 New paper: Applied interpretability 🤝 MT personalization!

We steer LLM generations to mimic human translator styles on literary novels in 7 languages. 📚

SAE steering can beat few-shot prompting, leading to better personalization while maintaining quality.

🧵1/ https://t.co/SucqQzOs1w

2

35

6

13

5K

0

3

0

0

269

Daniel Scalena @daniel_sc4

6 months ago

@thelokasiffers Woo big congrats on the launch and shoutout for starting in Rome. Can’t wait to hear more!

0

1

0

0

54

Daniel Scalena @daniel_sc4

7 months ago

@Turn_Trout @GladiaLab SVs are approximate directions in the latent space. They look for exact matches in the latent space. This could make things harder, but I’m still curious to know!

0

1

0

0

133

Daniel Scalena @daniel_sc4

7 months ago

@andy_peng05 @Cohere_Labs @UvA_Amsterdam Hi, thank you very much! Good catch on the p@1, it was meant to be p@k (pass@k). We’ll fix it asap in the preprint!

0

1

0

0

34

Daniel Scalena @daniel_sc4

8 months ago

@ZotosLeo @FersiniE @MalvinaNissim @ahmetustun89 Also cc @jiawzhao and @FuYichao123 — I guess your great work on DeepConf is highly relevant to this project!

0

0

0

0

133

Daniel Scalena @daniel_sc4

8 months ago

You can easily save up to 65% of compute while improving performance on reasoning tasks 🤯 👀 Meet EAGer: We show that monitoring token-level uncertainty lets LLMs allocate compute dynamically - spending MORE on hard problems, LESS on easy ones. 🧵👇

daniel_sc4's tweet photo. You can easily save up to 65% of compute while improving performance on reasoning tasks 🤯 👀

Meet EAGer: We show that monitoring token-level uncertainty lets LLMs allocate compute dynamically - spending MORE on hard problems, LESS on easy ones.
🧵👇 https://t.co/SlMiM2sHco

2

25

5

18

6K

Daniel Scalena @daniel_sc4

8 months ago

Takeaway: EAGer shows we can be MORE efficient & MORE effective by letting models focus compute where it matters most. 📄Paper: https://t.co/JCHBWnKhUX 💻Code: https://t.co/Cp5nr38DLk ✨Huge thanks to my mentors and collaborators @ZotosLeo @FersiniE @MalvinaNissim @ahmetustun89

1

1

0

1

142

Daniel Scalena @daniel_sc4

8 months ago

Had so much fun working on this! Thank you @Cohere_Labs for sharing our work!

8 months ago

How can we make reasoning models more efficient without sacrificing performance? Introducing EAGER, our new entropy-aware generation method, saving compute by up to 65% while lifting Pass@k by up to 37% on benchmarks like AIME.

Cohere_Labs's tweet photo. How can we make reasoning models more efficient without sacrificing performance?

Introducing EAGER, our new entropy-aware generation method, saving compute by up to 65% while lifting Pass@k by up to 37% on benchmarks like AIME. https://t.co/aMN17duCLb

2

15

5

4

2K

0

6

1

0

418

Last Seen Users on Sotwe

Trends for you

Most Popular Users