Luca Zhou @LucaZh00 - Twitter Profile

Pinned Tweet

about 1 month ago

🎉 Our paper on interpretable model mergeability prediction has been accepted to ICML 2026! Excited to share this joint work on understanding when model merging succeeds, with amazing co-authors @bozhao, @yuqirose, and @EmanueleRodola 🙏 Happy to chat in Seoul! 🇰🇷

LucaZh00's tweet photo. 🎉 Our paper on interpretable model mergeability prediction has been accepted to ICML 2026!

Excited to share this joint work on understanding when model merging succeeds, with amazing co-authors @bozhao, @yuqirose, and @EmanueleRodola 🙏

Happy to chat in Seoul! 🇰🇷 https://t.co/E9VMwnfYpU

2

39

4

12

3K

LucaZh00 retweeted

Emanuele Rossi @emaros96

6 days ago

Can a sentence carry a sound? In Communicating Sound Through Natural Language, we introduce lexical acoustic coding (LAC): a way for LLM agents to transmit short sounds as structured English, then re-render the same audio back from that text. (1/6)

4

134

21

104

14K

Luca Zhou @LucaZh00

about 1 month ago

👉https://t.co/xhqx2EP9b3 #ICML2026 #MachineLearning #AI #ModelMerging

0

1

0

174

Luca Zhou @LucaZh00

about 1 month ago

🎉 Our paper on interpretable model mergeability prediction has been accepted to ICML 2026! Excited to share this joint work on understanding when model merging succeeds, with amazing co-authors @bozhao, @yuqirose, and @EmanueleRodola 🙏 Happy to chat in Seoul! 🇰🇷

2

39

4

12

3K

Luca Zhou @LucaZh00

about 1 month ago

These signals are also actionable 🚀 Encouraging gradient similarity during fine-tuning improves post-merge performance for most methods we tested.

1

2

1

0

172

LucaZh00 retweeted

ItalAI

@_italai

about 2 months ago

Even SOTA vision-language models struggle with grounded reasoning on time series. They fail at precise numeric & temporal understanding and often don’t properly use the visual signal. “CaTS-Bench: Can Language Models Describe Time Series?”, recently accepted to Findings of ACL 2026, introduces a large-scale, real-world multimodal benchmark for context-aware time series captioning and reasoning (combining numeric signals, metadata, and plots). Find out more 👉https://t.co/9umcAB1usr

1

4

2

1

368

Luca Zhou @LucaZh00

about 2 months ago

Huge thanks to @yuqirose and @GalassoFab10 for their supervision, as well as @AlessioSampier1, @ZihaoKevinZhou, Pratham Yashwante, and Marshall Fisher for their collaboration!

0

1

0

708

Luca Zhou @LucaZh00

about 2 months ago

🚀Our paper “CaTS-Bench: Can Language Models Describe Time Series?” has been accepted to ACL 2026 Findings! It introduces a new multimodal benchmark for time series captioning and reasoning across various domains.

LucaZh00's tweet photo. 🚀Our paper “CaTS-Bench: Can Language Models Describe Time Series?” has been accepted to ACL 2026 Findings!

It introduces a new multimodal benchmark for time series captioning and reasoning across various domains. https://t.co/Fc2GUkMjIa

1

3

0

1

126

Luca Zhou @LucaZh00

about 2 months ago

💡We find that current VLMs: 1) Struggle with numeric and temporal grounding 2) Largely ignore visual cues when reasoning 📷Check out more at: https://t.co/zPUJrmC1Df

1

0

89

LucaZh00 retweeted

Paradigma

@paradigmainc

3 months ago

introducing Flywheel: the infrastructure for autonomous research.

27

550

73

537

120K

Luca Zhou @LucaZh00

6 months ago

My first ever conference attendance at @NeurIPSConf was incredibly fulfilling. From discussions on model merging to the latest breakthroughs in deep learning, it was the perfect environment to sharpen my thinking and explore new directions for my research. 🚀

LucaZh00's tweet photo. My first ever conference attendance at @NeurIPSConf was incredibly fulfilling. From discussions on model merging to the latest breakthroughs in deep learning, it was the perfect environment to sharpen my thinking and explore new directions for my research. 🚀 https://t.co/ttjz6PZLjm

0

2

1

0

73

Luca Zhou @LucaZh00

8 months ago

📄 Read "On Task Vectors and Gradients": https://t.co/cGcjHnDSLX #AI #MachineLearning #ModelMerging #DeepLearning #NeurIPS2025 #UniReps

0

54

Luca Zhou @LucaZh00

8 months ago

Want to make model merging ⚡️ fast and effective? Our new paper, accepted to the UniReps Workshop @NeurIPSConf, reveals a surprising insight: Merging models after just one epoch of fine-tuning is often as good as merging fully converged ones!

LucaZh00's tweet photo. Want to make model merging ⚡️ fast and effective?

Our new paper, accepted to the UniReps Workshop @NeurIPSConf, reveals a surprising insight: Merging models after just one epoch of fine-tuning is often as good as merging fully converged ones! https://t.co/xvlxRvEdRk

1

4

1

697

Luca Zhou @LucaZh00

8 months ago

Why? We provide the first theoretical proof that a task vector is essentially a scaled gradient. This reframes task arithmetic as a form of approximate multitask learning.

1

0

86

LucaZh00 retweeted

Donato Crisostomi @DonatoCrisosto1

over 1 year ago

I know you're probably thinking, "Yeah, these neuron-permutation-based model merging methods are cool.. but are they cycle-consistent (CC)?" Say no more! It just so happens that our new #NeurIPS24 paper covers exactly this! Huh? No idea what I am talking about? Read on (1/6)

DonatoCrisosto1's tweet photo. I know you're probably thinking, "Yeah, these neuron-permutation-based model merging methods are cool.. but are they cycle-consistent (CC)?"

Say no more!
It just so happens that our new #NeurIPS24 paper covers exactly this!

Huh? No idea what I am talking about? Read on
(1/6) https://t.co/m0hk2m0Y4o

1

29

9

5

3K

Luca Zhou @LucaZh00

over 1 year ago

🙏Big thanks to my amazing co-authors @dansolombrino, @DonatoCrisosto1, @mariasofiabuc, @fabreetseo, and @EmanueleRodola for their incredible contributions—this project wouldn’t have come to life without them!🙌 👉 Read the paper here: https://t.co/NKWxpD2gt2

0

3

0

168

Luca Zhou @LucaZh00

over 1 year ago

↗️Task vectors? More like gradients🔽! We show that, under certain assumptions, they’re actually deeply related.🔗 ATM: A game-changing framework for multi-task model merging with no hidden fees💵!

LucaZh00's tweet photo. ↗️Task vectors? More like gradients🔽!

We show that, under certain assumptions, they’re actually deeply related.🔗

ATM: A game-changing framework for multi-task model merging with no hidden fees💵! https://t.co/bhghxOw5pL

1

10

3

0

1K

Luca Zhou @LucaZh00

over 1 year ago

🔍What’s the paper about? ◾Discovered fascinating relations between task vectors and multi-task gradients. ◾Proposed ATM: an efficient SOTA framework for multi-task model merging. ◾Mathematically and empirically motivated the effectiveness of ATM.

1

3

0

167

Luca Zhou

@LucaZh00

Last Seen Users on Sotwe

Trends for you

Most Popular Users