Riccardo Zaccone

@RickZack96

Past Visiting PhD student @ MBZUAI | PhD student in Decentralized Learning @PoliTONews | Alumno IEEE-HKN @HKNPoliTo

Turin, Italy

Joined August 2022

75 Following

69 Followers

35 Posts

Pinned Tweet

Riccardo Zaccone @RickZack96

5 days ago

Are curious about how to resize any pretrained model on demand? Then you'll love our paper "FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment", accepted at ICML 2026 (Spotlight) Joint work with @sam_hrvth, @stevelaskaridis, @mciccone_AI 🧵1/7

RickZack96's tweet photo. Are curious about how to resize any pretrained model on demand?

Then you'll love our paper "FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment", accepted at ICML 2026 (Spotlight)

Joint work with @sam_hrvth, @stevelaskaridis, @mciccone_AI

🧵1/7 https://t.co/0882MMIbNu

6

21

4

9

8K

Riccardo Zaccone @RickZack96

about 13 hours ago

@gabriberton @georgiagkioxari @taiyasaki I'm looking forward to reading about (2): I can see valid points for/against this. Another point is distinguishing closed-source from unreproducible articles, which is actually a broader concern.

0

0

0

0

154

Riccardo Zaccone @RickZack96

4 days ago

@atrost3122 Thanks @atrost3122!

0

1

0

0

21

Riccardo Zaccone @RickZack96

5 days ago

@sam_hrvth @stevelaskaridis @mciccone_AI Results, finally! FlexRank exhibits much more graceful degradation as the param budgets reduces. Results on commonsense tasks of lm-eval-harness (LLMs) and on ImageNet-1K (ViTs). More results and ablations in the paper 📑 📄 Project Page: https://t.co/NWeW1J8mg7

RickZack96's tweet photo. @sam_hrvth @stevelaskaridis @mciccone_AI Results, finally!

FlexRank exhibits much more graceful degradation as the param budgets reduces.

Results on commonsense tasks of lm-eval-harness (LLMs) and on ImageNet-1K (ViTs).

More results and ablations in the paper 📑

📄 Project Page: https://t.co/NWeW1J8mg7 https://t.co/ocYxJ6DFlG

0

0

0

0

99

Who to follow

Debra Ruth Becker

Faith, family, friends, proud to be an American. America First...then give generously.

Postdoctoral Researcher @ Univerisity of Amsterdam -- I like AI, Art, Ethics, and human rights :)

@chiaraplizzari

Assistant Professor @ Bocconi University, Milan

Riccardo Zaccone @RickZack96

5 days ago

Are curious about how to resize any pretrained model on demand? Then you'll love our paper "FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment", accepted at ICML 2026 (Spotlight) Joint work with @sam_hrvth, @stevelaskaridis, @mciccone_AI 🧵1/7

RickZack96's tweet photo. Are curious about how to resize any pretrained model on demand?

Then you'll love our paper "FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment", accepted at ICML 2026 (Spotlight)

Joint work with @sam_hrvth, @stevelaskaridis, @mciccone_AI

🧵1/7 https://t.co/0882MMIbNu

6

21

4

9

8K

Riccardo Zaccone @RickZack96

5 days ago

@sam_hrvth @stevelaskaridis @mciccone_AI How does layerwise rank reduction translate into real inference savings? Standard SVD adds parameter/FLOP overhead, requiring aggressive rank reduction to yield benefits. We propose a novel factorization that guarantees savings at any budget, without aggressive rank reduction.

RickZack96's tweet photo. @sam_hrvth @stevelaskaridis @mciccone_AI How does layerwise rank reduction translate into real inference savings?

Standard SVD adds parameter/FLOP overhead, requiring aggressive rank reduction to yield benefits.

We propose a novel factorization that guarantees savings at any budget, without aggressive rank reduction. https://t.co/uTkrevbEpF

0

1

1

0

94

Riccardo Zaccone @RickZack96

5 days ago

@sam_hrvth @stevelaskaridis @mciccone_AI 🧐Noticed the nestedness constrain above? ‼️This is a key passage: we prove for a 1-NN that violating nestedness degrades the Pareto Front. This serves as insight that the same constraint should be imposed globally on DNNs.

RickZack96's tweet photo. @sam_hrvth @stevelaskaridis @mciccone_AI 🧐Noticed the nestedness constrain above?

‼️This is a key passage: we prove for a 1-NN that violating nestedness degrades the Pareto Front.

This serves as insight that the same constraint should be imposed globally on DNNs. https://t.co/RVX1Nt9nwP

0

0

0

0

87

Riccardo Zaccone @RickZack96

5 days ago

@sam_hrvth @stevelaskaridis @mciccone_AI We now have layers which can be cut by reducing their rank: how to spend any global budget across layers? While the problem is combinatorial, we can find an approximate Pareto Front in polynomial time. A subset of these submodels is then optimized with KD.

RickZack96's tweet photo. @sam_hrvth @stevelaskaridis @mciccone_AI We now have layers which can be cut by reducing their rank: how to spend any global budget across layers?

While the problem is combinatorial, we can find an approximate Pareto Front in polynomial time.

A subset of these submodels is then optimized with KD. https://t.co/7ms3U8r15i

0

1

1

1

111

Riccardo Zaccone @RickZack96

5 days ago

@sam_hrvth @stevelaskaridis @mciccone_AI The first step is initializing an elastic rank space without retraining the model from scratch. This is achieved by decomposing each pretrained layer via a DataSVD, i.e. an SVD aligned with data directions. ⚠️IMPORTANT: we show that this is good only as initialization

RickZack96's tweet photo. @sam_hrvth @stevelaskaridis @mciccone_AI The first step is initializing an elastic rank space without retraining the model from scratch.

This is achieved by decomposing each pretrained layer via a DataSVD, i.e. an SVD aligned with data directions.

⚠️IMPORTANT: we show that this is good only as initialization https://t.co/crTHvjia1V

0

0

0

0

101

Riccardo Zaccone @RickZack96

5 days ago

@sam_hrvth @stevelaskaridis @mciccone_AI Pretrained models are fixed monoliths, but in practice runtime budgets vary beyond the coarsely grained released variants. Ideally, we'd like a family of shared-weight, pareto-optimal models, i.e. models lying on the optimal performance/cost frontier.

RickZack96's tweet photo. @sam_hrvth @stevelaskaridis @mciccone_AI Pretrained models are fixed monoliths, but in practice runtime budgets vary beyond the coarsely grained released variants.

Ideally, we'd like a family of shared-weight, pareto-optimal models, i.e. models lying on the optimal performance/cost frontier. https://t.co/2YXQtg1pNF

0

1

0

0

120

Riccardo Zaccone @RickZack96

6 months ago

@niclane7 @KairouzPeter @gingsmith @konstmish @aaron_defazio @MatharyCharles @Ar_Douillard @sam_hrvth @_arohan_ @samsja19 I'll be presenting both works at NeurIPS, happy to meet and talk more about it if you're around. https://t.co/MpYdeo5asx

Riccardo Zaccone @RickZack96

6 months ago

🚀 Excited to be at #NeurIPS2025 this week! I’ll be presenting our work on distributed and federated optimization. You'll find me on 6th Dec: - OPT for ML: 20A 10-11 am - Reliable ML: 2, 1:30 2:15 pm If you're working on learning at scale, come find me at — happy to chat 🤝

0

8

2

0

508

0

0

0

0

112

Riccardo Zaccone @RickZack96

11 months ago

Do you feel FL research is stuck with methods that do not work well in realistic scenarios? 🤔 🫵We got you! Introducing 🚀Generalized Heavy-Ball Momentum (GHBM)🚀, accepted at TMLR: the FL algorithm with both SOTA theoretical guarantees and much better empirical results. 🧵1/9

RickZack96's tweet photo. Do you feel FL research is stuck with methods that do not work well in realistic scenarios? 🤔

🫵We got you!
Introducing 🚀Generalized Heavy-Ball Momentum (GHBM)🚀, accepted at TMLR:
the FL algorithm with both SOTA theoretical guarantees and much better empirical results.

🧵1/9 https://t.co/qZoI7ujCX4

1

21

6

8

4K

Riccardo Zaccone @RickZack96

6 months ago

@niclane7 @KairouzPeter @gingsmith @konstmish @aaron_defazio @MatharyCharles @Ar_Douillard @sam_hrvth @_arohan_ @samsja19 Sure thing! Reproducibility is key in FL research — that’s why we released full code and hyperparameters already at the submission stage. Integrating the method into Flower would be a great next step for wider adoption. Happy to collaborate!

2

1

0

0

79

Riccardo Zaccone @RickZack96

6 months ago

@KairouzPeter @gingsmith @konstmish @aaron_defazio @MatharyCharles @Ar_Douillard @niclane7 @sam_hrvth @_arohan_ @samsja19 📢 NEWS: if you wondered if it is possible to just use classical momentum (e.g. FedAvgM) and achieve the results of GHBM, our follow-up work at OPT@NeurIPS2025 provides the answer: https://t.co/SpU6gTA9kj

1

0

0

0

124

Riccardo Zaccone @RickZack96

11 months ago

Tagging relevant people in Optimization, FL and Distributed Training who might be interested in this work @KairouzPeter @gingsmith @konstmish @aaron_defazio @MatharyCharles @Ar_Douillard @niclane7 @sam_hrvth @_arohan_ @samsja19

1

4

0

0

207

Riccardo Zaccone @RickZack96

6 months ago

🚀 Excited to be at #NeurIPS2025 this week! I’ll be presenting our work on distributed and federated optimization. You'll find me on 6th Dec: - OPT for ML: 20A 10-11 am - Reliable ML: 2, 1:30 2:15 pm If you're working on learning at scale, come find me at — happy to chat 🤝

0

8

2

0

508

RickZack96 retweeted

Gabriele Trivigno @gabTrivv

7 months ago

🔥 Our paper SANSA is a #NeurIPS2025 Spotlight! We turn #SAM2 into a semantic few-shot segmenter for objects and parts, fully promptable (mask · point · box · scribble); only 10M trainable parameters and 5× faster than competitors. Code, models & demo https://t.co/bdfUd1YnlG 👇

1

22

11

13

2K

Riccardo Zaccone @RickZack96

7 months ago

@MatharyCharles Thank you for sharing this opportunity — I’ve just submitted my application

0

0

0

0

196

Riccardo Zaccone @RickZack96

8 months ago

It's always nice when the service to the community is recognized, thank you @NeurIPSConf

RickZack96's tweet photo. It's always nice when the service to the community is recognized, thank you @NeurIPSConf https://t.co/gmLjreGAoM

0

0

0

0

23

Riccardo Zaccone @RickZack96

10 months ago

@AtaeiMe @SametOymac That should be classified as irresponsible reviewing, and it should be easy to prove that mentioned papers were allucinated, as well as published after the submission deadline. Just for curiosity, what is the comment for the "1" score?

1

0

0

0

180

Last Seen Users on Sotwe

Trends for you

Most Popular Users