Alexander Theus @theusresearch - Twitter Profile

about 2 months ago

#ICLR2026 Into mode connectivity, model merging, or permutation invariance? We show how optimization dynamics shape the loss landscape of merged weights. Come check it out! 📅 23/04, 10:30 AM – 1:00 PM 📍 Pavilion 3 P3-1809 w/ @TheusResearch @DamienTeney @orvieto_antonio

TheusResearch retweeted

Weight Space Symmetries @ ICML 2026 @weightsymmetry

about 2 months ago

📢 Submissions are OPEN for the Weight Space Symmetry Workshop @icmlconf! ⏰ Deadline extended → April 30 (23:59 AOE) Consider submitting any work related to weight symmetries: optimization, model merging, weight space learning, and so on! #ICML2026 #weightsymmetry2026

TheusResearch retweeted

Weight Space Symmetries @ ICML 2026 @weightsymmetry

2 months ago

📢Excited to announce the Workshop on Weight-Space Symmetries @icmlconf! We welcome 4-page submissions analysing symmetries, their effects on training and model structure, and practical methods to utilize them. Submission Deadline: April 24 (23:59 AoE) #ICML2026

Alexander Theus @TheusResearch

9 months ago

Excited to announce that our paper has been accepted as an Oral at NeurIPS 2025! 🥳

Alexander Theus @TheusResearch

11 months ago

1/ 🚨 New paper alert! 🚨 We explore a key question in deep learning: Can independently trained Transformers be linearly connected in weight space — without a loss barrier? Yes — if you uncover their rich symmetries. 📄 arXiv: https://t.co/wVoLYNzk0m

Alexander Theus @TheusResearch

11 months ago

10/ 📄 Paper: https://t.co/wVoLYNyMaO By: @Theus__A , Alessandro Cabodi, @SAnagnostidis , @orvieto_antonio , @unregularized , and @val_boeva 🙏 Huge thanks to my amazing co-authors for this collaboration! #Transformers #LMC #MachineLearning #DeepLearning

Alexander Theus @TheusResearch

11 months ago

9/ 🔑 Takeaway: Transformers can be linearly connected — but only if you exploit richer network symmetries. We show that general symmetry alignment (not just permutations) unlocks low-loss paths across ViTs and GPT-2.
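
For readers new to the terms in this thread, here is a minimal sketch of the simplest symmetry involved, a permutation of hidden units, on a toy two-layer ReLU network. It is not the paper's method: the thread says Transformers need richer symmetries than permutations, and this toy only illustrates why weight-space symmetries matter when interpolating between models. All function names (`mlp`, `permute_hidden`, `interpolate`) and the network sizes are hypothetical choices made for this illustration.

```python
import numpy as np

def mlp(x, W1, b1, W2):
    """Toy two-layer ReLU network: x -> W2 @ relu(W1 @ x + b1)."""
    return W2 @ np.maximum(W1 @ x + b1, 0.0)

def permute_hidden(W1, b1, W2, perm):
    """Reorder the hidden units; the network computes the same function."""
    return W1[perm], b1[perm], W2[:, perm]

def interpolate(theta_a, theta_b, alpha):
    """Point on the straight line between two weight tuples."""
    return tuple((1 - alpha) * a + alpha * b for a, b in zip(theta_a, theta_b))

rng = np.random.default_rng(0)
W1 = rng.normal(size=(8, 4))   # hidden x input
b1 = rng.normal(size=8)        # hidden bias
W2 = rng.normal(size=(3, 8))   # output x hidden
x = rng.normal(size=4)
perm = rng.permutation(8)

theta = (W1, b1, W2)
theta_perm = permute_hidden(W1, b1, W2, perm)

# Different points in weight space, identical function.
same_function = np.allclose(mlp(x, *theta), mlp(x, *theta_perm))

# Halfway along the straight line between the two weight settings.
midpoint = interpolate(theta, theta_perm, 0.5)
```

Because `theta` and `theta_perm` generally differ as weight vectors while computing the same function, the network at `midpoint` can behave differently from both endpoints. Symmetry alignment in this toy setting means undoing such a reordering before interpolating.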
