#ICLR2026
Into mode connectivity, model merging, or permutation invariance? We show how optimization dynamics shape the loss landscape of merged weights. Come check it out!
π 23/04 10:30AM β 13:00PM
π Pavilion 3 P3-1809
w/ @TheusResearch@DamienTeney@orvieto_antonio
π’ Submissions are OPEN for the Weight Space Symmetry Workshop @icmlconf!
β° Deadline extended β April 30 (23:59 AOE)
Consider submitting any work related to weight symmetries: optimization, model merging, weight space learning, and so on!
#ICML2026#weightsymmetry2026
π’Excited to announce the Workshop on Weight-Space Symmetries @icmlconf! We welcome 4-page submissions analysing symmetries, their effects on training and model structure, and practical methods to utilize them.
Submission Deadline: April 24 (23:59 AoE)
#ICML2026
1/ π¨ New paper alert! π¨
We explore a key question in deep learning:
Can independently trained Transformers be linearly connected in weight space β without a loss barrier?
Yes β if you uncover their rich symmetries.
π arXiv: https://t.co/wVoLYNzk0m
1/ π¨ New paper alert! π¨
We explore a key question in deep learning:
Can independently trained Transformers be linearly connected in weight space β without a loss barrier?
Yes β if you uncover their rich symmetries.
π arXiv: https://t.co/wVoLYNzk0m
9/ π Takeaway:
Transformers can be linearly connected β but only if you exploit richer network symmetries.
We show that general symmetry alignment (not just permutations) unlocks low-loss paths across ViTs and GPT-2.