#ICLR2026
Into mode connectivity, model merging, or permutation invariance? We show how optimization dynamics shape the loss landscape of merged weights. Come check it out!
📅 23/04 10:30AM – 1:00PM
📍 Pavilion 3 P3-1809
w/ @TheusResearch @DamienTeney @orvieto_antonio
PSA: never, ever write "we use the same learning rate across all methods for fair comparison"
I read this as "do not trust any of our conclusions" and then I move on.
If learning rate tuning is not mentioned, it takes me a little more time to notice that, but I also move on.
@AdirRahamim1 nice! we worked on a related question: how do the optimization dynamics affect model merging? how do different optimization components interact with each other? https://t.co/YoYeT9JG1S
@_igorshilov Cool work! I have seen a similar idea before, where they also show the possibility of confining unwanted data to specific neurons in smaller networks:
https://t.co/9j5yZlNRlv
More on https://t.co/F2yzZEJXQs e.g.,
- Python remains the most used language
- Quantization (4-8 bits) + LoRA is used for LLMs
- AutoML methods are becoming useful for narrow domains
@aaron_defazio Shameless plug that you may find interesting. We studied how the optimization dynamics affect the averaging of model weights https://t.co/YoYeT9KdRq
“To write well is to think clearly” - which is to say that by delegating more and more of your writing effort to AI, you are inviting the slow erosion of your own ability to think clearly.
The Industrial Revolution brought environmental pollution, and artificial intelligence brings information contamination. It is getting increasingly serious. Pollution destroys the environment, but contaminated information destroys civilizations.
@giffmana I am using yabai and it disables the macOS animations. The desktop transition is instant and comparable to i3. The only problem I have is that it is sometimes buggy: I need to reset it bc it stops autotiling
ppl in ML don’t realize how good they have it with arxiv, blogposts, and discord
so many other fields still operate in a world of gatekept paywalled slop sites and year-long journal wait times
legit feels like a third world country
I get a lot of reviews that say my work is not novel and I bet I'm not alone. It's always frustrating because I see novelty where the reviewer doesn't. Rather than rebut every critique, I've written a blog post to help reviewers think about novelty. https://t.co/UXLabOkYcn