🚨 Life update 🚨 I moved to Toronto 🇨🇦and joined @VectorInst as a Postdoctoral Fellow to work with @colinraffel and his lab on collaborative, decentralized, and modular machine learning to democratize ML model development. Exciting times ahead! 🪿
Are curious about how to resize any pretrained model on demand?
Then you'll love our paper "FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment", accepted at ICML 2026 (Spotlight)
Joint work with @sam_hrvth, @stevelaskaridis, @mciccone_AI
🧵1/7
⏳ Don't miss the chance to share your research on Continual Adaptation @ CATS workshop for #ICML2026.
Working on Scale & Efficiency, Alignment, Multimodality, or Forgetting? Get your 4-page submissions in.
🗓️ Deadline: April 30, 2026 (23:59 AoE)🔗 https://t.co/7aqeP342EN
The DiLoCo team at Google DeepMind and Google Research is proud to release Decoupled DiLoCo, the next frontier for resilient AI pre-training.
Decoupled DiLoCo enables training with datacenters across the world, using heterogeneous hardware, and never halting the system despite hardware failures.
@thegautamkamath One-page rebuttals are sufficient in most situations and easier to verify. If reviewers formulate their concerns precisely (unfortunately harder and harder), authors should be able to address them directly and get straight to the point. No fluff.
@manthanguptaa nice write-up. We have released a study of different tokenizers' characteristics and a multilingual robustness benchmark if you are interested https://t.co/DI6zsgoNP3
@kchonyc haha +1 for PAI, Mother's Dumpling is good, and BIWON for Korean food! Amal for fancy Lebanese food. I also like Momo Ghar (more east), and RAIJIN or Ikkousha for ramen