Rishab Bala @Sub_RBala - Twitter Profile

Pinned Tweet

about 2 months ago

🌟New paper alert: The Master Key Hypothesis🌟

Post-training is expensive AND you have to repeat it for every new model. What if you could just transfer the capability instead?
We introduce UNLOCK, a training-free and label-free framework that transfers capabilities across models using low-rank linear projections.

Takeaways:
1⃣ Skills like CoT & math reasoning live as directions in the model's latent space. We capture these as a steering vector, which we call the Master Key.
2⃣ Simple low-rank linear transformations are sufficient to transfer the Master Key across models' latent spaces.
3⃣ UNLOCK elicits behaviors that even prompting can't reliably trigger.
4⃣ Gains scale with base model strength. Stronger models show larger gains post-transfer.
5⃣ UNLOCK works by sharpening the output distribution, steering the model toward the right answer.
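To make the idea in the tweet above concrete, here is a rough NumPy sketch of what "a steering vector plus a low-rank linear projection" could look like. It is not the paper's implementation: the mean-difference construction, the least-squares fit with SVD truncation, and the names `steering_vector`, `fit_low_rank_map`, and `steer` are all assumptions made for illustration, and the activations are synthetic.

```python
import numpy as np


def steering_vector(acts_with, acts_without):
    """Assumed construction: mean difference between two sets of
    hidden states, each of shape (n_examples, d_model)."""
    return acts_with.mean(axis=0) - acts_without.mean(axis=0)


def fit_low_rank_map(src_acts, tgt_acts, rank):
    """Assumed construction: least-squares linear map from source to
    target activations, truncated to `rank` with an SVD."""
    full_map, *_ = np.linalg.lstsq(src_acts, tgt_acts, rcond=None)
    u, s, vt = np.linalg.svd(full_map, full_matrices=False)
    return (u[:, :rank] * s[:rank]) @ vt[:rank]


def steer(hidden, vector, alpha=1.0):
    """Add the scaled steering vector to every hidden state."""
    return hidden + alpha * vector


# Synthetic stand-ins for paired activations from two models.
rng = np.random.default_rng(0)
d_src, d_tgt, rank = 16, 24, 4
true_map = rng.normal(size=(d_src, rank)) @ rng.normal(size=(rank, d_tgt))
src_acts = rng.normal(size=(200, d_src))
tgt_acts = src_acts @ true_map

# Extract a direction in the source model, then project it into the target.
direction = rng.normal(size=d_src)
key_src = steering_vector(src_acts + direction, src_acts)
low_rank_map = fit_low_rank_map(src_acts, tgt_acts, rank)
key_tgt = key_src @ low_rank_map

# Apply the transferred vector to the target model's hidden states.
steered = steer(tgt_acts, key_tgt, alpha=1.0)
```

In a real setting the two activation sets would come from running both models on the same inputs, and the layer, rank, and scale `alpha` would need to be chosen empirically.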

2

62

16

55

13K

Rishab Bala

@Sub_RBala

29 days ago

@johnhewtt @GeorgeMorgulis Very interesting read! We also find that steering vectors can be used to capture and transfer reasoning capabilities such as CoT, and the steered model can match/surpass prompting. Low-rank signal transfer seems to be very promising! Paper: https://t.co/vQLzjv7axu

0

66

Sub_RBala retweeted

Tu Vu

@tuvllms

about 1 month ago

I am in Rio de Janeiro, Brazil for #ICLR2026 @iclr_conf. Would love to connect and chat about any LLM research topics, explore collaboration, or discuss potential funding opportunities for my lab at @ Virginia Tech.

1

44

5

3K

Rishab Bala

@Sub_RBala

about 2 months ago

@thecekbote If the W_o and next FFN already do the mixing once per layer, isn't IHA functionally similar to MoE in the sense that you mix heads/residuals?

0

26

Rishab Bala

@Sub_RBala

about 2 months ago

@universeinanegg Our recent work shows that reasoning can also be approximately linear in nature: https://t.co/vQLzjv7axu. But I do agree that manual searching is required to find these maps/vectors.

0

1

0

16

Rishab Bala

@Sub_RBala

about 2 months ago

Work done in collaboration with @linusdd44804, @SharmaRituraj19, @anjiefang, Fardin Abdi, Viktor Rozgic, Zheng Du, @mohitban47, and @tuvllms.
Paper: https://t.co/vQLzjv7axu
GitHub: https://t.co/N2CPT13IHh

0

5

1

268

Rishab Bala

@Sub_RBala

about 2 months ago

Based on our findings, we introduce the Master Key Hypothesis and postulate the convergence of capability representations across model families and scales. https://t.co/LmjoVWWTNC

1

3

1

3

346

Rishab Bala

@Sub_RBala

3 months ago

The 3 self-distillation papers seem to be extremely similar in the method and only differ in how the feedback is generated/incorporated. They are also only compared to SFT (known to be the weakest method), while incorporating feedback is also done with other PO methods. Not quite sure of the takeaways, but the improvements and continual learning settings look good!

0

2

0

352

Rishab Bala

@Sub_RBala

4 months ago

@giffmana @ArashVahdat @HaoZhao_AIRSUN What is the prompt for generating an image like this?

0

26

Sub_RBala retweeted

The Sanghani Center at Virginia Tech @SanghaniCtrVT

7 months ago

@therealthapa One more @SanghaniCtrVT paper at #EMNLP2025: "Efficient Model Development through Fine-tuning Transfer" (Main proceedings), by @linusdd44804 @Sub_RBala @tuvllms (all VT) w/ @fyliufengyuan, @kandpal_nikhil https://t.co/OGQa84ots4

0

2

3

0

484

Sub_RBala retweeted

Tu Vu

@tuvllms

10 months ago

Excited to share that our paper on efficient model development has been accepted to #EMNLP2025 Main conference @emnlpmeeting. Congratulations to my students @linusdd44804 and @Sub_RBala on their first PhD paper! 🎉

0

51

9

10

6K

Rishab Bala

@Sub_RBala

12 months ago

@blelbach Will there be a recording?

0

1

0

311

Rishab Bala

@Sub_RBala

12 months ago

@ryolu_ @cursor_ai Before all that can we get a way to remap autocompletes from tab to another button? And a way to get partial edits?

0

57
