Muqeeth @Muqeeth10 - Twitter Profile

about 2 months ago

Hi! If you are interested in game-theoretic analysis of the AI race and open vs. closed sourcing, check out our new paper: " Why Open Source? A Game-Theoretic Analysis of the AI Race " https://t.co/FNXcUNBiwl There are some cute complexity results there 🙂

1

19

8

2K

Muqeeth10 retweeted

Cooperative AI Foundation

@coop_ai

3 months ago

The Cooperative AI Summer School 2026 'Expression of interest' applications are now open! If you're an early-career professional studying or working in cooperative AI, apply to join us in Canada this August for an exciting intensive programme.

2

56

14

37

16K

Muqeeth10 retweeted

Kawin Ethayarajh

@ethayarajh

4 months ago

AI is changing economics, and --- as we just saw in Dwarkesh's interview with Dario --- AI researchers need to start thinking about economics too! The Center for Applied AI at UChicago will be hosting an AI & Economics Summer Institute to explore exactly this. We will bring together leading researchers with advanced graduate students in economics/AI/ML/NLP for an in-person program between Aug 6 - 11.

ethayarajh's tweet photo. AI is changing economics, and --- as we just saw in Dwarkesh's interview with Dario --- AI researchers need to start thinking about economics too!

The Center for Applied AI at UChicago will be hosting an AI & Economics Summer Institute to explore exactly this.

We will bring together leading researchers with advanced graduate students in economics/AI/ML/NLP for an in-person program between Aug 6 - 11.

6

201

45

149

37K

Muqeeth10 retweeted

Ian Gemp @drimgemp

5 months ago

Have you been using LLMs to play games, negotiate salaries, or strategize in other ways? Whether it worked or not, we want to see your demo at our “Strategic Engineering” workshop (https://t.co/gkYhWk2kHK) at #AAMAS2026 in Cyprus! Starter library @ https://t.co/ZwYYkS9ccW!

1

9

2

1K

Who to follow

Sharut Gupta

@sharut_gupta

PhD @MIT_CSAIL | Previously @GoogleDeepMind (Gemini), @AIatMeta | BTech @iitdelhi

Pratik Joshi

@Roprajo

Research Engineer @GoogleDeepMind | Teaching machines to code | Prev @LTIatCMU @GoogleAI, @MSFTResearch @BITSPilaniGoa

Jialu Li

@JialuLi96

Applied Scientist @Adobe; Previous @unccs @Cornell_CS; Past intern @Amazon @Apple @Google. Working on VLN, image generation, multi-modal LLM.

Muqeeth10 retweeted

Malikeh Ehghaghi

@Malikeh5

5 months ago

📢 I am excited to announce that our paper, "TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior," is now live both on Hugging Face and arXiv. 🖇️ arXiv Page: https://t.co/DfVBJ35udf 🤗 HF Org: https://t.co/l1mtLQ2gTW #LLM #NLP #Tokenization

Malikeh5's tweet photo. 📢 I am excited to announce that our paper, "TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior," is now live both on Hugging Face and arXiv.

🖇️ arXiv Page: https://t.co/DfVBJ35udf
🤗 HF Org: https://t.co/l1mtLQ2gTW

#LLM #NLP #Tokenization https://t.co/3D9z0BlQUl

1

10

3

2

724

Muqeeth @Muqeeth10

6 months ago

@dvnxmvl_hdf5 As the game is played repeatedly, agent can display reciprocity across rounds : cooperate when other player cooperates and retaliate when the other player defects last round. Since the values of items are public in this specific game, it is possible to do so.

0

2

0

336

Muqeeth @Muqeeth10

6 months ago

New preprint! Learning Robust Social Strategies with Large Language Models. We apply multi-agent RL finetuning to train LLMs that achieve cooperative and non-exploitable behavior in social dilemmas for the first time. 📄 https://t.co/lMKxJ4XoBx 🧵 ⬇️ (1/8)

Muqeeth10's tweet photo. New preprint! Learning Robust Social Strategies with Large Language Models. We apply multi-agent RL finetuning to train LLMs that achieve cooperative and non-exploitable behavior in social dilemmas for the first time.

📄 https://t.co/lMKxJ4XoBx
🧵 ⬇️
(1/8) https://t.co/27HbZvP3Ly

1

21

14

3

2K

Muqeeth @Muqeeth10

6 months ago

You can run multi-agent RL training for LLMs right away with our public code: https://t.co/DULDxwNBjl. This work was done with my awesome group members @Dereck_Piche*, @muqeeth10*, @MAghajohari, @JuanDuquevan, @mnoukhov, and @AaronCourville.(8/8)

0

7

0

242

Muqeeth @Muqeeth10

6 months ago

AdAlign agents are also robust when facing RL agents trained specifically to exploit them, while GPT-5 nano is exploitable in the same setup. The RL agent ends up cooperating with AdAlign’s tit for tat style policy, since that is its best response. (7/8)

Muqeeth10's tweet photo. AdAlign agents are also robust when facing RL agents trained specifically to exploit them, while GPT-5 nano is exploitable in the same setup. The RL agent ends up cooperating with AdAlign’s tit for tat style policy, since that is its best response. (7/8) https://t.co/yX6rqOcNtv

1

5

0

221

Muqeeth10 retweeted

Anirudh Buvanesh @AnirudhBuvanesh

9 months ago

Zero rewards after tons of RL training? 😞 Before using dense rewards or incentivizing exploration, try changing the data. Adding easier instances of the task can unlock RL training. 🔓📈To know more checkout our blog post here: https://t.co/BPErVcLmP8. Keep reading 🧵(1/n)

2

105

30

112

14K

Muqeeth @Muqeeth10

10 months ago

@esha_hq Good one. I do it after learning from a movie I watched in childhood

0

2

0

47

Muqeeth10 retweeted

Prateek Yadav

@prateeky2806

almost 2 years ago

We just released our survey on "Model MoErging", But what is MoErging?🤔Read on! Imagine a world where fine-tuned models, each specialized in a specific domain, can collaborate and "compose/remix" their skills using some routing mechanism to tackle new tasks and queries! 🧵👇 co first-author @colinraffel 📰: https://t.co/TgwHuNGly4

prateeky2806's tweet photo. We just released our survey on "Model MoErging", But what is MoErging?🤔Read on!

Imagine a world where fine-tuned models, each specialized in a specific domain, can collaborate and "compose/remix" their skills using some routing mechanism to tackle new tasks and queries!
🧵👇

co first-author @colinraffel
📰: https://t.co/TgwHuNGly4

5

218

44

102

21K

Muqeeth @Muqeeth10

almost 3 years ago

@sourab_m @Tim_Dettmers Thanks for sharing your work. IIUC, the approach in your paper is similar to the Expert Ensemble, which averages expert outputs by activating all experts. SMEAR achieves comparable performance while being significantly cheap by activating just one merged expert per example.

0

1

0

124

Muqeeth @Muqeeth10

almost 3 years ago

Introducing Soft Merging of Experts with Adaptive Routing (SMEAR) for gradient-based training of mixture-of-experts models. SMEAR matches or outperforms prior routing methods without increasing costs or relying on task metadata. 📄 https://t.co/guwwrV2BZg 🧵 ⬇️ (1/7)

Muqeeth10's tweet photo. Introducing Soft Merging of Experts with Adaptive Routing (SMEAR) for gradient-based training of mixture-of-experts models. SMEAR matches or outperforms prior routing methods without increasing costs or relying on task metadata.

📄 https://t.co/guwwrV2BZg
🧵 ⬇️
(1/7) https://t.co/mFgFMFmTep

3

170

40

90

36K

Muqeeth @Muqeeth10

almost 3 years ago

@KhanovMax That's correct! Having homogeneous experts is a simpler and more common approach. :)

0

1

0

149

Muqeeth @Muqeeth10

almost 3 years ago

@kleptid Therefore, the peak memory cost arises from the inner activations num_tokens * hidden_dim, rather than the merged experts, and is same as other methods. Token-level routing with SMEAR is mathematically equivalent to ensembles. Please refer our paper for discussion on this topic.

0

57

Muqeeth @Muqeeth10

almost 3 years ago

@kleptid In SMEAR, example-level rating is used, with memory cost of merged expert: hidden_dim . expert_ffn_dim. Additionally, the expert_ffn_dim is an order of magnitude smaller than hidden_dim due to our use of parameter-efficient modules as experts.

1

0

35

Muqeeth

@Muqeeth10

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users