Josh McClellan

@JoshMcClellan0

I research generalization and multi-agent systems in reinforcement learning

Joined April 2020

79 Following

21 Followers

36 Posts

Josh McClellan @JoshMcClellan0

3 months ago

@ColeJacksonFB Look at his pff 2024 though (not 23 my bad) "Simpson had career-high grades across the board, earning a 77.3 overall grade, the 13th highest among guards, and a 79.2 run blocking grade, which also ranked top 10 at the position."

Josh McClellan @JoshMcClellan0

3 months ago

@ColeJacksonFB Oops sorry, meant 2024. According to pff, which is what you posted, "Simpson had career-high grades across the board, earning a 77.3 overall grade, the 13th highest among guards, and a 79.2 run blocking grade, which also ranked top 10 at the position."

120

JoshMcClellan0 retweeted

Robert Youssef

@rryssf

4 months ago

ICLR 2025 just gave an Outstanding Paper Award to a method that fixes model editing with one line of code 🤯 here's the problem it solves: llms store facts in their parameters. sometimes those facts are wrong or outdated. "model editing" lets you surgically update specific facts without retraining the whole model. the standard approach: find which parameters encode the fact (using causal tracing), then nudge those parameters to store the new fact. works great for one edit. but do it a hundred times in sequence and the model starts forgetting everything else. do it a thousand times and it degenerates into repetitive gibberish. every edit that inserts new knowledge corrupts old knowledge. you're playing whack-a-mole with the model's memory. AlphaEdit reframes the problem. instead of asking "how do we update knowledge with less damage?" the authors ask "how do we make edits mathematically invisible to preserved knowledge?" the trick: before applying any parameter change, project it onto the null space of the preserved knowledge matrix. in plain english: find the directions in parameter space where you can move freely without affecting anything the model already knows. only move in those directions. it's like remodeling one room in a house by only touching walls that aren't load-bearing. the rest of the structure doesn't even know anything changed. the results from Fang et al. across GPT2-XL, GPT-J, and LLaMA3-8B: > average 36.7% improvement over existing editing methods > works as a plug-and-play addition to MEMIT, ROME, and others > models maintain 98.48% of general capabilities after 3,000 sequential edits > prevents the gibberish collapse that kills other methods at scale and the implementation is literally one line of code added to existing pipelines. what i find genuinely elegant: the paper proves mathematically that output remains unchanged when querying preserved knowledge. this isn't "it works better in practice." it's "we can prove it doesn't touch what it shouldn't." the honest caveats: largest model tested was LLaMA3-8B. nobody's shown this works at 70B+ scale yet. a follow-up paper (AlphaEdit+) flagged brittleness when new knowledge directly conflicts with preserved knowledge, which is exactly the hardest case in production. and the whole approach assumes causal tracing correctly identifies where facts live, which isn't always clean. but as a core insight, this is the kind of work that deserves the award. not because it solves everything. because it changes the question. the era of "edit and pray" for llm knowledge updates might actually be ending.

rryssf's tweet photo. ICLR 2025 just gave an Outstanding Paper Award to a method that fixes model editing with one line of code 🤯

here's the problem it solves:

llms store facts in their parameters. sometimes those facts are wrong or outdated. "model editing" lets you surgically update specific facts without retraining the whole model.

the standard approach: find which parameters encode the fact (using causal tracing), then nudge those parameters to store the new fact.

works great for one edit. but do it a hundred times in sequence and the model starts forgetting everything else. do it a thousand times and it degenerates into repetitive gibberish.

every edit that inserts new knowledge corrupts old knowledge. you're playing whack-a-mole with the model's memory.

AlphaEdit reframes the problem.

instead of asking "how do we update knowledge with less damage?" the authors ask "how do we make edits mathematically invisible to preserved knowledge?"

the trick: before applying any parameter change, project it onto the null space of the preserved knowledge matrix.

in plain english: find the directions in parameter space where you can move freely without affecting anything the model already knows. only move in those directions.

it's like remodeling one room in a house by only touching walls that aren't load-bearing. the rest of the structure doesn't even know anything changed.

the results from Fang et al. across GPT2-XL, GPT-J, and LLaMA3-8B:

> average 36.7% improvement over existing editing methods
> works as a plug-and-play addition to MEMIT, ROME, and others
> models maintain 98.48% of general capabilities after 3,000 sequential edits
> prevents the gibberish collapse that kills other methods at scale

and the implementation is literally one line of code added to existing pipelines.

what i find genuinely elegant: the paper proves mathematically that output remains unchanged when querying preserved knowledge. this isn't "it works better in practice." it's "we can prove it doesn't touch what it shouldn't."

the honest caveats:

largest model tested was LLaMA3-8B. nobody's shown this works at 70B+ scale yet. a follow-up paper (AlphaEdit+) flagged brittleness when new knowledge directly conflicts with preserved knowledge, which is exactly the hardest case in production. and the whole approach assumes causal tracing correctly identifies where facts live, which isn't always clean.

but as a core insight, this is the kind of work that deserves the award. not because it solves everything. because it changes the question.

the era of "edit and pray" for llm knowledge updates might actually be ending.

612

560

51K

JoshMcClellan0 retweeted

Max Simchowitz

@max_simchowitz

6 months ago

⏰⏰ New Science of Robot Learning Paper: "Much Ado About Noising." TL;DR we answer why generative models, like flow and diffusion models, actually work for robotic control tasks🤖🤖 (hint: its not multimodality). This leads to a new minimal iterative policy (MIP) that matches flow models with much faster inference🚄🚄 Check out @ChaoyiPan 's thread and paper to find out more. Amazing work by @ChaoyiPan, together with @GuanyaShi , @nmboffi , @guannanqu. Come find us at NeurIPS to chat more!

199

124

20K

Who to follow

Richard Gibson

@Richard01220284

Retired guitar slinger, cat wrangler.

小宝

@xiaobao_bao52

女sub｜02 ｜屁股每日三痒选手

JoshMcClellan0 retweeted

Andrew Davison @AjdDavison

10 months ago

I still don't believe that Spatial AI is a big data problem. Massive data won't unlock robotics. Representation is still the hard bit. We need efficient, composable representation of 3D physical scenes to enable mental simulation like humans use to plan creative uses of objects.

353

147

44K

JoshMcClellan0 retweeted

Michael Lutter @_mlutter

about 1 year ago

My team at Boston Dynamics has open Research Scientist positions! We are hiring full-time and interns. If you are excited about RL & humanoids, we want to get to know you! Research Scientist: https://t.co/Jgw3RHf2q7 Research Scientist Intern: https://t.co/4VNK8qVi1S

317

208

34K

JoshMcClellan0 retweeted

François Chollet

@fchollet

about 1 year ago

Much of the field obsesses over end-to-end learning. But strong generalization requires compositionality: building modular, reusable abstractions, and reassembling them on the fly when faced with novelty. The models of the future won't be just pipes, they will be Lego castles.

163

362

137K

Josh McClellan @JoshMcClellan0

over 1 year ago

@TacoCohen I would love to chat if you have time. We will also be presenting a relevant poster on using equivariance in Multi agent RL https://t.co/SeCDBtDgZV

143

Josh McClellan @JoshMcClellan0

over 1 year ago

#NeurIPS #MARL #AI #ReinforcementLearning #MachineLearning #Equivariance #GraphNeuralNetworks

Josh McClellan @JoshMcClellan0

over 1 year ago

I'm excited to share that our paper, "Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance," has been accepted to NeurIPS 2024! 🎉 https://t.co/SeCDBtDgZV @ptokekar @furongh

JoshMcClellan0's tweet photo. I'm excited to share that our paper, "Boosting Sample Efficiency and Generalization in Multi-agent Reinforcement Learning via Equivariance," has been accepted to NeurIPS 2024! 🎉 https://t.co/SeCDBtDgZV
@ptokekar @furongh https://t.co/BPDVHk45lE

Josh McClellan @JoshMcClellan0

over 1 year ago

This robustness stems directly from its symmetry guarantees, allowing it to lose less performance when adapting to new scenarios. If you'll be at Neurips come visit our poster next week to learn more and discuss the exciting future of MARL!

Josh McClellan @JoshMcClellan0

almost 2 years ago

@PierreAblin Thanks for sharing. I'm curious how this changed after rebuttals

162

Josh McClellan @JoshMcClellan0

almost 2 years ago

@Brendan_Duke Is this average or median for wages?

Josh McClellan

@JoshMcClellan0

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users