Katherine Hermann

Eghbal Hosseini @eghbal_hosseini

3 months ago

Pleased to share that our paper "Representation Biases: Variance is Not Always a Good Proxy for Importance" is now out as Theory/New Concepts paper in eNeuro! Thread:

AndrewLampinen's tweet photo. Pleased to share that our paper "Representation Biases: Variance is Not Always a Good Proxy for Importance" is now out as Theory/New Concepts paper in eNeuro! Thread: https://t.co/slZyrAPGG7

2

120

16

81

11K

khermann_ retweeted

4 months ago

How do diverse context structures reshape representations in LLMs? In our new work, we explore this via representational straightening. We found LLMs are like a Swiss Army knife: they select different computational mechanisms reflected in different representational structures. 1/

eghbal_hosseini's tweet photo. How do diverse context structures reshape representations in LLMs?
In our new work, we explore this via representational straightening. We found LLMs are like a Swiss Army knife: they select different computational mechanisms reflected in different representational structures. 1/ https://t.co/1QrUIy0PTS

1

87

19

61

12K

Lead Scientist, Verifiable AI Lab of @Apodex_AI. Previously: Research Scientist @FAIR, Postdoc @Caltech, PhD @PrincetonCS, Undergrad @Tsinghua_Uni.

6 months ago

@ermgrant @tweetsatpreet @UAlberta @AmiiThinks @CogInterp @DataOnBrainMind Congratulations, Erin 🎉

0

2

0

183

Who to follow

Kaiyu Yang

@KaiyuYang4

Jascha Sohl-Dickstein

@jaschasd

Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.

Daniel Yamins

@dyamins

CS, psych, and neuro prof @ Stanford. NeuroAI and "regular AI". Also harpsichords and bonsai. https://t.co/xCFbmgT6TG

khermann_ retweeted

Goodfire

@GoodfireAI

7 months ago

LLMs memorize a lot of training data, but memorization is poorly understood. Where does it live inside models? How is it stored? How much is it involved in different tasks? @jack_merullo_ & @srihita_raju's new paper examines all of these questions using loss curvature! (1/7)

GoodfireAI's tweet photo. LLMs memorize a lot of training data, but memorization is poorly understood.

Where does it live inside models? How is it stored? How much is it involved in different tasks?

@jack_merullo_ & @srihita_raju's new paper examines all of these questions using loss curvature! (1/7) https://t.co/w0UWnoBOsX

11

809

136

751

193K

khermann_ retweeted

Ilia Sucholutsky @sucholutsky

8 months ago

🧵🎉 Our mega-paper is finally published in TMLR! We're "Getting Aligned on Representational Alignment" - the degree to which internal representations of different (biological & artificial) information processing systems agree. 🧠🤖🔬🔍 #CognitiveScience #Neuroscience #AI

sucholutsky's tweet photo. 🧵🎉 Our mega-paper is finally published in TMLR! We're "Getting Aligned on Representational Alignment" - the degree to which internal representations of different (biological & artificial) information processing systems agree. 🧠🤖🔬🔍 #CognitiveScience #Neuroscience #AI https://t.co/ciLDCuXwyH

5

150

37

79

34K

khermann_ retweeted

Michael C. Mozer @mc_mozer

8 months ago

[1/4] As you read words in this text, your brain adjusts fixation durations to facilitate comprehension. Inspired by human reading behavior, we propose a supervised objective that trains an LLM to dynamically determine the number of compute steps for each input token.

mc_mozer's tweet photo. [1/4] As you read words in this text, your brain adjusts fixation durations to facilitate comprehension. Inspired by human reading behavior, we propose a supervised objective that trains an LLM to dynamically determine the number of compute steps for each input token. https://t.co/lunDqm8C3N

4

27

10

6

3K

10 months ago

@drjuliashaw @AdamRutherford Congratulations, @drjuliashaw! Can't wait to read it!

0

1

0

142

khermann_ retweeted

10 months ago

Many representational analyses (implicitly) prioritize signals by the amount of variance they explain in the representations. However, in https://t.co/NgjqF8Chzs we discuss results from our prior work that challenge this assumption; variance != computational importance.

1

31

3

15

2K

khermann_ retweeted

10 months ago

In neuroscience, we often try to understand systems by analyzing their representations — using tools like regression or RSA. But are these analyses biased towards discovering a subset of what a system represents? If you're interested, check out our new commentary! Thread:

AndrewLampinen's tweet photo. In neuroscience, we often try to understand systems by analyzing their representations — using tools like regression or RSA. But are these analyses biased towards discovering a subset of what a system represents? If you're interested, check out our new commentary! Thread: https://t.co/2gevMrpH4b

5

360

64

263

34K

khermann_ retweeted

11 months ago

🚀 New Open-Source Release! PyTorchTNN 🚀 A PyTorch package for building biologically-plausible temporal neural networks (TNNs)—unrolling neural network computation layer-by-layer through time, inspired by cortical processing. PyTorchTNN naturally integrates into the Encoder-Attender-Decoder (EAD) architecture (Chung*, Shen* et al., 2025), which flexibly combines diverse neural networks, motivated by the fact that no single model (Transformer, SSM, RNN) dominates all sequence learning tasks. 🧵👇

1

182

40

126

21K

khermann_ retweeted

about 1 year ago

Our first NeuroAgent! 🐟🧠 Excited to share new work led by the talented @rdkeller, showing how autonomous behavior and whole-brain dynamics emerge naturally from intrinsic curiosity grounded in world models and memory. Some highlights: - Developed a novel intrinsic drive (3M-Progress) that better matches the reliable autonomy of animals - First task-optimized model of neural-glial computation - Surprisingly, no linear regression needed: a simple 1-to-1 mapping was enough to pass the NeuroAI Turing Test on whole-brain zebrafish data (~130,000 recorded units), provided you have the right intrinsic drive of course! Check it out! 👇

4

169

33

82

16K

about 1 year ago

@ElizaKosoy Congratulations, Eliza! 🎉

0

1

0

111

khermann_ retweeted

Kelsey Allen @KelseyRAllen

about 1 year ago

How do language models generalize from information they learn in-context vs. via finetuning? We show that in-context learning can generalize more flexibly, illustrating key differences in the inductive biases of these modes of learning — and ways to improve finetuning. Thread: 1/

AndrewLampinen's tweet photo. How do language models generalize from information they learn in-context vs. via finetuning? We show that in-context learning can generalize more flexibly, illustrating key differences in the inductive biases of these modes of learning — and ways to improve finetuning. Thread: 1/ https://t.co/1uxOU0b988

8

760

150

679

103K

khermann_ retweeted

about 1 year ago

Humans can tell the difference between a realistic generated video and an unrealistic one – can models? Excited to share TRAJAN: the world’s first point TRAJectory AutoeNcoder for evaluating motion realism in generated and corrupted videos. 🌐 https://t.co/ytEmuAPcYa 🧵

3

63

12

22

18K

about 1 year ago

@DynamicWebPaige For CA: Mount Langley (Southern Sierras) and Mount Tallac (Desolation Wilderness) are both really nice

0

1

0

129

Lukas Muttenthaler @lukas_mut

about 1 year ago

Congratulations, Lukas! 🎉

about 1 year ago

This past Friday I successfully defended my PhD 🎉🙏🏼 What a journey it was! 4.5 years of many ups and many downs. Can’t believe it’s over. I am still processing… Special thanks to my wonderful committee KR Müller, @martin_hebart, @cpilab, and @scychan_brains!

lukas_mut's tweet photo. This past Friday I successfully defended my PhD 🎉🙏🏼 What a journey it was! 4.5 years of many ups and many downs. Can’t believe it’s over. I am still processing…

Special thanks to my wonderful committee KR Müller, @martin_hebart, @cpilab, and @scychan_brains! https://t.co/7kjekrnznM

14

108

3

6

10K

1

4

0

775

khermann_ retweeted

Thomas Fel

@thomas_fel_

about 1 year ago

Train your vision SAE on Monday, then again on Tuesday, and you'll find only about 30% of the learned concepts match. ⚓ We propose Archetypal SAE which anchors concepts in the real data’s convex hull, delivering stable and consistent dictionaries. https://t.co/iaX60GZt0o

thomas_fel_'s tweet photo. Train your vision SAE on Monday, then again on Tuesday, and you'll find only about 30% of the learned concepts match.

⚓ We propose Archetypal SAE which anchors concepts in the real data’s convex hull, delivering stable and consistent dictionaries.

https://t.co/iaX60GZt0o

6

351

79

221

43K

khermann_ retweeted

over 1 year ago

Had a lot of fun speaking with @avileddie about the practical challenges of scaling (especially in Embodied AI), NeuroAI, what to expect in the future, and advice for students getting into the field. Check it out here! https://t.co/HGaC6IwMRs

0

33

9

18

4K

khermann_ retweeted