Jiaxing Wu @crabshellman - Twitter Profile

Pinned Tweet

11 days ago

Historic moment! This reminds me that I just really, really want to be a part of building an independent Mech Interp lab in China.

Neo Research @NeoResearchAI

12 days ago

We're Neo Research (新衡). Asia’s first independent frontier AI safety evaluation & research lab. Today we're publishing our first report: an independent safety evaluation of DeepSeek v4 Pro. (1/5)

19

786

88

386

107K

0

20

Jiaxing Wu @crabshellman

11 days ago

Reviewing last winter's notes and realizing I spent an unreasonable amount of time alone with activations. Time flies, and I have zero evidence to prove this research ever happened.

crabshellman's tweet photo. Reviewing last winter's notes and realizing I spent an unreasonable amount of time alone with activations. Time flies, and I have zero evidence to prove this research ever happened. https://t.co/uNpe2cpbig

0

12

Jiaxing Wu @crabshellman

12 days ago

Interpretability is not just for safety.

0

17

Jiaxing Wu @crabshellman

19 days ago

"Interpretability" is a misnomer. We study reaction mechanisms and call it Chemistry. We study motion mechanisms and call it Physics. We study computation in biological neural networks and call it neuroscience.We aren't just doing "Mechanistic Interpretability." This is Aiology.

1

0

40

crabshellman retweeted

Ningyu Xu @xny_ele

8 months ago

Our paper is now out in PNAS!💡 Are LLMs developing human-like concepts that are central to human cognition? If so, how are such concepts represented, organized, and related to behavior? https://t.co/ZnLIevdN4j 1/N

xny_ele's tweet photo. Our paper is now out in PNAS!💡

Are LLMs developing human-like concepts that are central to human cognition? If so, how are such concepts represented, organized, and related to behavior?

https://t.co/ZnLIevdN4j

1/N https://t.co/3tpZzFdhql

1

5

2

1

984

crabshellman retweeted

Chris Olah

@ch402

20 days ago

https://t.co/udIVxLdid5

155

3K

552

1K

267K

crabshellman retweeted

Viktor Moskvoretskii @Vitya_Vitalich

23 days ago

New paper! 🧵 Post-training doesn't build the Assistant, it just turns up the volume on personas that pretraining already laid down, at 0.22% of total tokens! We traced them across OLMo-3 and Apertus here's what we found👇

Vitya_Vitalich's tweet photo. New paper! 🧵

Post-training doesn't build the Assistant, it just turns up the volume on personas that pretraining already laid down, at 0.22% of total tokens!

We traced them across OLMo-3 and Apertus here's what we found👇 https://t.co/PkBWNhZO0K

6

93

13

87

31K

crabshellman retweeted

Core Francisco Park

@corefpark

25 days ago

🚨 New Paper! (Part 1: Pretraining) Many recent works show beautiful representational geometry in neural networks. But what controls the geometry of world representations during pretraining? We decouple the world from data to study this in a controlled setup. 1/n

12

578

81

439

47K

crabshellman retweeted

Rui Jackie Lin @jackielvnut

about 2 months ago

♟️🧐How can a Chess Transformer reach — or even surpass — human grandmaster-level play with only a single forward pass? We study BT4, the strongest and most stable open-source model of Leela Chess Zero. And we adapted Transcoders and Lorsas, showing that sparse replacement layers work on BT4, which can reveal interpretable computational features across MLP and attention modules. This brings us one step closer to sparsifying and interpreting an entire Chess Transformer — and understanding what makes it so strong!👇 #AI #ML #MechInterp #Chess

jackielvnut's tweet photo. ♟️🧐How can a Chess Transformer reach — or even surpass — human grandmaster-level play with only a single forward pass? We study BT4, the strongest and most stable open-source model of Leela Chess Zero. And we adapted Transcoders and Lorsas, showing that sparse replacement layers work on BT4, which can reveal interpretable computational features across MLP and attention modules. This brings us one step closer to sparsifying and interpreting an entire Chess Transformer — and understanding what makes it so strong!👇 #AI #ML #MechInterp #Chess

4

223

29

177

17K

crabshellman retweeted

Jess Riedel

@Jess_Riedel

2 months ago

Stoked to release this first meaty post in a series describing our vision for the Alignment journal. Many thanks to the authors and contributors: @danielmurfet , @dan_mackinlay , @geoffreyirving , @mhutter42 , @Lang__Leon , Gautam Kamath, Konstantinos Voudouris, Edmund Lau, Alexander Gietelink Oldenziel, and Seth Lazar. @AlignmentJrnl

Jess_Riedel's tweet photo. Stoked to release this first meaty post in a series describing our vision for the Alignment journal.

Many thanks to the authors and contributors: @danielmurfet , @dan_mackinlay , @geoffreyirving , @mhutter42 , @Lang__Leon , Gautam Kamath, Konstantinos Voudouris, Edmund Lau, Alexander Gietelink Oldenziel, and Seth Lazar. @AlignmentJrnl

1

44

12

18

7K

crabshellman retweeted

NASA

@NASA

2 months ago

Liftoff. The Artemis II mission launched from @NASAKennedy at 6:35pm ET (2235 UTC), propelling four astronauts on a journey around the Moon. Artemis II will pave the way for future Moon landings, as well as the next giant leap — astronauts on Mars.

4K

178K

55K

11K

14M

crabshellman retweeted

Fazl Barez @FazlBarez

3 months ago

If this policy is not revoked, I won’t be reviewing/ACing for #NeurIPS Science requires open exchange of ideas! When participation gets shaped by geopolitics, it ends up reflecting power structures, not merit--narrows what science can be and powerful nations get full control!

5

248

19

12

25K

crabshellman retweeted

Mark Rofin @broccolitwit

3 months ago

8/ 🔵 Pre-caching: the representation at position i also gets gradients from predicting tokens at positions j > i+1, as future attention heads can attend back to position i and read from it. So the model is incentivized to "prepare" useful info for the future!

1

10

2

1

537

crabshellman retweeted

Palli Thordarson @PalliThordarson

3 months ago

In the human health space, Rosie's story demonstrates that we can "democratise" the process of designing cancer vaccine. While genomic analysis & RNA production will continue to be specialised they could turn into pure service provision, especially as automation increases. /5

2

114

6

6K

Jiaxing Wu @crabshellman

3 months ago

@julianharris @iannuttall @RayFernando1337 Cool, thanks！

0

18

crabshellman retweeted

Tianxiang Sun @tianxiangsun

4 months ago

FARS is automatically doing AI research: https://t.co/t8fBcqQfWh

0

4

1

0

223

Jiaxing Wu @crabshellman

4 months ago

We can identify a 9D helix beyond our imagination that happens to manifest such elegant properties when projected into a lower-dimensional subspace we live.

crabshellman's tweet photo. We can identify a 9D helix beyond our imagination that happens to manifest such elegant properties when projected into a lower-dimensional subspace we live. https://t.co/wPzvtyiaI2

Subhash Kantamneni

@thesubhashk

over 1 year ago

(1/N) LLMs represent numbers on a helix? And use trigonometry to do addition? Answers below 🧵

21

939

155

765

218K

0

1

0

1

113

Jiaxing Wu @crabshellman

4 months ago

@jamesaoldfield @philiptorr @ioannispatras @Adel_Bibi @FazlBarez Congrats!

1

2

0

70

Jiaxing Wu @crabshellman

4 months ago

@Ziyue54058032 I would say this is a masterpiece.

0

10

Jiaxing Wu @crabshellman

4 months ago

@karpathy People always wanted to simulate society like Stanford AI Town. But an agent internet like Moltbook might be the most accessible approach: you don't need to model a complex physical world. The social media substrate is structural, text-based, and we already know how to build it.

1

0

636

Jiaxing Wu

@crabshellman

Last Seen Users on Sotwe

Trends for you

Most Popular Users