Jamie Simon @learning_mech - Twitter Profile

11 days ago

I thoroughly enjoyed reading this recent paper by @yasamanbb et al (https://t.co/nU3X6KW3pT) that derives analytically why certain latent variables must lead to geometry in word embeddings. (getting Fourier modes even with open boundary but exponential kernel is neat!) I think it would be great to compare this to some of @prfsanjeevarora et al's work on this (eg https://t.co/UpK9DTEC03) More broadly, I have been thinking about the right data generating process for language. For vision, we have latent spaces with great manifold structure (eg the SO3 pose of an object) and nonlinear mixing functions. But for language? Are there really any continuous latent variables? What is the "DSprites" of language? Is it all just co-occurrence stats or is there something more in LLM word embeddings?

klindt_david's tweet photo. I thoroughly enjoyed reading this recent paper by @yasamanbb et al (https://t.co/nU3X6KW3pT) that derives analytically why certain latent variables must lead to geometry in word embeddings. (getting Fourier modes even with open boundary but exponential kernel is neat!) I think it would be great to compare this to some of @prfsanjeevarora et al's work on this (eg https://t.co/UpK9DTEC03)

More broadly, I have been thinking about the right data generating process for language. For vision, we have latent spaces with great manifold structure (eg the SO3 pose of an object) and nonlinear mixing functions. But for language? Are there really any continuous latent variables? What is the "DSprites" of language? Is it all just co-occurrence stats or is there something more in LLM word embeddings?

5

498

58

442

28K

Jamie Simon @learning_mech

20 days ago

@KrzakalaF ah, lovely! I quite like this sort of "posit a simpler dynamics and study it" sort of approach. very physicsy! I like Misha Belkin's NFA/RFM stuff (which this reminds me of) for the same reason. possible path towards an answer to our Open Dir 1?? https://t.co/ELfes2pYFG

0

5

1

4

537

Jamie Simon @learning_mech

23 days ago

@electro_vansh ah, thx! unf don't think I'll have the time, but honored :)

0

11

Jamie Simon @learning_mech

about 1 month ago

did you know that with a few modifications, you can get the Ising model to simulate cells fighting to the death? one of my favorite side projects of all time: https://t.co/EkuplsbV8o

11

603

80

327

63K

Jamie Simon @learning_mech

23 days ago

@alexdong @imbue_ai @KuninDaniel https://t.co/anJEkgz1tP :)

0

1

0

18

learning_mech retweeted

Good Work

@goodworkmb

27 days ago

Leaked Sam Altman messages (2023)

10

455

19

34

70K

learning_mech retweeted

Imbue

@imbue_ai

29 days ago

Mechanistic interpretability aspires to be the biology of deep learning. @KuninDaniel and @learning_mech say that an emerging theory of deep learning they and their team call 🛠️ learning mechanics 🛠️ will be the physics.

2

22

3

9

2K

learning_mech retweeted

varun

@varunneal

about 1 month ago

crazy how you can pinpoint the exact curvature of a trillion dimensional model based on how wiggly the loss curve is

2

157

10

91

14K

Jamie Simon @learning_mech

about 1 month ago

@MattHausmannAtx (yes, look up the Cellular Potts model)

0

42

Jamie Simon @learning_mech

about 1 month ago

@MattHausmannAtx psh who would want to

1

0

241

Jamie Simon @learning_mech

about 1 month ago

@reson8Labs oh they can hug

1

0

350

Jamie Simon @learning_mech

about 1 month ago

@guzmansalv me too. my favorite method is having an immune system!

0

2

0

479

Jamie Simon @learning_mech

about 1 month ago

for the bright-eyed and bushy-tailed: there's a Learning Mechanics discord! young academics who want to do research in this area should especially consider joining + starting convos. https://t.co/kAuVlMQzrd

0

31

2

29

2K

learning_mech retweeted

Daniel Kunin @KuninDaniel

about 1 month ago

Excited to share that our paper “Sequential Group Composition: A Window into the Mechanics of Deep Learning” was accepted to ICML 2026 in Seoul! Co-led with @giovannimarchet and @AdeleMyersPhD @hopfbifurcator @ninamiolane Paper: https://t.co/8HsLrKWtlf

8

237

47

182

72K

learning_mech retweeted

Good Work

@goodworkmb

about 1 month ago

Palantir office speedrun

203

35K

2K

4K

2M

Jamie Simon @learning_mech

about 1 month ago

ditto. props to @justanotherlaw for taking a second look :) (though I also found merit in the criticisms in the first version.) hopeful we can eventually (hopefully soon enough...) make contact w/ AI alignment + governance, whose noble causes we would v much like to aid.

Alex Atanasov @ABAtanasov

about 1 month ago

This is a great post and I especially respect the author for updating his view when presented with new information. I strongly encourage young researchers interested in interpretability, science of DL, and safety to look at it. https://t.co/dbJ5K49GTH

3

160

10

171

12K

0

10

0

1

1K

learning_mech retweeted

Alejandro Saucedo | KubeCon 2025 AI Day Keynote @AxSaucedo

about 1 month ago

It seems we're at a stage where deep learning is evolving from alchemy into an engineering discipline; this is an exciting paper which lays out that a scientific theory is emerging for Deep Learning. Paper: https://t.co/hf8QRIgw3P Tweet: https://t.co/9v6cgCyEeX

AxSaucedo's tweet photo. It seems we're at a stage where deep learning is evolving from alchemy into an engineering discipline; this is an exciting paper which lays out that a scientific theory is emerging for Deep Learning.

Paper: https://t.co/hf8QRIgw3P
Tweet: https://t.co/9v6cgCyEeX https://t.co/eNRK3ezCx8

1

25

3

14

2K

Jamie Simon @learning_mech

about 1 month ago

yeah, totally! I once messaged everyone on facebook with my first and last name. I eventually made a big group chat! v ethnically + geographically diverse. probs the closest I've gotten as an adult to meeting a truly random slice of the US.

Devon ☀️

@devonzuegel

about 1 month ago

Jury selection is cool. It's probably the closest you ever get to seeing a true random sample of the population

2

47

1

2

5K

0

7

0

816

learning_mech retweeted

Tim Duignan @TimothyDuignan

about 1 month ago

Aren’t diffusion models explicitly derived from a correspondence with physics and entirely consistent with how physics says you should model systems over a range of scales ( ie mori zwanzig theory: langevin dynamics with a fitted vector field ? ) what more do you want?

6

94

4

55

18K

Jamie Simon @learning_mech

about 1 month ago

@evergreencqfu nice! replied :)

1

0

62

Jamie Simon @learning_mech

about 1 month ago

thanks for the flag. comments on all pages of https://t.co/WOx6YmKJGt should now be working! don't be shy :)

Changqing Fu @evergreencqfu

about 1 month ago

@learning_mech thanks Jamie! the open Q page’s comment system seems not working :P I tried but the comment will not appear on the page

0

1

0

992

1

8

0

1

967

Jamie Simon

@learning_mech

Last Seen Users on Sotwe

Trends for you

Most Popular Users