Michael Kleinman @MichaelKleinman - Twitter Profile

Pinned Tweet

almost 2 years ago

Excited to share our @iclr_2024 spotlight paper. Our work shows that critical learning periods exist in a minimal analytically tractable model of artificial deep networks (deep linear networks) trained with SGD. Paper: https://t.co/Qb6TyfiWEZ Work w/ A. Achille & S. Soatto

1

9

4

0

1K

MichaelKleinman retweeted

Aditya Chattopadhyay

@achatto1994

about 1 month ago

#CVPR2026 is around the corner and we're excited to share Gated KalmanNet: A Fading Memory Layer through Test-Time Ridge Regression. Looking forward to meeting everyone who wants to learn more. Gated KalmaNet (GKA, pronounced "gee-ka") generalizes Mamba-2 and Gated DeltaNet, and outperforms both under identical training conditions. It also works beyond language: swapping the Mamba layer in MambaVision for GKA improves ImageNet accuracy with no vision-specific tuning. 1/4

1

8

5

0

913

MichaelKleinman retweeted

Prannay Kaul

@PrannayKaul

about 1 month ago

Introducing Priming Hybrid models are faster and cheaper than Transformers to scale. But developing alternative architectures from scratch requires expensive pre-training runs. Priming solves this by leveraging pre-trained Transformer weights to train equally performant Hybrid models with 2× faster throughput. Builders can now iterate on Hybrid architectures for under 150B tokens, 100× cheaper than pre-training. 1/12

1

15

7

5

1K

MichaelKleinman retweeted

Daniel Kunin @KuninDaniel

almost 2 years ago

🌟Announcing NeurIPS spotlight paper on the transition from lazy to rich🔦 We reveal through exact gradient flow dynamics how unbalanced initializations promote rapid feature learning co-led @AllanRaventos and @ClementineDomi6 @FCHEN_AI @klindt_david @SaxeLab @SuryaGanguli

5

236

40

137

65K

Who to follow

We reinvent how people discover, play, and enjoy chess ♟️ Join our 1m+ community ⬇️

alireza

@probablisticboy

thinking about neural computations

MichaelKleinman retweeted

Jakub Smékal @jakub_smekal

almost 2 years ago

Excited to share the first paper of my PhD: Towards a theory of learning dynamics in deep state space models https://t.co/OMX0yTDlJw with @jimmysmith1919, @MichaelKleinman, @dan_biderman, and @scott_linderman. Accepted as a Spotlight talk at the NGSM workshop at ICML 2024!

5

155

21

75

14K

MichaelKleinman retweeted

David Sussillo

@SussilloDavid

almost 2 years ago

1/5 Excited to finally share our new paper (led by @lndriscoll, now a group leader at the Allen!) in @NatureNeuro on modular computation in neural networks! We've explored how artificial recurrent networks handle multiple tasks, offering insights into flexible computation. #tweeprint https://t.co/Yur7HxwM4U

6

295

84

101

37K

MichaelKleinman retweeted

Nima Hadidi @nrhadidi

almost 2 years ago

What do LLMs map to in the brain? In some datasets, not much. We emphasize the need for simple controls when analyzing the neural predictivity of trained and untrained LLMs. https://t.co/R8aHBwlmfC In collaboration w/ @ebrahim_feghhi, supervised by @IbanDlank and @JonathanCKao

nrhadidi's tweet photo. What do LLMs map to in the brain? In some datasets, not much. We emphasize the need for simple controls when analyzing the neural predictivity of trained and untrained LLMs.

https://t.co/R8aHBwlmfC

In collaboration w/ @ebrahim_feghhi, supervised by @IbanDlank and @JonathanCKao https://t.co/aZrZTVlvkR

3

18

4

7

2K

Michael Kleinman @MichaelKleinman

almost 2 years ago

From a neuroscience perspective our analysis provides an alternative explanation of critical periods that does not hinge on biochemical changes in plasticity, but is rather fundamental to a dynamical learning process. Paper: https://t.co/Qb6TyfiWEZ Code: https://t.co/y5gDD9kasN

0

2

0

94

Michael Kleinman @MichaelKleinman

almost 2 years ago

Excited to share our @iclr_2024 spotlight paper. Our work shows that critical learning periods exist in a minimal analytically tractable model of artificial deep networks (deep linear networks) trained with SGD. Paper: https://t.co/Qb6TyfiWEZ Work w/ A. Achille & S. Soatto

1

9

4

0

1K

Michael Kleinman @MichaelKleinman

almost 2 years ago

Overall, our analysis shows that critical periods in deep networks depend primarily on two main factors: the depth of the model and the structure of the data distribution.

1

2

0

110

Michael Kleinman @MichaelKleinman

about 5 years ago

I'm going to be presenting our work on defining a notion of "usable information" and using it to study how optimal representations emerge during NN training! Today at 5pm PDT at ICLR 2021. https://t.co/rVsmVv4MLR. W/ Alessandro Achille, Daksh Idnani, @JonathanCKao

0

5

2

0

MichaelKleinman retweeted

Jonathan Kao @JonathanCKao

over 6 years ago

Check out our new preprint on using multi-area recurrent neural networks to better understand decision-making. This is joint work with first author @MichaelKleinman and co-senior author @ChandMuse. https://t.co/OuN3u6dvcG

2

34

15

6

0