Christian Internò @ChrisInterno - Twitter Profile

Pinned Tweet

4 months ago

1/10) We found an observer effect in world models: Invasive adaptation can corrupt latent physics! 🌍 New Paper: https://t.co/WSpVdmJPhx The phenomenon where the act of measuring a system unavoidably alters its state or properties (Heisenberg, 1927 & Sassoli de Bianchi, 2013).

2

160

26

56

276K

ChrisInterno retweeted

Samuele Marro

@MarroSamuele

2 days ago

"Benchmarking at the Edge of Comprehension" is now an Oral at ICML 2026! GPT-3.5 successfully benchmarks GPT-5.2🥳 https://t.co/85cyAkvu5Z

2

34

14

9

6K

ChrisInterno retweeted

alphaXiv

@askalphaxiv

7 days ago

New paper from Yann LeCun! "When Does LeJEPA Learn a World Model?" This paper proves that under Gaussian latent dynamics, LeJEPA can recover the hidden state behind nonlinear observations up to rotation. The intuition is that linear latent features are the most stable across nearby views, while nonlinear features decay faster, so the objective naturally selects the real world variables. The key caveat is that this guarantee holds under specific assumptions, and Gaussian latents are the unique case that guarantees this.

askalphaxiv's tweet photo. New paper from Yann LeCun!

"When Does LeJEPA Learn a World Model?"

This paper proves that under Gaussian latent dynamics, LeJEPA can recover the hidden state behind nonlinear observations up to rotation.

The intuition is that linear latent features are the most stable across nearby views, while nonlinear features decay faster, so the objective naturally selects the real world variables.

The key caveat is that this guarantee holds under specific assumptions, and Gaussian latents are the unique case that guarantees this.

14

879

145

587

44K

ChrisInterno retweeted

Lucas Maes

@lucasmaes_

7 days ago

Would you like to join the research effort on JEPA and World Models easily? After a full year of hard work, we’re excited to finally release stable-worldmodel: an open-source, scalable platform built to accelerate JEPA & World Model research! 📄: https://t.co/gnxGvens5A

lucasmaes_'s tweet photo. Would you like to join the research effort on JEPA and World Models easily?

After a full year of hard work, we’re excited to finally release stable-worldmodel:

an open-source, scalable platform built to accelerate JEPA & World Model research!

📄: https://t.co/gnxGvens5A

38

2K

270

2K

111K

ChrisInterno retweeted

David Klindt

@klindt_david

8 days ago

What does JEPA actually learn? We can finally prove it 🌍 So excited to share our theory of identifiable World Models: LeJEPA recovers the latent variables of the world. Plan in the learned World Model as if it were real, same shortest path. 📄: https://t.co/lC9KK1AxVd

48

2K

259

2K

209K

Christian Internò

@ChrisInterno

22 days ago

Nice initiative @icmlconf :)

0

7

0

167

Christian Internò

@ChrisInterno

28 days ago

Close the laptop and go back to sleep :) @NeurIPSConf

0

1

0

68

Christian Internò

@ChrisInterno

about 1 month ago

🙌🏼

ICML Conference @icmlconf

about 1 month ago

lowk only the PC chairs know the exact time decisions drop, i don't have more info than the rest of you. i'm stressing with y'all, between ICML decisions, the NeurIPS deadline, and everything else going on best of luck to everyone. stay strong, we'll make it through together🫂

15

207

11

7

39K

0

128

ChrisInterno retweeted

Riccardo Cadei @riccardocadeii

about 1 month ago

🚨 ICLR Oral 🚨 Is modern representation learning a valid measurement tool for complex scientific experiments? "Exploratory Causal Inference in SAEnce" proposes a workflow for turning learned sparse representations into exploratory treatment-effect hypothesis testing — with theory and successful translation to experimental ecology. https://t.co/NlonkuheSQ 🎤 Oral — Session 3F, Fri Apr 24, 11:18am 📌 Poster — Pavillon 3 #105, Fri Apr 24, 3:15–5:45pm See you in Rio 🇧🇷

1

15

7

3

774

ChrisInterno retweeted

Marco Fumero @marco_fumero

about 1 month ago

Excited to present our work at #ICLR2026 🇧🇷 Join us on Saturday to explore how dynamical systems emerge in neural network latent spaces and what the resulting vector field reveal about the model and the data🌀 Oral: Sat, Hall 203 A/B, 11:06 AM Poster: Sat, P3 #602, 3-5:45 PM

1

274

33

215

33K

ChrisInterno retweeted

David Klindt

@klindt_david

2 months ago

So excited to finally share this! Linear probes often outperform SAEs, especially out-of-distribution (OOD). @thesubhashk @JoshAEngels et al showed this convincingly (https://t.co/qZtEeId75Y). This prompted @NeelNanda5 and others to de-emphasize SAE research. Empirically, fair enough. But we think the theoretical case for dictionary learning was dismissed too quickly. @oneill_c previously showed SAEs can't do proper sparse coding (https://t.co/aCCUgL37si). @shruti_joshi @vpacela and @isacama_phys took this further and showed how this leads to problems particularly in OOD settings. So the issue may not be with dictionary learning itself, but with the current tools. Here's the core argument: if neural representations are in superposition, i.e. more features than dimensions encoded linearly (https://t.co/ruCrSPtN4U), then linear probes fundamentally cannot be the answer. This is a compressed sensing problem. There's a linear measurement (the representation) and a nonlinear inference procedure (like an SAE encoder) that recovers the higher-dimensional sparse signal. Linear algebra tells us error-free recovery is impossible if decoding is restricted to be linear. (but see this cool work if errors are acceptable https://t.co/lZDnWgodaE) Check out our video: We have some neat demonstrations here. A linear decision boundary in 3D becomes nonlinear in 2D, even though all sparse combinations of latents remain distinguishable. Compressed sensing works: we can, in principle, recover the high-dimensional latent space where linear probes work and generalize OOD. Where does this leave us? With finite data and millions of concepts, simpler methods may perform better for a while. But if we want interpretability and safety methods that work OOD, especially compositional generalization covering all possible jailbreaks and real-world failures, we'll have to build bottom up from the right theory. @kennylpeng @thebasepoint @tegmark @yash_j_sharma @woog09 @livgorton @EkdeepL @thomas_fel_ @nsaphra

4

265

40

245

29K

ChrisInterno retweeted

Shruti Joshi @_shruti_joshi_

2 months ago

SAEs fail at OOD tasks. Why? Features in superposition are linearly representable but not linearly accessible. Instead of discarding sparse coding, we embrace the geometry of superposition and use methods equipped to handle the nonlinearity it induces.

6

201

28

173

41K

Christian Internò

@ChrisInterno

2 months ago

[9/9] Bonus: exact modular unlearning comes for free! The global model is a discrete union of independent generative modules, one per client per class. To forget a client? Delete their column. To erase a class? Delete its row. No retraining. No approximations. True right-to-be-forgotten compliance.

ChrisInterno's tweet photo. [9/9] Bonus: exact modular unlearning comes for free!

The global model is a discrete union of independent generative modules, one per client per class.

To forget a client? Delete their column. To erase a class? Delete its row. No retraining. No approximations. True right-to-be-forgotten compliance.

0

2

0

1

76

Christian Internò

@ChrisInterno

2 months ago

[1/9]🏭 FederatedFactory: Generative One-Shot Learning for Extremely Non-IID Distributed Scenarios - No pretrained models. - No raw data sharing. - One communication round. - It works when clients hold a completely different class of data. Preprint and code links in the comments 👇

ChrisInterno's tweet photo. [1/9]🏭 FederatedFactory: Generative One-Shot Learning for Extremely Non-IID Distributed Scenarios

- No pretrained models.
- No raw data sharing.
- One communication round.
- It works when clients hold a completely different class of data.

Preprint and code links in the comments 👇

1

10

0

1

2K

Christian Internò

@ChrisInterno

2 months ago

[8/9] Results: 📊 CIFAR-10 Acc: 11.36% → 90.57% 📊 ISIC2019 AUROC: 47.31% → 90.57% 📊 BloodMNIST Acc: 18.05% → 91.17% 📊 RetinaMNIST Acc: 43.50% → 49.30% 📊 PathMNIST Acc: 18.05% → 67.94% All matching the centralized upper bound, the performance you'd get if you could just pool all the data. +) Communication overhead: -99.4%!

ChrisInterno's tweet photo. [8/9] Results:

📊 CIFAR-10 Acc: 11.36% → 90.57%
📊 ISIC2019 AUROC: 47.31% → 90.57%
📊 BloodMNIST Acc: 18.05% → 91.17%
📊 RetinaMNIST Acc: 43.50% → 49.30%
📊 PathMNIST Acc: 18.05% → 67.94%

All matching the centralized upper bound, the performance you'd get if you could just pool all the data.

+) Communication overhead: -99.4%!

1

2

0

1

95

Christian Internò

@ChrisInterno

Last Seen Users on Sotwe

Trends for you

Most Popular Users