Brian Cheung @thisismyhat - Twitter Profile

Pinned Tweet

about 2 years ago

The Platonic Representation Hypothesis https://t.co/eoz1GTBEiU Surprising (?) results: - Pure vision models align with pure text models as scale increases. - This alignment correlates with better downstream performance. Fun work with @minyoung_huh @TongzhouWang @phillip_isola

thisismyhat's tweet photo. The Platonic Representation Hypothesis
https://t.co/eoz1GTBEiU

Surprising (?) results:

- Pure vision models align with pure text models as scale increases.
- This alignment correlates with better downstream performance.

Fun work with @minyoung_huh @TongzhouWang @phillip_isola https://t.co/5wisQBQdpX

8

149

16

62

19K

thisismyhat retweeted

Sophie Wang @SophieLWang

24 days ago

"The Truth Lies Somewhere in the Middle (of the Generated Tokens)" In autoregressive language models, mean pooling hidden states across generation yields better representations than any token alone. project page: https://t.co/kXddYUir4k w/ @phillip_isola and @thisismyhat

9

466

68

384

50K

thisismyhat retweeted

Badr AlKhamissi @bkhmsi

5 months ago

🎉 Re-Align is back for its 4th edition at ICLR 2026! 📣 We invite submissions on representational alignment, spanning ML, Neuroscience, CogSci, and related fields. 📝 Tracks: Short (≤5p), Long (≤10p), Challenge (blog) ⏰ Feb 5, 2026 for papers 🔗 https://t.co/BEtKUM9oQP

bkhmsi's tweet photo. 🎉 Re-Align is back for its 4th edition at ICLR 2026!

📣 We invite submissions on representational alignment, spanning ML, Neuroscience, CogSci, and related fields.

📝 Tracks: Short (≤5p), Long (≤10p), Challenge (blog)

⏰ Feb 5, 2026 for papers

🔗 https://t.co/BEtKUM9oQP https://t.co/U6yl1VU4Fr

1

59

20

24

28K

Brian Cheung @thisismyhat

5 months ago

There are self-play environments everywhere for those with the eyes to see.

Yuxiang Wei

@YuxiangWei9

5 months ago

Software agents can self-improve via self-play RL Introducing Self-play SWE-RL (SSR): training a single LLM agent to self-play between bug-injection and bug-repair, grounded in real-world repositories, no human-labeled issues or tests. 🧵

YuxiangWei9's tweet photo. Software agents can self-improve via self-play RL

Introducing Self-play SWE-RL (SSR): training a single LLM agent to self-play between bug-injection and bug-repair, grounded in real-world repositories, no human-labeled issues or tests. 🧵

64

2K

288

1K

525K

1

10

0

3

2K

Who to follow

/MachineLearning

@slashML

Jascha Sohl-Dickstein

@jaschasd

Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.

Chelsea Finn

@chelseabfinn

Asst Prof of CS & EE @Stanford Co-founder of Physical Intelligence @physical_int PhD from @Berkeley_EECS, EECS BS from @MIT

thisismyhat retweeted

Kush Tiwary @ktiwarylab

6 months ago

👁️🌋 Our new @ScienceAdvances paper: We replayed the Cambrian explosion of vision by evolving AI agents inside a physics engine to understand the principles that shape visual intelligence. We believe that this is a promising way to do AI for science and build new forms of AI by computationally mimicking biological design principles of evolution and learning. Website: https://t.co/xd1Hkt8bvs and paper links at the end of this thread 👇

ktiwarylab's tweet photo. 👁️🌋 Our new @ScienceAdvances paper: We replayed the Cambrian explosion of vision by evolving AI agents inside a physics engine to understand the principles that shape visual intelligence.

We believe that this is a promising way to do AI for science and build new forms of AI by computationally mimicking biological design principles of evolution and learning.

Website: https://t.co/xd1Hkt8bvs and paper links at the end of this thread 👇

2

16

6

1

1K

Brian Cheung @thisismyhat

6 months ago

@phillip_isola An old model's trash is a future model's treasure!

0

4

0

356

Brian Cheung @thisismyhat

6 months ago

There's no such thing as bad data, only bad learning.

Physical Intelligence

@physical_int

6 months ago

We discovered an emergent property of VLAs like π0/π0.5/π0.6: as we scale up pre-training, the model learns to align human videos and robot data! This gives us a simple way to leverage human videos. Once π0.5 knows how to control robots, it can naturally learn from human video.

81

3K

343

1K

1M

6

46

2

16

8K

Brian Cheung @thisismyhat

6 months ago

@KyleStachowicz @chris_j_paxton Then you've learned to not buy from then again. 🙂

0

56

Brian Cheung @thisismyhat

6 months ago

With a sufficient model of the world, any data can be useful data

Physical Intelligence

@physical_int

6 months ago

This also shows up in the representations learned by the model. We plot the model’s representations of human and robot images. As pre-training is scaled up, the representation of humans and robots become more aligned: to a scaled-up model, human videos "look" like robot demos.

17

345

25

108

121K

4

36

1

6

4K

Brian Cheung @thisismyhat

6 months ago

@chris_j_paxton Data is just information. There's always something to learn from data, maybe not what you need right now. But can become important later.

1

3

0

308

Brian Cheung @thisismyhat

6 months ago

@JieWang_ZJUI The human data was originally out of domain and unusable becomes useful after a certain scale of model capability https://t.co/wIa62hyNG5

Physical Intelligence

@physical_int

6 months ago

This also shows up in the representations learned by the model. We plot the model’s representations of human and robot images. As pre-training is scaled up, the representation of humans and robots become more aligned: to a scaled-up model, human videos "look" like robot demos.

17

345

25

108

121K

0

3

0

127

thisismyhat retweeted

Phillip Isola @phillip_isola

6 months ago

Impromptu NeurIPS meetup: "representational convergence by the beach." We will meet at ballroom 20c (near lunch) 2pm Fri and walk over to Marina. Will chat about platonic reps, fractured reps, or anything else about where all these models are heading. Anyone is welcome to join!

6

224

19

84

23K

Brian Cheung @thisismyhat

6 months ago

In all this craziness, let's take a moment and enjoy the three accounts @ilyasut follows.

0

6

0

679

thisismyhat retweeted

Phillip Isola @phillip_isola

7 months ago

This paper is really interesting to me -- it shows substantially stronger representational convergence than previously measured! In the PRH we found ~0.2 mknn alignment between vision and text models. This new paper reaches ~0.4. Challenge: find a setting where it reaches ~1.0.

6

153

14

93

23K

Brian Cheung @thisismyhat

8 months ago

The one time you get quoted about your research philosophy and then get described as an engineer 🤣@VentureBeat

Sakana AI

@SakanaAILabs

8 months ago

Sakana AI’s CTO says he’s ‘absolutely sick’ of transformers, the tech that powers every major AI model “You should only do the research that wouldn’t happen if you weren’t doing it.” (@thisismyhat) 🧠 @YesThisIsLion https://t.co/cGdHONcqDV

21

413

55

132

181K

1

32

0

3

8K

thisismyhat retweeted

Aritra

@ariG23498

8 months ago

Read and reproduced this paper in a free tier colab notebook. 🔥

1

28

3

11

4K

thisismyhat retweeted

Phillip Isola @phillip_isola

8 months ago

Over the past year, my lab has been working on fleshing out theory/applications of the Platonic Representation Hypothesis. Today I want to share two new works on this topic: Eliciting higher alignment: https://t.co/KY4fjNeCBd Unpaired rep learning: https://t.co/vJTMoyJj5J 1/9

10

694

120

490

68K

Brian Cheung @thisismyhat

8 months ago

@jasonfurman Prediction markets don't just predict, they can influence outcomes (e.g. 'the fix is in', reflexivity, etc)

0

19

0

3

31K

Brian Cheung @thisismyhat

8 months ago

So going back to the original takeaway, much like what @sama, @ericjang11, and @noampomsky have said in the past: You can just ask for things. Something interesting might happen. Paper: https://t.co/4bgbJagRH3 Code: https://t.co/VylkC7oOg8 Website: https://t.co/GG6F8fLIIm 3/3

0

4

0

1

397

Brian Cheung @thisismyhat

8 months ago

A takeaway I learned from LLMs: You can just ask for things. What if you asked a language model to imagine senses it never experienced? @SophieLWang , @phillip_isola and I asked language models to "Imagine seeing..." and "Imagine hearing...". 1/3

1

13

0

4

1K

Brian Cheung @thisismyhat

8 months ago

It turns out, a simple cue like asking the model to ‘see’ or ‘hear’ can push a purely text-trained language model towards the representations of purely image-trained or purely-audio trained encoders. 2/3

thisismyhat's tweet photo. It turns out, a simple cue like asking the model to ‘see’ or ‘hear’ can push a purely text-trained language model
towards the representations of purely image-trained or purely-audio trained encoders.

2/3 https://t.co/D3zxShJegt

1

3

0

425

Brian Cheung

@thisismyhat

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users