Chris Hoang @choang333 - Twitter Profile

Pinned Tweet

4 months ago

Our work Midway Network has been accepted to #ICLR2026!

Animals learn to recognize objects and how they move just from observing the world. SSL on natural videos emulates “learning by observing” but only tackles recognition or motion, not both!

To address this gap, Midway Network is the first SSL architecture to learn both object recognition and motion understanding from natural videos via latent dynamics🧵

3

424

52

254

48K

choang333 retweeted

Sihyun Yu

@sihyun_yu

about 19 hours ago

Can MLLMs actually track what's happening in a video? Introducing VSTAT 🎯, our new benchmark for visual state tracking. The tasks are simple: count cups, read typed words, count page flips. Humans solve them easily. MLLMs don't. https://t.co/dgqhqeVuSv 🧵 [1/11]

5

162

54

78

37K

Chris Hoang

@choang333

1 day ago

@AlexiGlad 👀

0

3

0

353

Chris Hoang

@choang333

7 days ago

@rronak_ @MichaelElabd @QuantumArjun Congrats on the launch, @rronak_ !

0

1

0

169


choang333 retweeted

Sungjin Ahn

@SungjinAhn_

14 days ago

🧠We introduce "Generative Recursive Reasoning"!

Recursive Reasoning Models like HRM, TRM, and Looped Transformers are deterministic — same input, same reasoning, every time. They collapse the entire space of plausible reasoning paths into a single attractor.

Our model GRAM (Generative Recursive reAsoning Models) turns recursion itself into a stochastic latent trajectory. Multiple hypotheses, alternative solution strategies, and inference-time scaling not just by depth, but by width — parallel trajectory sampling.

And here's the kicker: the same formulation that gives us conditional reasoning p(y|x) also makes GRAM a general generative model p(x).

With only 10M params:
• Sudoku-Extreme: 97.0% (TRM 87.4%)
• ARC-AGI-1: 52.0%
• ARC-AGI-2: 11.1%
• N-Queens coverage: 90%+

📄 Paper: https://t.co/JC7EyXYc9Y
🌐 Project page: https://t.co/LRT1dQiWLZ

w/
Junyeob Baek @JunyeobB (KAIST),
Mingyu Jo @pyross0000 (KAIST),
Minsu Kim @minsuuukim (KAIST & Mila),
Mengye Ren @mengyer (NYU),
Yoshua Bengio @Yoshua_Bengio (Mila),
Sungjin Ahn @SungjinAhn_ (KAIST)

31

1K

209

1K

181K

choang333 retweeted

Jocelyn Shen

@jocelynjshen

15 days ago

As a rare non-academic note: I collected the recipes I made throughout my PhD and wrote + illustrated a cookbook/memoir!

“No time to cook this season” is a Little Forest inspired collection I wrote and illustrated for fun (hesitant to call it a cookbook since it’s mostly handwavey “recipes” but couldn’t find a better description 🤷‍♀️). Hope you can enjoy some peaceful art and stories as a little bedtime read 🥰

- Hardcover + digital copies can be found here: https://t.co/EYNKxNxEOc.
- To follow more of my art, you can visit my alias @jiayuestudio (ins: https://t.co/4ItPPQBTqz)
- (Let me know if I should print food art stickers too ☺️)

6

142

9

52

9K

choang333 retweeted

Mengye Ren

@mengyer

15 days ago

What does it mean to create a new concept rather than retrieve a familiar one?

I propose that creativity is what's unfamiliar at first but quickly learnable by an adaptive observer, and show that meta-learning through a frozen Diffusion model produces stylistic & conceptual creations.

9

187

41

138

12K

choang333 retweeted

Michael Hu

@michahu8

16 days ago

What is the right data mix, and how do we find it as the data keeps changing?

This is a core, unsolved problem in continual learning. To tackle it, we built a data mixing algo that works everywhere — pretraining, midtraining, instruction tuning

Introducing: On-Policy Mix

🧵1/6 https://t.co/LCuNkoewVf

6

310

55

319

46K

Chris Hoang

@choang333

21 days ago

@shengranhu @Recursive_SI Congrats Shengran!

1

0

70

Chris Hoang

@choang333

28 days ago

@alexandernwang definitely less! (codex plus is the $20 plan)

1

0

42

Chris Hoang

@choang333

28 days ago

why did I believe that codex plus gives the same usage as claude max...

1

3

0

385

choang333 retweeted

NYU Center for Data Science

@NYUDataScience

28 days ago

AI agents often struggle to plan movements because their internal representations of the physical world can be overly tangled. CDS PhD student Ying Wang (@yingwww_) shows how straightening these pathways improves AI navigation. Accepted to ICML 2026. https://t.co/BOJCXWwWS8 1/

7

71

21

45

44K

choang333 retweeted

Jack Lu

@Jacklu_me

about 1 month ago

Would, but NeurIPS set an adversarial deadline date this year.

1

118

7

6

15K

choang333 retweeted

Alex N. Wang

@alexandernwang

about 1 month ago

What happens to planning and control when world models condition on complex actions? For example, precisely controlling a human agent may require specifying the motion of each joint. In this setting, action dimensionality increases, the model becomes difficult to control, and the cost of planning using search-based methods like CEM explodes. We propose a solution: lift the world model to a higher level of abstraction. We use a lightweight policy to map high-level waypoint actions → low-level joint sequences, so you can control and plan in a concise space. Best of all, this is done without finetuning or losing any world model expressiveness. 1/8

4

184

26

106

31K

choang333 retweeted

fly51fly

@fly51fly

about 1 month ago

[CV] Lifting Embodied World Models for Planning and Control
A N. Wang, T Darrell, P Izmailov, Y Bai, A Bar [New York University & UC Berkeley] (2026)
https://t.co/c15C0mXMa8 https://t.co/xS1Uoru1Fk

0

30

8

40

4K

choang333 retweeted

Jack Lu

@Jacklu_me

about 1 month ago

Context Tuning accepted to ICML 2026 🎉 See you in Seoul. https://t.co/kdwUhxLva3 It’s a neat LLM adaptation method with minimal implementation overhead and great scaling behavior. Hoping to add it in the PEFT library, and will do a follow-up post with lots of new results. Also excited to share my new LLM reasoning/adaptation work 🔜

9

87

13

36

11K

choang333 retweeted

Mengye Ren

@mengyer

about 1 month ago

Talk slides are here! https://t.co/dqqRfgBePP

2

64

10

45

8K

choang333 retweeted

Mengye Ren

@mengyer

about 1 month ago

Giving a talk at ICLR MemAgents Workshop tomorrow, 11:25am local, Room 205: "Does Your LLM Agent Have a Self?" Spoiler: probably not — but the reason is more nuanced than "they're just LLMs." Hope to see you there!


6

147

26

73

17K

choang333 retweeted

Anthony GX-Chen ✈️ ICLR 2026

@AntChen_

about 1 month ago

Happening *TODAY* at #ICLR2026 Drop by if you want to discuss why diversity collapse happens in post training, and/or how to better RL your LLM :) 📍3:15–5:30pm · Pavilion 4 • Poster # 4717

0

50

4

26

7K

Chris Hoang

@choang333

about 1 month ago

@agentic_ai_lab LaMo: https://t.co/KUN2hI7qHc

0

1

0

53

Chris Hoang

@choang333

about 1 month ago

Come see our work on LaMo at #ICLR2026, a world model that predicts compact latent motion tokens to recurrently advance a visual scene's latent state over time, enabling long-horizon prediction!

Presenting at the World Models Workshop on Monday in Room 202 A/B https://t.co/U3kaJ7XsFr

azwar abdulsalam @Azlock1729

about 1 month ago

1/5 Excited to share LaMo: A Latent Motion World Model for Long-Horizon Prediction, to be presented at the ICLR 2026 Workshop on World Models. LaMo predicts compact latent motion rather than the next dense latent state.

2

8

0

5

5K

0

60

11

37

5K
