Andrei Polubarov @earo050 - Twitter Profile

Pinned Tweet

2 months ago

Is it possible to build a multi-domain action model capable of adapting to unseen dynamics? Check out our new #ICLR2026 paper! We pushed in-context RL scaling further and released Vintix II. 👇👇👇

1

16

6

1

1K

Andrei Polubarov @earo050

8 days ago

@DJiafei I guess VLA need time to become useful. The way they are trained now is inefficient.

0

118

Andrei Polubarov @earo050

about 2 months ago

Today at #ICLR2026, we are presenting Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner. 📍 Poster: Pavilion 4, #4516 Happy to chat about in-context RL, robotics, and foundation models for decision making

earo050's tweet photo. Today at #ICLR2026, we are presenting Vintix II: Decision Pre-Trained Transformer is a Scalable In-Context Reinforcement Learner.

📍 Poster: Pavilion 4, #4516

Happy to chat about in-context RL, robotics, and foundation models for decision making https://t.co/huIuLMMQTs

0

4

0

68

Andrei Polubarov @earo050

2 months ago

We are grateful to @TianheYu, @TokicMichel, @ShengjieWa34067, @mitrma, @Etonwanaa, @teopir, @pengzh97, @zhoubolei and others for building the environments used in this work!

0

2

0

75

Andrei Polubarov @earo050

2 months ago

Is it possible to build a multi-domain action model capable of adapting to unseen dynamics? Check out our new #ICLR2026 paper! We pushed in-context RL scaling further and released Vintix II. 👇👇👇

1

16

6

1

1K

Andrei Polubarov @earo050

2 months ago

For interesting details and ablations, feel free to read our paper and check out our code! Project site: https://t.co/cxzY5tPodw Paper: : https://t.co/7Z6uPixOc1 Code: https://t.co/p9FS6aCKNa Dataset: https://t.co/kxoeNlqBs3

1

3

0

91

earo050 retweeted

Vladislav Kurenkov

@vladkurenkov

6 months ago

We released 87 hours of @LeRobotHF SO 100/101 datasets. It is a unified, cleaned, and annotated repackage of 598 open-source community datasets (SO100 and SO101), totaling 22,709 episodes, ~9.4M frames, and 563 tasks.

2

35

6

8

2K

earo050 retweeted

Vladislav Kurenkov

@vladkurenkov

about 1 year ago

🚀 Introducing cadrille: a new SOTA model for CAD reconstruction from images, point clouds, and text—all in one framework with the use of RLVR. Multimodal inputs + RLVR = clean, editable 3D models. 🧵👇

vladkurenkov's tweet photo. 🚀 Introducing cadrille: a new SOTA model for CAD reconstruction from images, point clouds, and text—all in one framework with the use of RLVR.

Multimodal inputs + RLVR = clean, editable 3D models.

🧵👇

1

20

8

3K

earo050 retweeted

Denis Tarasov @ML_is_overhyped

about 1 year ago

LLMs are amazing because they can learn in context — read, adapt, and act. Can we do the same for reinforcement learning? That’s the promise of In-Context RL (ICRL). But existing offline ICRL methods don’t even optimize rewards. Our new paper shows why RL matters 🧵

ML_is_overhyped's tweet photo. LLMs are amazing because they can learn in context — read, adapt, and act.

Can we do the same for reinforcement learning? That’s the promise of In-Context RL (ICRL).

But existing offline ICRL methods don’t even optimize rewards.

Our new paper shows why RL matters
🧵 https://t.co/0UmdjM1pS9

1

26

7

19

6K

earo050 retweeted

Alexander Nikulin @how_uhh

about 1 year ago

🎥 Pre-training VLAs on human videos is tempting — Latent Action Models quickly become an essential part of leading VLAs, like GR00T (@DrJimFan) — but can they effectively handle messy real‐world videos? In our #ICML paper we give an answer: not yet, at least without some help!

how_uhh's tweet photo. 🎥 Pre-training VLAs on human videos is tempting — Latent Action Models quickly become an essential part of leading VLAs, like GR00T (@DrJimFan) — but can they effectively handle messy real‐world videos?

In our #ICML paper we give an answer: not yet, at least without some help! https://t.co/8dFDr5aqC8

1

192

27

101

14K

earo050 retweeted

Ilya Zisman @suessmannn

about 1 year ago

🔥 Zero-shot generalization is the dream: adapt instantly, no fine-tuning. It's why LLMs blew up—but it's not just a language modeling thing. It’s happening in RL too. 🚨 @maxsbob21's new paper dives deep into zero-shot RL under shifting dynamics—and why current methods break.

suessmannn's tweet photo. 🔥 Zero-shot generalization is the dream: adapt instantly, no fine-tuning. It's why LLMs blew up—but it's not just a language modeling thing. It’s happening in RL too.

🚨 @maxsbob21's new paper dives deep into zero-shot RL under shifting dynamics—and why current methods break. https://t.co/hdWGz3lZ60

4

144

20

103

15K

earo050 retweeted

Alexander Nikulin @how_uhh

about 1 year ago

Two papers accepted by #ICML2025!

2

70

3

7

6K

earo050 retweeted

Vladislav Kurenkov

@vladkurenkov

over 1 year ago

Can In-Context RL scale across multiple domains? Our preliminary results suggest it can. Vintix: Action Model via In-Context Reinforcement Learning -- https://t.co/NMVu2b08TJ

vladkurenkov's tweet photo. Can In-Context RL scale across multiple domains? Our preliminary results suggest it can.

Vintix: Action Model via In-Context Reinforcement Learning -- https://t.co/NMVu2b08TJ https://t.co/e1Vv7lPvFy

4

55

13

33

59K

Andrei Polubarov

@earo050

Last Seen Users on Sotwe

Trends for you

Most Popular Users