Ayush Tewari @_atewari - Twitter Profile

Pinned Tweet

4 months ago

Is pixel prediction the best way to build a world model? Check out VDAWorld, an alternative path to building interpretable, editable, and physically grounded world models. We use a VLM to build a simulation of the scene with the help of a computer vision toolbox.

7

164

15

96

9K

Ayush Tewari @_atewari

1 day ago

@TianweiY @reve @arena So cool! Congrats!

1

3

0

399

_atewari retweeted

Andrei Bursuc @CVPR @abursuc

1 day ago

Bill Freeman gives us first a list of warm-up bitter lessons. He keeps the bigger ones for later in the talk. #cvpr2026

5

662

78

372

67K

_atewari retweeted

Tobias Fischer @TobiasFischer11

5 days ago

Do 3D reconstruction transformers really need a billion parameters, or are most of those layers just doing the same thing over and over? Introducing Déjà View: a single transformer block, looped K times, that matches or beats models 8–10× its size with lower compute. 🧵

14

694

99

511

90K

Who to follow

International Conference on 3D Vision (3DV)

@3DVconf

Official account for International Conference on 3D Vision (3DV) #3DV2027 Dates: April 6-9, 2027 Place: Thessaloniki, Greece 🇬🇷

Angela Dai

@angelaqdai

Associate Professor @ Technical University of Munich

Gordon Wetzstein

@GordonWetzstein

Professor at Stanford University & Co-founder at Rhoda AI

Ayush Tewari @_atewari

3 days ago

Check out 3D-Belief: a 3D world model that explores how explicit belief representations can support memory, imagination, and planning under partial observability.

Yifan Yin

@yifanyin_11

3 days ago

What should a world model capture for embodied agents? An agent acting under partial observability needs a belief over the 3D world: what it has seen, what may exist beyond view, and how that belief should update as it moves. Introducing 3D-Belief, a 3D world model for embodied belief inference under partial observability. 🧵[1/9]

3

52

14

38

9K

0

12

0

2

765

_atewari retweeted

Jianwen Xie @ CVPR 2026

@jianwen_xie

3 days ago

Nice work! World models meet active sensing and closed-loop planning. We'll be showcasing this demo at the Lambda booth @chuanli11 during CVPR main conference at Denver. If you're attending, stop by and check it out! #cvpr @CVPR We are also hosting a world-model-related workshop at CVPR : https://t.co/k7uKAyzybO

0

11

2

1

2K

_atewari retweeted

SIGGRAPH Asia ➡️ Kuala Lumpur, Malaysia @SIGGRAPHAsia

13 days ago

Submit to #SIGGRAPHAsia2026 Workshops! Bring researchers, students, designers, artists, developers and creators together for lively exchange & discussion. 📅 Challenge-Included: 15 June 2026 📅 Discussion-Focused: 31 July 2026 Submit:https://t.co/IgJGad7J0O #WeavingTheFuture

SIGGRAPHAsia's tweet photo. Submit to #SIGGRAPHAsia2026 Workshops!

Bring researchers, students, designers, artists, developers and creators together for lively exchange & discussion.

📅 Challenge-Included: 15 June 2026
📅 Discussion-Focused: 31 July 2026

Submit:https://t.co/IgJGad7J0O

#WeavingTheFuture https://t.co/XabLfHiemO

0

4

1

0

592

_atewari retweeted

Elliott / Shangzhe Wu @elliottszwu

3 months ago

I’m looking for a PhD student with UK home fee status to join my lab at Cambridge starting this October. If you’re interested and have research experience in multimodal spatial modeling or vision-based robot learning, email me your CV. Funding deadline is approaching very soon.

2

146

33

36

20K

Ayush Tewari @_atewari

3 months ago

@dimadamen @Cambridge_Eng @BristolUni @bristolcs Thanks for inviting me!

1

2

0

178

Ayush Tewari @_atewari

4 months ago

Project page: https://t.co/YCahhjxVjM. Work led by @FelixOMahony, in collaboration with @robertocipolla.

1

6

0

1

663

Ayush Tewari @_atewari

4 months ago

Is pixel prediction the best way to build a world model? Check out VDAWorld, an alternative path to building interpretable, editable, and physically grounded world models. We use a VLM to build a simulation of the scene with the help of a computer vision toolbox.

7

164

15

96

9K

Ayush Tewari @_atewari

4 months ago

In addition to physical plausibility, the use of a python simulator makes it easy for users to modify the simulation! Check out the project page for many more examples.

1

9

1

1K

Ayush Tewari @_atewari

4 months ago

Join us!

Elliott / Shangzhe Wu @elliottszwu

4 months ago

New opening for an Assistant/Associate Professor in Robotics at the Cambridge Engineering Department @Cambridge_Eng. Apply by 8 February 2026: https://t.co/e7SkDYGU14

0

23

2

3

3K

0

4

0

3

840

_atewari retweeted

Andrew Davison @AjdDavison

6 months ago

Makes sense. Matching and 3D reconstruction are inherently iterative computations; this nice paper gives hints on how DUSt3R's transformer achieves that. Now we can get on with figuring out how to do it tens or hundreds of times better/faster with a more specific architecture.

4

193

24

141

22K

_atewari retweeted

Kwang Moo Yi @kwangmoo_yi

7 months ago

Stary and Gaubil et al., "Understanding multi-view transformers" We use Dust3r as a black box. This work looks under the hood at what is going on. The internal representations seem to "iteratively" refine towards the final answer. Quite similar to what goes on in point cloud net

kwangmoo_yi's tweet photo. Stary and Gaubil et al., "Understanding multi-view transformers"

We use Dust3r as a black box. This work looks under the hood at what is going on. The internal representations seem to "iteratively" refine towards the final answer. Quite similar to what goes on in point cloud net https://t.co/BxkbAbHivh

2

78

14

58

7K

_atewari retweeted

Julien Gaubil @jgaubil

7 months ago

DUSt3R et al. are impressive, but how do they actually work? We explored this, and share insights on iterative reconstruction, the roles of cross- and self-attention, and emerging correspondences across the network [1/8] ⬇️

jgaubil's tweet photo. DUSt3R et al. are impressive, but how do they actually work?

We explored this, and share insights on iterative reconstruction, the roles of cross- and self-attention, and emerging correspondences across the network [1/8] ⬇️ https://t.co/rDlYNeQev5

1

6

1

632

_atewari retweeted

Dmytro Mishkin 🇺🇦 @ducha_aiki

6 months ago

Understanding Multi-View Transformers Michal Stary @jgaubil @_atewari @vincesitzmann tl;dr: DUSt3R self-attention is it secretly a diffusion model, and cross-attention is matching. https://t.co/UR9agpjD8M

ducha_aiki's tweet photo. Understanding Multi-View Transformers

Michal Stary @jgaubil @_atewari @vincesitzmann

tl;dr: DUSt3R self-attention is it secretly a diffusion model, and cross-attention is matching.
https://t.co/UR9agpjD8M https://t.co/pFJXF61Wtc

2

254

50

208

37K

_atewari retweeted

Krishna Murthy @ CVPR @_krishna_murthy

8 months ago

In Fall 2026, I will begin a tenure-track faculty position @JHUCompSci Announcing the SciPhy lab, where we will study the science of physical agents (robots) We are now recruiting our first cohort of PhD students. If this is you, see https://t.co/heSKsCbWz7

_krishna_murthy's tweet photo. In Fall 2026, I will begin a tenure-track faculty position @JHUCompSci

Announcing the SciPhy lab, where we will study the science of physical agents (robots)

We are now recruiting our first cohort of PhD students. If this is you, see

https://t.co/heSKsCbWz7 https://t.co/FbBfiH9zVc

36

481

79

113

31K

Ayush Tewari @_atewari

10 months ago

@maksym_andr Congrats, Maksym!

0

1

0

356

Ayush Tewari

@_atewari

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users