Tengda Han @TengdaHan - Twitter Profile

Human perception is active: we move around to see, and we see with intention. In our latest work "Seeing without Pixels", we find "how you see" (how the camera moves) roughly reveals "what you do" or "what you observe" -- and this connection can be easily learned from data.

2

166

18

80

21K

0

76

6

27

8K

Tengda Han @TengdaHan

3 months ago

Accepted by @CVPR #CVPR2026 🎉🎉

Tengda Han @TengdaHan

6 months ago

Human learns from unique data -- everyone's OWN life -- but our visual representations eventually align. In our recent work "Unique Lives, Shared World" @GoogleDeepMind, we train models with "single-life" videos from distinct sources, and study their alignment and generalisation.

TengdaHan's tweet photo. Human learns from unique data -- everyone's OWN life -- but our visual representations eventually align. In our recent work "Unique Lives, Shared World" @GoogleDeepMind, we train models with "single-life" videos from distinct sources, and study their alignment and generalisation. https://t.co/dgfDMkEz0d

10

145

31

71

13K

1

35

6

7

4K

Who to follow

Elliott / Shangzhe Wu

@elliottszwu

Assistant Professor @Cambridge_Eng, working on 3D computer vision and inverse graphics, previously postdoc @StanfordSVL, PhD @Oxford_VGG

Richard Zhang

@rzhang88

Sr Research Scientist @AdobeResearch PhD @berkeley_ai, BS/MEng @cornellece 🤖 Computer vision, deep learning, graphics

Visual Geometry Group (VGG)

@Oxford_VGG

Computer Vision research group @UniofOxford led by Andrew Zisserman, Andrea Vedaldi, João Henriques, Christian Rupprecht, and Iro Laina

TengdaHan retweeted

Google DeepMind @GoogleDeepMind

4 months ago

Gemini 3.1 Pro is here. We’ve significantly improved the model’s overall intelligence so it can solve tougher problems. 🧵

288

6K

732

628

925K

Tengda Han @TengdaHan

4 months ago

@TongPetersb Thanks for sharing! Nice blog!

0

4

0

328

TengdaHan retweeted

Sayna Ebrahimi @SaynaEbrahimi

6 months ago

I’m looking for PhD students in Audio & Video for a Summer 2026 internship at Google DeepMind! ⚠️ Requirement: Prior publication in this area. To apply, tell me the most critical research gap in AV understanding to see if we are a match! https://t.co/uKQnftKwpJ

1

126

19

86

12K

Tengda Han @TengdaHan

6 months ago

A SOTA model on 4D reconstruction from @GoogleDeepMind! Amazing work from @ChuhanZhang5 and the team! It was so satisfactory to see these reconstruction results and I've been having a great experience using it

Chuhan Zhang @ChuhanZhang5

6 months ago

A SINGLE encoder + decoder for all the 4D tasks! We release 🎯 D4RT (Dynamic 4D Reconstruction and Tracking). 📍 A simple, unified interface for 3D tracking, depth, and pose 🌟 SOTA results on 4D reconstruction & tracking 🚀 Up to 100x faster pose estimation than prior works

17

327

53

186

68K

5

201

21

108

18K

TengdaHan retweeted

Weidi Xie @WeidiXie

6 months ago

🚀 Glad to share the exciting project — SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass! We explored the generation of 3D scenes with multiple assets from a single image. 🎉 ACCEPTED by 3DV 2026!!! All resources have been open-sourced and publicly available! 📄 Paper: https://t.co/51GB5oTMZR 💻 Code: https://t.co/g7Z1VuTEla 🔗 Model: https://t.co/zam2NDL30z 🌐 WebPage: https://t.co/MGlUuLyHd9 #3DVision #AI #GenerativeAI #ComputerVision #3DV2026 #SceneGen

WeidiXie's tweet photo. 🚀 Glad to share the exciting project — SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass! We explored the generation of 3D scenes with multiple assets from a single image. 🎉 ACCEPTED by 3DV 2026!!!

All resources have been open-sourced and publicly available!
📄 Paper: https://t.co/51GB5oTMZR
💻 Code: https://t.co/g7Z1VuTEla
🔗 Model: https://t.co/zam2NDL30z
🌐 WebPage: https://t.co/MGlUuLyHd9

#3DVision #AI #GenerativeAI #ComputerVision #3DV2026 #SceneGen

1

7

2

882

TengdaHan retweeted

joao carreira @joaocarreira

6 months ago

Future AI models will learn predominantly post-deployment – to do the tasks of interest to each user. This will happen throughout an individual “life”. In a new paper https://t.co/BrH9FxBqG0 we lay out groundwork for this type of capabilities in the wild from a visual standpoint.

2

15

4

10

2K

Tengda Han @TengdaHan

6 months ago

Work from @SaynaEbrahimi, myself, and @dilaragoekay, @goolygu, Maks Ovsjanikov, Iva Babukova, @DanielZoran_ , Viorica Patraucean, @joaocarreira , Andrew Zisserman and @dimadamen at @GoogleDeepMind. Arxiv: https://t.co/8hm90TyGIQ

1

13

2

3

3K

Tengda Han @TengdaHan

6 months ago

Human learns from unique data -- everyone's OWN life -- but our visual representations eventually align. In our recent work "Unique Lives, Shared World" @GoogleDeepMind, we train models with "single-life" videos from distinct sources, and study their alignment and generalisation.

10

145

31

71

13K

Tengda Han @TengdaHan

6 months ago

Sherry is currently on the industry job market. Highly recommend!!

Zihui (Sherry) Xue @sherryx90099597

6 months ago

Excited to share our latest work! Grateful for the guidance from all my collaborators, and special thanks to Tengda for being such an amazing mentor during my internship @GoogleDeepMind 😊

0

25

1

2

6K

1

10

1

3K

Tengda Han @TengdaHan

6 months ago

@KevinQHLin @sherryx90099597 @dimadamen Thanks for reposting!

0

1

0

109

Tengda Han @TengdaHan

6 months ago

@ducha_aiki @sherryx90099597 @dimadamen Thank you Dmytro for reposting! Glad you like it :)

0

4

0

237

Tengda Han @TengdaHan

6 months ago

Project page for more details and qualitative examples: https://t.co/E8BDGIM0w0 Sherry will be at @NeurIPSConf this week! Catch her to chat more!

0

7

0

2

674

Tengda Han @TengdaHan

6 months ago

Human perception is active: we move around to see, and we see with intention. In our latest work "Seeing without Pixels", we find "how you see" (how the camera moves) roughly reveals "what you do" or "what you observe" -- and this connection can be easily learned from data.

2

166

18

80

21K

Tengda Han @TengdaHan

6 months ago

Can you tell which action corresponds to which camera trajectory in the video above? Check out our paper for answers! Work done by our great intern Sherry Xue @sherryx90099597 at @GoogleDeepMind, and with Kristen Grauman, @dimadamen and Andrew Zisserman. https://t.co/ukbMRfAkZk

1

13

3

7

1K

Tengda Han @TengdaHan

6 months ago

A belated post for our ACMMM paper: we recognize and track animated characters for movie understanding tasks. Great work from Zhongrui Gui, also with @JunyuXieArthur @WeidiXie and Andrew Zisserman from @Oxford_VGG . Project page with code and dataset: https://t.co/G70041InQ8

0

1

0

196

Tengda Han @TengdaHan

6 months ago

Animated movies can be effortlessly understood by young minds, but appear to be challenging for video-language models, why? The key problem is the huge diversity of animated characters -- their appearance ranges from human-like faces, to cars, fish, blobs, etc.

1

13

3

1

2K

Tengda Han

@TengdaHan

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users