youming.deng @denghilbert - Twitter Profile

Pinned Tweet

4 months ago

We present the SOTA feed-forward 3DGS pipeline Selfi, which was accepted by #CVPR2026. Project Page: https://t.co/3T31XcnkpD

3

313

45

197

53K

denghilbert retweeted

Jihan Yang

@jihanyang13

11 days ago

Camera pose matters for video understanding! Today's MLLMs excel at recognizing activities, but still struggle with the underlying space and ego/object dynamics in video. We trace this gap to a missing piece: camera pose. Introducing Cambrian-P: a multimodal LLM natively grounded in camera pose. (1/n)

2

276

47

174

52K

denghilbert retweeted

Tobias Fischer @TobiasFischer11

8 days ago

Do 3D reconstruction transformers really need a billion parameters, or are most of those layers just doing the same thing over and over? Introducing Déjà View: a single transformer block, looped K times, that matches or beats models 8–10× its size with lower compute. 🧵

14

688

96

510

89K

youming.deng @denghilbert

25 days ago

@YichuanM congrats. that's so cool!

0

17

youming.deng @denghilbert

25 days ago

bunch of bullshit

Dr. Manabendra Saharia

@m_saharia

26 days ago

Yesterday, I was giving an intro talk to our dept's new PhD students. Technical things aside, my number 1 suggestion has remained the same over the years: Treat your PhD like a job. - Avoid 1.5h lunch and three tea breaks. - Avoid gossiping and loitering at work. - Lab at 9 am and leave at 6 pm. Being productive till 11 pm in the lab is a lie people tell themselves when their day starts at 1 PM. Everything worth doing can be done with high intensity focus during work hours. And having fun in life is the secret to being productive in a marathon.

102

2K

177

913

2M

0

2

0

206

denghilbert retweeted

Curry Flurry 😈 @BabyFaceDubs

about 1 month ago

Oracle crowd was a devils pit man

25

4K

196

352

103K

denghilbert retweeted

Impressions

@impression_ists

about 1 month ago

Vincent van Gogh, Novels and Rose

9

2K

418

148

50K

denghilbert retweeted

Songyou Peng @songyoupeng

about 1 month ago

Yay, finally! Introducing Vision Banana🍌 from @GoogleDeepMind, our unified model that outperforms SoTA specialist models on various vision tasks! By treating 2D/3D vision tasks as image generation, we unlock a new foundation for CV. Project page: https://t.co/GQgRi6mWwC (1/5)

56

2K

310

1K

282K

denghilbert retweeted

Gene Chou

@gene_ch0u

about 2 months ago

Introducing CityRAG! We wanted video generative models to be grounded in the real world — if I’m in London, I want to look around and actually see Big Ben. CityRAG generates videos of cities featuring real buildings and roads, with arbitrary weather, people, and cars. 1/N page: https://t.co/jxMSX5Ik7F paper: https://t.co/So2V9hyB4D

6

242

49

135

36K

youming.deng @denghilbert

about 2 months ago

@TongPetersb @sainingxie @ylecun @mengyer @YiMaTweets @LukeZettlemoyer @liuzhuang1234 big congrats, Dr. Tong!!

1

0

352

denghilbert retweeted

Art Guide

@ArtGuide_db

about 2 months ago

Vincent van Gogh Sunflowers, 1888

10

1K

252

58

29K

youming.deng @denghilbert

about 2 months ago

Super cool. Self-improving makes the model better and better!

Qianqian Wang @QianqianWang5

about 2 months ago

Most multi-view reconstruction models need full supervision. We show they can self-improve without any ground truth labels. Introducing SelfEvo: Self-Improving 4D Perception via Self-Distillation. Up to +36.5% in video depth, +20.1% in camera estimation, zero annotation.