Yehonathan Litman @yehonation - Twitter Profile

Pinned Tweet

5 days ago

To be honest, training on handmade 4D asset datasets is a dead-end. Almost all 4D asset data is synthetic and diverse real data barely exists, so models trained on it struggle to reconstruct objects that deform, get occluded, and move freely about the scene. Our new work, Lift4D, instead lifts 2D & 3D priors into 4D, reconstructing complete dynamic objects from a single in-the-wild video 🧵 (1/n) 🔗Webpage + Demos: https://t.co/XI5jUViTpC

9

449

68

316

47K

yehonation retweeted

Alexandre Morgand @Almorgand

3 days ago

"Lift4D: Harmonizing Single-View 3D Estimation for 4D Reconstruction In-the-Wild" TL;DR: combines temporally consistent single-view 3D priors, deformable Gaussian Splatting, and diffusion-guided optimization to reconstruct challenging dynamic scenes from monocular videos.

1

11

1

6

1K

yehonation retweeted

Sadao Tokuyama

@tokufxug

5 days ago

単一の動画から動体の3次元形状、外観、変形を復元する技術「Lift4D」。 3Dモデルによる整合性の高い形状復元と、2Dモデルによる鮮明な外観生成を組み合わせる最適化手法を採用。カメラに映らない遮蔽領域も補完し、激しい動きを含む実環境の映像から滑らかな4D映像を生成する。（詳細はリプ欄。）

1

59

12

42

4K

yehonation retweeted

バーチャルデータサイエンティストアイシア=ソリッド

@AIcia_Solid

5 days ago

変形する物体含む 4DGS って大変らしい。フルに 4D でやるとデータが足りないし、最初に 3D reconstruction やってそのあと video 見て変形するのもいいけど、3D cue が initial frame だけじゃん、という課題意識。からの、video の各 frame を temporal に coherent な感じで 3D にして、いい感じにやるみたい。すごそう👀

0

44

5

38

8K

Who to follow

Xindi Wu

@cindy_x_wu

PhD student @PrincetonCS | Data-centric multimodal ml | prev @roboVisionCMU @CMU_Robotics | @NVIDIAAI @RealityLabs @Snapchat

Tianyuan Zhang

@tianyuanzhang99

General intelligence and continue learning at meta tbd lab. prev Phd in MIT, M.S. in CMU, B.S. in PKU.

Zhizhuo (Z) Zhou

@zhizdev

Building generative 3D @Stanford | Prev @CarnegieMellon @UMich

Yehonathan Litman

@yehonation

4 days ago

@REVOLVO_OCELOTS I'm aiming to release the code next month and you can try it yourself, but based on the example the object is not very visible in most frames and the cuts are very harsh (180 degrees), this is very complex so our method would probably not work😢

0

1

0

27

Yehonathan Litman

@yehonation

5 days ago

To be honest, training on handmade 4D asset datasets is a dead-end. Almost all 4D asset data is synthetic and diverse real data barely exists, so models trained on it struggle to reconstruct objects that deform, get occluded, and move freely about the scene. Our new work, Lift4D, instead lifts 2D & 3D priors into 4D, reconstructing complete dynamic objects from a single in-the-wild video 🧵 (1/n) 🔗Webpage + Demos: https://t.co/XI5jUViTpC

9

449

68

316

47K

yehonation retweeted

Michael Becker

@michaelybecker

4 days ago

fascinating to see another exemplar of monocular video 2 4D multiview reconstruction - World Labs featured a similar paper a few weeks ago. Early results certainly seem viable enough to be useful in many contexts, and the barrier to entry is unthinkably lower than full multiview video

1

10

1

1K

Yehonathan Litman

@yehonation

4 days ago

@michaelybecker Thank you! Working on it, hope to release it by next month.

0

2

0

78

yehonation retweeted

Yehonathan Litman

@yehonation

5 days ago

To be honest, training on handmade 4D asset datasets is a dead-end. Almost all 4D asset data is synthetic and diverse real data barely exists, so models trained on it struggle to reconstruct objects that deform, get occluded, and move freely about the scene. Our new work, Lift4D, instead lifts 2D & 3D priors into 4D, reconstructing complete dynamic objects from a single in-the-wild video 🧵 (1/n) 🔗Webpage + Demos: https://t.co/XI5jUViTpC

9

449

68

316

47K

Yehonathan Litman

@yehonation

5 days ago

@nickkarpov Right now the 3D prior is not able to use multi view data, it only takes single view. That said there are methods for using multi view data with image-to-3D models like MV-SAM3D.

0

1

0

187

Yehonathan Litman

@yehonation

5 days ago

@REVOLVO_OCELOTS It depends on how much the object changes as the deformation model has limited capacity. If the perspective is relatively similar across cuts then it should be fine but if the object completely changes the deformation could break.

1

0

104

yehonation retweeted

Aashish Rai @aashishrai3799

5 days ago

This was indeed essential in the current landscape of 4D reconstruction methods. By far the best reconstruction results from monocular videos.

0

1

0

363

Yehonathan Litman

@yehonation

5 days ago

@JieWang_ZJUI Exactly. If you have infinite budget, e.g. an army of digital artists, you would get much further in designing and modeling high quality static 3D than diverse 4D assets from internet footage. This was the data curation process for SAM3D and it proved successful.

0

2

0

52

Yehonathan Litman

@yehonation

5 days ago

@JieWang_ZJUI Encoding diverse motion for 3D assets makes 4D data curation nearly impossible, for example a folding shirt (which we have a demo of in our website!). In many cases ITW lifting priors trained on clean+dirty data is preferable because clean 4D data is too hard to get.

1

0

52

Yehonathan Litman

@yehonation

5 days ago

@JieWang_ZJUI Just saw your comment! 3D in the wild is not great, but compared to 4D it’s still way better in diversity and quantity. Synthetic 3D data is also much easier to create than 4D, because encoding diverse motion is incredibly difficult even for just one configuration.

1

3

0

206

yehonation retweeted

Jay Karhade @JayKarhade

5 days ago

Really cool work, one of the more impressive visuals I have seen for 4D reconstruction!

0

7

2

3K

yehonation retweeted

Homanga Bharadhwaj

@mangahomanga

5 days ago

Big fan of these results from @yehonation on modeling appearance, geometry, and deformation of objects from in-the-wild videos Excited about potential applications in human-object manipulation!

0

10

1

8

2K

yehonation retweeted

Koichi Namekata @Koichi_N_

5 days ago

Excited to share our #SIGGRAPH2026 paper: Go-with-the-Track! We utilize point-tracks as spatiotemporal conditioning to precisely insert multiple reference images into generated videos. Paper: https://t.co/gbBNilImRU Project page: https://t.co/yB90SmOV7U

1

84

21

51

20K

Yehonathan Litman

@yehonation

5 days ago

@Z1hanW Thank you! Let’s catch up soon :)

0

1

0

219

Yehonathan Litman

@yehonation

5 days ago

Thanks for tweeting about our work AK!

AK

@_akhaliq

5 days ago

Lift4D Harmonizing Single-View 3D Estimation for 4D Reconstruction In-the-Wild

2

143

13

99

31K

0

8

0

5

1K

yehonation retweeted

Xiaoxuan Ma @XiaoxuanMa_

5 days ago

Worried about the lack of real-world 4D training data? No, no... Lift4D takes a different route — diving deeper into 2D & 3D priors for truly in-the-wild 4D reconstruction. 🤟 Complete 360° geometry, appearance & deformation from a single casual video. 🔗https://t.co/Getd1Awe6L

2

66

6

51

10K

yehonation retweeted

Shubham Tulsiani @shubhtuls

5 days ago

We present Lift4D -- allowing dynamic 3D reconstruction of objects from monocular videos with large motions, occlusions, and deformations. See @yehonation's thread for details, and be sure to check out the gallery of interactive in-the-wild results on the project webpage :)

0

43

4

18

4K

Yehonathan Litman

@yehonation

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users