Jing He @Jingheya - Twitter Profile

3 months ago

Thanks for sharing, @_akhaliq ! Feel free to check out our new SoTA video depth estimator.🥳🤗👏💪 Useful links ⬇️ Demo: https://t.co/sFTHW1BNbq Code: https://t.co/WXPJ3FkGwb Paper: https://t.co/IDRUJECeGf Page: https://t.co/cLbyaLa2c8

AK

@_akhaliq

3 months ago

DVD Deterministic Video Depth Estimation with Generative Priors paper: https://t.co/Eh41hneFEg

7

188

26

126

62K

3

175

28

115

32K

Jingheya retweeted

Bilawal Sidhu

@bilawalsidhu

5 months ago

One of the wildest emergent capabilities of Genie 3 is that maps actually work. As I walk around the forest, the GPS display updates its heading in real time. Remember. There is no game engine here. This is an AI hallucinating a working navigational instrument purely from next frame prediction. 🤯

73

3K

208

671

247K

Jing He @Jingheya

5 months ago

insane!!!

Haven Feng

@HavenFeng

5 months ago

✨Thinking with Blender~ Meet VIGA: a multimodal agent that autonomously codes 3D/4D blender scenes from any image, with no human, no training! @berkeley_ai #LLMs #Blender #Agent 🧵1/6

72

2K

306

2K

338K

0

2

0

146

Jing He @Jingheya

6 months ago

Thanks @_akhaliq for sharing our work! 🤗Our StereoPilot is an efficient feed-forward architecture that leverages pre-trained video diffusion priors to directly synthesize novel views for stereo conversion. 🥳 Page: https://t.co/rNfXYZoRD9

AK

@_akhaliq

6 months ago

StereoPilot Learning Unified and Efficient Stereo Conversion via Generative Priors

2

30

2

16

12K

0

3

0

174

Jing He @Jingheya

7 months ago

Thanks @_akhaliq for sharing our work!!!🌹 Lotus-2 effectively analyzes the DiT-based rectified-flow formulation and leverages the pre-trained generative model as a deterministic world prior, achieving SoTA performance with significantly finer details.🥳

AK

@_akhaliq

7 months ago

Lotus-2 Advancing Geometric Dense Prediction with Powerful Image Generative Model

4

397

53

255

52K

1

18

5

7

6K

Jingheya retweeted

Haodong Li

@haodongli00

7 months ago

Thanks so much for sharing @_akhaliq ! The code and @huggingface demos are publicly released, please have a try! 🤗demos: https://t.co/QvGryEhgDk; https://t.co/2pAVCS8jQI Code: https://t.co/NhU6XZQ2mw

1

15

3

8

5K

Jingheya retweeted

Haodong Li

@haodongli00

9 months ago

Thanks @_akhaliq for your kind promotion! The 🤗 @huggingface @Gradio demo and the inference code are released now! Please give it a try!🚀 Huggingface Space: https://t.co/wAj2TVIlwe Github Repo: https://t.co/0i6CIqpxLC Project Page: https://t.co/z7tL4nBWjn

2

86

15

54

8K

Jingheya retweeted

Haodong Li

@haodongli00

9 months ago

Thanks @_akhaliq for sharing our work! The code and demo will be public soon. Please have a look if you are interested.🥰🥰🥰 Paper: https://t.co/rywt2tPEKz, https://t.co/2b4lEhKk05 Github: https://t.co/0i6CIqpxLC Project page: https://t.co/z7tL4nBWjn

1

92

17

31

15K

Jingheya retweeted

Haodong Li

@haodongli00

9 months ago

Thanks for sharing!!! @_akhaliq 🔥 This work, DA^2: Depth Anything in Any Direction, focuses on 360° depth estimation. We first built a large-scale training dataset (scaling up existing data nearly 10 times!), then trained a Sphere-aware ViT on the scaled-up data. Finally, DA^2 achieves remarkable geometric fidelity and strong zero-shot generalization, which we believe can enable various 3D-scene-related applications, e.g., world models. Please check out the links below for more details: 😊 Paper: https://t.co/wlX7LLmPa6 Huggingface daily paper: https://t.co/LvPBUmhiOP Code (coming soon): https://t.co/2WaDeYnRfd Project page: https://t.co/T0ANj8pe26 A beautiful collaboration with Tencent Hunyuan @TencentHunyuan @TencentGlobal! Thanks to all co-authors! @MaskerZW, @Jingheya, @LeoLau_yuhao, @XinLin321, Xin Yang, @YingCongChen1, Chunchao Guo

7

157

28

95

26K

Jing He @Jingheya

9 months ago

Sharing our new panoramic depth estimation method!🥰🤗🥳 It delivers remarkable geometric fidelity and exceptional zero-shot generalization!🤩

AK

@_akhaliq

9 months ago

DA^2 Depth Anything in Any Direction

4

350

51

235

87K

0

5

1

0

316

Jing He @Jingheya

over 1 year ago

@RolandWank Yes! You can do video depth estimation using our demo (https://t.co/SjwBu46wgT). Our latest depth model, trained in disparity space, brings more consistency!

0

1

0

1

345

Jing He @Jingheya

over 1 year ago

Thrilled to share that our paper, "LOTUS: Diffusion-based Visual Foundation Model for High-quality Dense Prediction", has been accepted to #ICLR2025! 🎉 Many thanks to @YingCongChen1 @haodongli00 and all co-authors! Github: https://t.co/U6IohPw6lS

4

430

66

231

28K

Jing He @Jingheya

over 1 year ago

@ring_hyacinth The results are way too good!!

0

1K

Jingheya retweeted

Haodong Li

@haodongli00

over 1 year ago

@chrisoffner3d @ChongZitaZhang Just added a resizing preprocessing step that forces the input image to a resolution between 384 and 1024 in the demo (https://t.co/z6QcYypxyp). Here are some examples:

1

6

1

1K
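
The resizing step described in the tweet above could look roughly like the sketch below. The tweet does not say which side of the image is constrained or how the aspect ratio is handled, so this version assumes the longer side is clamped to the range 384 to 1024 and the aspect ratio is preserved. The function name `clamp_resolution` and its parameters are hypothetical and are not taken from the LOTUS demo code.

```python
def clamp_resolution(width, height, min_side=384, max_side=1024):
    """Return (new_width, new_height) with the longer side clamped to
    [min_side, max_side], keeping the aspect ratio.

    Assumption: the demo constrains the longer side; the tweet only says
    the resolution ranges "from 384 to 1024".
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    longer = max(width, height)
    # Clamp the longer side into the allowed range.
    target = min(max(longer, min_side), max_side)
    scale = target / longer
    # Round to the nearest pixel and never go below 1 pixel.
    return max(1, round(width * scale)), max(1, round(height * scale))


if __name__ == "__main__":
    print(clamp_resolution(2048, 1024))  # oversized input is scaled down
    print(clamp_resolution(200, 100))    # undersized input is scaled up
    print(clamp_resolution(800, 600))    # in-range input is left unchanged
```

The returned size can then be passed to whatever image library performs the actual resize before inference.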

Jingheya retweeted

Haodong Li

@haodongli00

over 1 year ago

That makes sense, because our model is not trained on (RGB, Depth) pairs at multiple resolutions, so it performs best when the ratio of resolution to image content is close to that of the training data.🥲 Inference on images that are too large or too small leads to artifacts due to the increasing domain gap between inference and training.🥲

3

6

1

0

619

Jingheya retweeted

Haodong Li

@haodongli00

over 1 year ago

Hi @chrisoffner3d , thanks so much for your interest! The latest version of LOTUS - Depth trained in disparity space works for your cases🥹🥹🥹 Here is the @huggingface @Gradio demo: https://t.co/h4A6upCEiT Please have a try using this link if the original version fails on some cases🤣🤣🤣

0

17

1

6

6K

Jingheya retweeted

Haodong Li

@haodongli00

over 1 year ago

@chrisoffner3d

2

10

1

2K
