Guandao Yang @guandaoyang - Twitter Profile

Pinned Tweet

11 days ago

MeshFlow brings high-quality mesh with interactive speed. It's time to make mesh generation flow! Arxiv: https://t.co/LvaAoq1GoH Web: https://t.co/y6DsntORbz Code: https://t.co/l1GHR9zjCb

Qi Sun @qisun01

11 days ago

Excited to share MeshFlow — a new approach that can generate meshes with a fraction of seconds, while achieving state-of-the-art generation quality. Secret sources? Instead of autoregressive models, use equivariant flow-matching!

2

45

8

23

44K

1

114

17

62

43K

GuandaoYang retweeted

Jeong Joon Park @jjpark3D

7 days ago

Excited to share 𝐄𝐫𝐫𝐨𝐫-𝐂𝐨𝐧𝐝𝐢𝐭𝐢𝐨𝐧𝐞𝐝 𝐍𝐞𝐮𝐫𝐚𝐥 𝐒𝐨𝐥𝐯𝐞𝐫𝐬 (𝐄𝐍𝐒), a PDE framework that recurrently corrects its own prediction by reading the PDE residual field rather than minimizing it! Website: https://t.co/P9TJ3LETFH

1

7

2

4

624

Guandao Yang

@GuandaoYang

10 days ago

There are three reasons why it's called "Functional Attention." 1⃣We make functions, instead of tokens, as first-class citizens for attention module. 2⃣It was inspired by the geometry processing concepts of functional maps. 3⃣It works!

Simon Weber @SimWeberTUM

17 days ago

What if attention wasn't about matching tokens, but operating in function space? Glad to share our #ICML2026 paper: 📄 Functional Attention: From Pairwise Affinities to Functional Correspondences w/ @Jiefang_Xiao @GaoMaolin @stevenygd Daniel Cremers 📄 https://t.co/rhn9NtwrBm

SimWeberTUM's tweet photo. What if attention wasn't about matching tokens, but operating in function space?

Glad to share our #ICML2026 paper:
📄 Functional Attention: From Pairwise Affinities to Functional Correspondences

w/ @Jiefang_Xiao @GaoMaolin @stevenygd Daniel Cremers
📄 https://t.co/rhn9NtwrBm https://t.co/8V3dbshHvt

17

1K

160

1K

123K

9

592

64

556

67K

Guandao Yang

@GuandaoYang

about 1 month ago

@xxunhuang Congratulations Xun!

0

1

0

261

Who to follow

Yin Cui

@YinCuiCV

Research Scientist @NVIDIA | Formerly @Google, @Cornell | Views are my own

Kaichun Mo

@KaichunMo

Senior Research Scientist at NVIDIA Cosmos Lab; Previously CS Ph.D. from Stanford

Songyou Peng

@songyoupeng

Teaching multimodality to Gemini, Nano Banana, and more @GoogleDeepMind. PhD from @ETH and @MPI_IS.

GuandaoYang retweeted

European Conference on Computer Vision #ECCV2026 @eccvconf

about 1 month ago

The call for the #ECCV2026 Doctoral Consortium is now available. Details: https://t.co/ySc2QrkD24

1

14

2

3

7K

GuandaoYang retweeted

European Conference on Computer Vision #ECCV2026 @eccvconf

about 2 months ago

The #ECCV2026 reviewer discussion period has started! Reviewers should carefully read the authors’ rebuttal, consider the other reviews, and actively participate in the discussion BEFORE finalizing their reviews.

eccvconf's tweet photo. The #ECCV2026 reviewer discussion period has started! Reviewers should carefully read the authors’ rebuttal, consider the other reviews, and actively participate in the discussion BEFORE finalizing their reviews. https://t.co/hprvNPVltq

1

24

3

2

15K

GuandaoYang retweeted

European Conference on Computer Vision #ECCV2026 @eccvconf

2 months ago

It’s #ECCV2026 review release (anywhere on earth) day! Good luck 🤞

2

123

11

2

33K

GuandaoYang retweeted

European Conference on Computer Vision #ECCV2026 @eccvconf

5 months ago

What’s New at #ECCV2026 Malmo 🇸🇪? Please read the important policy updates (especially with regard to ECCV 2024) on our “What’s new?” page. Notably, this year, we introduced “Contribution Types”, a mechanism for tagging submissions &reviewers to facilitate fair evaluation. 1/2

eccvconf's tweet photo. What’s New at #ECCV2026 Malmo 🇸🇪? Please read the important policy updates (especially with regard to ECCV 2024) on our “What’s new?” page.

Notably, this year, we introduced “Contribution Types”, a mechanism for tagging submissions &reviewers to facilitate fair evaluation.

1/2 https://t.co/y4zfQi8UoH

2

28

8

5

10K

GuandaoYang retweeted

European Conference on Computer Vision #ECCV2026 @eccvconf

5 months ago

The #ECCV2026 Malmo 🇸🇪 call for papers is now available. Check it out!

1

37

6

2

5K

GuandaoYang retweeted

Mira Murati

@miramurati

8 months ago

Combining the benefits of RL and SFT with on-policy distillation, a promising approach for training small models for domain performance and continual learning.

99

3K

218

1K

484K

GuandaoYang retweeted

Gene Chou @gene_ch0u

about 1 year ago

We've released all code and models for FlashDepth! It produces depth maps from a 2k, streaming video in real-time. This was a really fun course project inspired by discussions with @mohsaied and @stevenygd and we look forward to presenting it at #ICCV2025. GitHub: https://t.co/IHPYJEZFIj Project page: https://t.co/dGHdXJKLKB

6

547

68

338

38K

GuandaoYang retweeted

Bharath Hariharan @BharathHarihar3

about 1 year ago

For those at CVPR, Aditya will be presenting this poster tomorrow at 10:30 (Exhibit hall D, Poster #34). Come hear about why neural field derivatives are noisy, and how we resurrect image processing ideas for neural fields!

0

8

3

0

2K

Guandao Yang

@GuandaoYang

about 1 year ago

@jon_barron Happy Birthday Jon!!!

0

1

0

423

Guandao Yang

@GuandaoYang

about 1 year ago

Really impressive work on real-time video generation! I’m a fan of the principle of closing the train-test gap!

Xun Huang

@xxunhuang

about 1 year ago

Real-time video generation is finally real — without sacrificing quality. Introducing Self-Forcing, a new paradigm for training autoregressive diffusion models. The key to high quality? Simulate the inference process during training by unrolling transformers with KV caching.

29

902

131

645

181K

0

16

0

2K

GuandaoYang retweeted

Ryan Po

@Po_lhr

about 1 year ago

Most video models struggle to feel like real worlds. They forget what’s just out of view, slow down as videos get longer, or breaks causality. We think State Space Models are a natural fit for models with: 🧠 long-term memory across hundreds of frames ⚡ constant-speed generation, even for long rollouts ⏩ fully causal dynamics, fit for real-time interaction

2

52

9

21

5K

Guandao Yang

@GuandaoYang

about 1 year ago

Which multimodal LLMs are ready to take on 3D editing tasks in Blender? We present BlenderGym — the first benchmark to systematically evaluate them. We also show that the right inference strategy can make all the difference! Check out our #CVPR2025 Highlight paper to learn more! 👇

Yunqi (Richard) Gu

@richard_yunqigu

about 1 year ago

Which multimodal LLM should you be using to edit graphics in Blender? Today, we’re releasing our #CVPR2025 Highlight🌟 work, #BlenderGym 🏋️‍♀️, the first agentic 3D graphics editing benchmark that will tell you exactly how multimodal LLMs compare in their Blender-editing skills. What'd we find? 🧵👇

8

83

36

38

23K

0

46

4

11

6K

Guandao Yang

@GuandaoYang

over 1 year ago

@jon_barron Are we making some implicit assumptions that one 3D model is comparable to only one image or one word? If we can make each 3D model worth thousands of images/words, then this data gap can be perhaps smaller.

2

11

0

3

1K

GuandaoYang retweeted

Gordon Wetzstein

@GordonWetzstein

over 1 year ago

Introducing AIpparel @CVPR 2025 - the first multimodal foundation model for digital garments. https://t.co/YDvxGqgmgJ 1/6

1

179

23

85

13K

GuandaoYang retweeted

youming.deng @denghilbert

over 1 year ago

How can we use wide-FOV cameras for reconstruction? We propose self-calibration Gaussian Splatting that jointly optimizes camera parameters, lens distortion, and 3D Gaussian representations to directly reconstruct from a set of wide-angle captures. page: https://t.co/OQ4C20VsvU

2

185

34

100

13K

Guandao Yang

@GuandaoYang

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users