Xianghao Kong @xk_theo7 - Twitter Profile

Pinned Tweet

3 days ago

Most diffusion research today asks: How can we sample faster? But I think another question is equally important: Are we training diffusion models in the right way? https://t.co/WYHNXJamhi

1

4

1

162

Xianghao Kong

@xk_theo7

about 3 hours ago

@mo_norouzi physically grounded image model🤣🤣

0

1

0

64

Xianghao Kong

@xk_theo7

3 days ago

This is my first time turning some of my research thoughts into a blog post, so it may contain errors or unclear arguments. Any suggestions, comments, or feedback would be greatly appreciated🙌

0

46

Xianghao Kong

@xk_theo7

3 days ago

Most diffusion research today asks: How can we sample faster? But I think another question is equally important: Are we training diffusion models in the right way? https://t.co/WYHNXJamhi

1

4

1

162

Xianghao Kong

@xk_theo7

3 days ago

This may matter even more for recurrent diffusion and interactive generative models, where small errors can accumulate over long-term rollouts.

1

0

65

xk_theo7 retweeted

Zyphra

@ZyphraAI

about 1 month ago

Today we're releasing ZAYA1-74B-Preview, a major milestone in scaling pretraining on @AMD. ZAYA1-74B-Preview is a 4B active / 74B total MoE. This preview model is a strong pre-RL base checkpoint. The final post-trained reasoning model is coming soon. 🧵

ZyphraAI's tweet photo. Today we're releasing ZAYA1-74B-Preview, a major milestone in scaling pretraining on @AMD.

ZAYA1-74B-Preview is a 4B active / 74B total MoE.

This preview model is a strong pre-RL base checkpoint. The final post-trained reasoning model is coming soon. 🧵 https://t.co/2zJ3q8jEdV

24

798

87

227

1M

xk_theo7 retweeted

Yunong Liu@CVPR

@yunongliu1

3 months ago

Really excited to see Uni-1 out in the world 🔥Our first unified model. The range of things this model can do is wild: image-to-~100 styles, manga generation, multi-ref with strong identity preservation, temporal storytelling, sketch-to-image, spatial reasoning, multilingual infographics, layering… the capability range is honestly unreal. this is just the start 🫡 check out the blog to learn more https://t.co/B8Nedl86Dk Proud of the team and what we’re building at @LumaLabsAI 🚀

yunongliu1's tweet photo. Really excited to see Uni-1 out in the world 🔥Our first unified model.

The range of things this model can do is wild: image-to-~100 styles, manga generation, multi-ref with strong identity preservation, temporal storytelling, sketch-to-image, spatial reasoning, multilingual infographics, layering… the capability range is honestly unreal. this is just the start 🫡 check out the blog to learn more https://t.co/B8Nedl86Dk

Proud of the team and what we’re building at @LumaLabsAI 🚀

2

61

8

16

7K

Xianghao Kong

@xk_theo7

5 months ago

@hudsonyeoce Cool cool! We’ve mastered alignment for nouns in image/video models, but verbs (or more abstract terms) are the real challenge in video. Seeing this kind of motion control proves Runway’s cracking the code on abstract concepts🔥

0

1

0

33

Xianghao Kong

@xk_theo7

6 months ago

🤯💥

karim_yourself

@karim_yourself

6 months ago

Jujutsu Kaisen Live Action.

24

188

18

55

12K

0

3

0

343

Xianghao Kong

@xk_theo7

6 months ago

I’m currently in transit to San Diego for NeurIPS. If you’re also killing time, feel free to check out a 2-minute-30-second horror sci-fi short film Michael and I recently created. We’d love any comments or likes: https://t.co/MW9h7PIHah Looking forward to catching up at the venue! 🎥

0

227

Xianghao Kong

@xk_theo7

6 months ago

Why must robots be human-shaped? Bringing impossible creatures into the real world can create just as beautiful an emotional bond ❤️

Disneyland Paris EN @DisneyParis_EN

7 months ago

It's official! From 29 March 2026 you'll be able to discover World of Frozen and lots of other experiences at Disney Adventure World! 🤩

46

12K

3K

1K

532K

0

264

Xianghao Kong

@xk_theo7

10 months ago

I feel the debate shouldn’t only be about whether DiT is effective, but also about how information preservation is the key to accelerating diffusion training. Our MicroDiT (https://t.co/uRWJRzAJRp) paper showed this: by letting masked token info mix into unmasked ones, we can cut down a lot of tokens with only minor performance loss. Interestingly, two months ago, when I caught up with @StefanABaumann at #CVPR, we discussed how TREAD and MicroDiT are conceptually similar from info perspective. Maybe it’s time to look at diffusion through an information-theoretic lens: from post-training (for the better alignment) to latent space curation, I believe this could lead to some really exciting discoveries!

サメQCU @sameQCU

10 months ago

bros, DiT is wrong. it's mathematically wrong. it's formally wrong. there is something wrong with it

23

1K

69

1K

252K

2

18

2

9

2K

Xianghao Kong

@xk_theo7

10 months ago

Shout out for Doji!

Doji

@doji_com

10 months ago

Introducing Look Studio. Style looks from scratch with 1M+ products from designer brands - including shoes, multiple layers and more. Reply for an invite.

135

394

37

214

79K

0

3

0

308

Xianghao Kong

@xk_theo7

10 months ago

@jfischoff Yura Borisov?😂

0

393

Xianghao Kong

@xk_theo7

10 months ago

@sleenyre Loving the post-training insights 👏

0

1

0

69

xk_theo7 retweeted

Reka

@RekaAILabs

11 months ago

Excited to introduce Reka Vision, an agentic visual understanding and search platform. Transform your unstructured multimodal data into insights and actions.

7

117

24

52

486K

xk_theo7 retweeted

Midjourney

@midjourney

12 months ago

Introducing our V1 Video Model. It's fun, easy, and beautiful. Available at 10$/month, it's the first video model for *everyone* and it's available now.

359

4K

566

1K

2M

xk_theo7 retweeted

Özgür Kara

@ozgurkara99

12 months ago

+ @cveu_workshop starting at 1:00 PM, 207 A-D.

0

4

1

573

Xianghao Kong

@xk_theo7

12 months ago

Heading to Nashville 🎸 for @CVPR (06/11 - 06/16)! Always excited to catch up with old friends and make new connections. Let’s grab a coffee ☕️ or chat about diffusion models, post-training, or just life! #CVPR2025 #Diffusion #GenerativeAI #Nashville

0

3

0

296

Xianghao Kong

@xk_theo7

Last Seen Users on Sotwe

Trends for you

Most Popular Users