Chenyang Si @scy994 - Twitter Profile

Pinned Tweet

over 2 years ago

🚀🚀Now combining #FreeU with #SDXL, #ControlNet, #LCM, #ScaleCrafter, #Dreambooth, and #Animatediff, you can enhance the generation quality for free! -Project Page: https://t.co/z5duXpY4VF -Code: https://t.co/BwVLQ1SMIZ -Video: https://t.co/W5XKxoEE1H

AK

@_akhaliq

almost 3 years ago

FreeU: Free Lunch in Diffusion U-Net paper page: https://t.co/fpRjbk0CED we uncover the untapped potential of diffusion U-Net, which serves as a "free lunch" that substantially improves the generation quality on the fly. We initially investigate the key contributions of the U-Net architecture to the denoising process and identify that its main backbone primarily contributes to denoising, whereas its skip connections mainly introduce high-frequency features into the decoder module, causing the network to overlook the backbone semantics. Capitalizing on this discovery, we propose a simple yet effective method-termed "FreeU" - that enhances generation quality without additional training or finetuning. Our key insight is to strategically re-weight the contributions sourced from the U-Net's skip connections and backbone feature maps, to leverage the strengths of both components of the U-Net architecture. Promising results on image and video generation tasks demonstrate that our FreeU can be readily integrated to existing diffusion models, e.g., Stable Diffusion, DreamBooth, ModelScope, Rerender and ReVersion, to improve the generation quality with only a few lines of code.

8

732

153

346

566K

4

175

44

85

27K

scy994 retweeted

Tencent Hy

@TencentHunyuan

4 months ago

One static model does not fit all😭 We just dropped our latest work: Functional Neural Memory. Instead of static models, we generate custom "parameters" for every single input. ✅Prompt your model anytime ✅Instant personalization ✅Better instruction following ✅Flexible & dynamic memory (w/o memory bank✌️) (🧵1/6)

11

342

138

202

75K

scy994 retweeted

Songlin Yang @Eddie395402252

4 months ago

🚀The ultimate roadmap for Human-Centric AIGC is here! Our new survey provides a structured and task-grounded lens on Diffusion Models for human-centric content generation. Whether you are a beginner or an expert, this is your one-stop-shop! 📄 Paper: https://t.co/T1CPP9MODS

Eddie395402252's tweet photo. 🚀The ultimate roadmap for Human-Centric AIGC is here!

Our new survey provides a structured and task-grounded lens on Diffusion Models for human-centric content generation. Whether you are a beginner or an expert, this is your one-stop-shop!

📄 Paper: https://t.co/T1CPP9MODS https://t.co/uUOAWWT31B

0

2

1

347

Chenyang Si @scy994

6 months ago

Thanks @_akhaliq for sharing! 🎬 #LongVie2 enables continuous video generation for up to 5 minutes with: 🕹️Strong Controllability 📷 Long-term Visual Fidelity 🔒 Temporal Consistency -Paper: https://t.co/ZZODTS6aWc -Page: https://t.co/VZKDD392Kp -Code: https://t.co/oHhYMUp29l

AK

@_akhaliq

6 months ago

LongVie 2 Multimodal Controllable Ultra-Long Video World Model

3

74

12

52

23K

3

56

9

38

8K

Who to follow

Yushi LAN

@GROS17121524

Ph.D. in CV & Graphics

Ziqi Huang

@ziqi_huang_

Ph.D. student @NTUsg MMLab@NTU - Visual Generation

Brian Li

@Brian_Bo_Li

Brian is building, something new @amilabs Prev works LLaVA-OneVision/LMMs-Eval/LMMs-Engine/OneVision-Encoder.

scy994 retweeted

Ziwei Liu

@liuziwei7

7 months ago

🔥Ultra-Long Video World Model up to 5min🔥 ✨ We introduce #LongVie2, an end-to-end autoregressive video world model that supports continuous video generation lasting up to 5min with: 🕹️ Strong Controllability 📷 Long-term Visual Fidelity 🔒 Temporal Consistency - Project: https://t.co/Z5vxsvpXsy - Code: https://t.co/wcSfQERhXA - Paper: https://t.co/H9Ir5HQ6IY . Thanks to @_akhaliq !

14

866

122

727

83K

scy994 retweeted

AIQUEST

@AiquestAcademy

7 months ago

AI video generators fall apart after 10 seconds. LongVie 2 generates coherent videos for 5 MINUTES straight → with full controllability & zero quality degradation 🎬 3-stage training solves what Sora couldn’t: •Multi-modal control •Degradation-aware learning •History-context alignment World models just got real #AIVideo #GenerativeAI

1

3

2

1

1K

scy994 retweeted

Myron AI artist

@seirdotmk

7 months ago

The "Controllable Video" problem might have just been solved. 🚀 LongVie 2 is a new end-to-end autoregressive framework that masters three things: controllability, long-term visual quality, and temporal consistency. What makes it special: ✅ 5-Minute Continuous Video: Not just a loop, but coherent, evolving generation. ✅ Dense/Sparse Control: Precise world-level supervision for better steering. ✅ State-of-the-Art Benchmarks: Outperforms current models in visual fidelity and "long-range" coherence. The era of "one-minute-plus" AI video that actually stays consistent is finally here. 🔗 https://t.co/QMNRHn0mUF

0

2

1

206

scy994 retweeted

Ming "Tommy" Tang @tangming2005

7 months ago

LongVie 2, an end-to-end autoregressive video world model with: - Strong Controllability - Long-term Visual Fidelity - Temporal Consistency https://t.co/9Q9gJIkMDr

tangming2005's tweet photo. LongVie 2, an end-to-end autoregressive video world model with:
- Strong Controllability
- Long-term Visual Fidelity
- Temporal Consistency https://t.co/9Q9gJIkMDr https://t.co/qpBx7CJqsk

1

11

2

5

2K

scy994 retweeted

Jianxiong Gao @JianxGao

7 months ago

✨ We introduce LongVie 2, an end-to-end autoregressive video world model with: 🕹️ Strong Controllability 🎨 Long-term Visual Fidelity 🔒Temporal Consistency - Page: https://t.co/XjFVgw4uf1 - Paper: https://t.co/mvRCUxYg5t

1

201

34

127

13K

Chenyang Si @scy994

7 months ago

📢 Our #NextVid workshop @ #NeurlPS2025 schedule is live! 🎉 @Kaleidudu @liuziwei7 👏🏻Welcome to the full program! ☀️ Many thanks to our wonderful speakers: @liu_mingyu, @dimadamen, @jiajunwu_cs, @haozhangml, @Enjoy_Yi, @xxunhuang, @sainingxie, @HengshuangZhao

scy994's tweet photo. 📢 Our #NextVid workshop @ #NeurlPS2025 schedule is live! 🎉 @Kaleidudu @liuziwei7

👏🏻Welcome to the full program!

☀️ Many thanks to our wonderful speakers: @liu_mingyu, @dimadamen, @jiajunwu_cs, @haozhangml, @Enjoy_Yi, @xxunhuang, @sainingxie, @HengshuangZhao https://t.co/qwNB9JOdYv

0

9

5

2

2K

Chenyang Si @scy994

7 months ago

Thanks @_akhaliq for sharing! 🎉 📢#PosterCopilot enables precise layout reasoning and multi-round, layer-wise editing for high-quality graphic design. - Paper: https://t.co/Cexw1yWcFc - Project: https://t.co/rPGT90FHGh

AK

@_akhaliq

7 months ago

PosterCopilot Toward Layout Reasoning and Controllable Editing for Professional Graphic Design

2

56

9

32

17K

0

1

0

1

151

scy994 retweeted

Xinting Hu @Kaleidudu

7 months ago

Heading to #NeurIPS2025? Join us for the #NextVid Workshop on Dec 6 in San Diego! We’re exploring the frontier of Video Generation with an outstanding lineup of keynote speakers: Check the thread for today's reveals! 👇

Kaleidudu's tweet photo. Heading to #NeurIPS2025? Join us for the #NextVid Workshop on Dec 6 in San Diego!

We’re exploring the frontier of Video Generation with an outstanding lineup of keynote speakers:

Check the thread for today's reveals! 👇 https://t.co/HxtgbF6TET

2

31

6

7

4K

scy994 retweeted

AK

@_akhaliq

7 months ago

PosterCopilot Toward Layout Reasoning and Controllable Editing for Professional Graphic Design

2

56

9

32

17K

scy994 retweeted

Frank (Haofan) Wang @Haofan_Wang

7 months ago

(1/7) We are delighted to introduce our latest work, PosterCopilot. Given a set of unordered elements, it automatically plans the layout, supports editing of any element, and ultimately produces a professional-grade poster.

Haofan_Wang's tweet photo. (1/7) We are delighted to introduce our latest work, PosterCopilot. Given a set of unordered elements, it automatically plans the layout, supports editing of any element, and ultimately produces a professional-grade poster. https://t.co/V5d7SJ0YG1

5

109

13

82

10K

Chenyang Si @scy994

7 months ago

🔥#PosterCopilot: Professional Poster Design with Controllable Editing🔥 #PosterCopilot enables precise layout reasoning and multi-round, layer-wise editing for high-quality graphic design. Project Page：https://t.co/rPGT90FHGh Paper：https://t.co/xmnBqTrRGV

0

4

1

2

399

Chenyang Si @scy994

8 months ago

👍

Bingyi Kang

@bingyikang

8 months ago

After a year of team work, we're thrilled to introduce Depth Anything 3 (DA3)! 🚀 Aiming for human-like spatial perception, DA3 extends monocular depth estimation to any-view scenarios, including single images, multi-view images, and video. In pursuit of minimal modeling, DA3 reveals two key insights: 💎 A plain transformer (e.g., vanilla DINO) is enough. No specialized architecture. ✨ A single depth-ray representation is enough. No complex 3D tasks. Three series of models have been released: the main DA3 series, a monocular metric estimation series, and a monocular depth estimation series. The core team members, aside from me: @HaotongLin, Sili Chen, Jun Hao Liew, @donydchen. 👇(1/n) #DepthAnything3

79

4K

493

2K

515K

0

3

0

1

341

scy994 retweeted

NTU Singapore

@NTUsg

9 months ago

🏆 Congrats to #NTUsg Prof Ng Geok Ing on the 🇸🇬 President’s Technology Award 2025. A pioneer in Gallium Nitride (#GaN) – found in fast chargers, EVs, satellites & defence – he built 🇸🇬’s global standing in this field and led the creation of the national GaN centre. 👏 We also congratulate Young Scientist Award recipient Assoc Prof Liu Ziwei, who was recognised for advancing #AI in 3D & 4D vision, as well as digital twins, with impact on healthcare, education and other sectors. #PSTA @NTU_EEE @NTU_ccds

NTUsg's tweet photo. 🏆 Congrats to #NTUsg Prof Ng Geok Ing on the 🇸🇬 President’s Technology Award 2025. A pioneer in Gallium Nitride (#GaN) – found in fast chargers, EVs, satellites & defence – he built 🇸🇬’s global standing in this field and led the creation of the national GaN centre. 👏 We also congratulate Young Scientist Award recipient Assoc Prof Liu Ziwei, who was recognised for advancing #AI in 3D & 4D vision, as well as digital twins, with impact on healthcare, education and other sectors. #PSTA @NTU_EEE @NTU_ccds

0

37

5

0

11K

Chenyang Si @scy994

11 months ago

🎯【Call for Papers: NeurIPS 2025 Workshop NextVid： What Makes a Good Video: Next Practices in Video Generation and Evaluation】 ⏰ Key dates: * Submission deadline: Aug 22, 2025 (AOE)  📌 Details: * Website: https://t.co/fNemjz03VK

scy994's tweet photo. 🎯【Call for Papers: NeurIPS 2025 Workshop NextVid： What Makes a Good Video: Next Practices in Video Generation and Evaluation】
⏰ Key dates:
* Submission deadline: Aug 22, 2025 (AOE)
 📌 Details:
* Website: https://t.co/fNemjz03VK https://t.co/rAm09YArtI

0

9

4

1

2K

scy994 retweeted

Ziwei Liu

@liuziwei7

11 months ago

🔥1-min Interactive Video Generation with Multimodal Control🔥 Towards *long-context world model*, #LongVie is an end-to-end autoregressive framework for controllable ultra-long video generation - Page: https://t.co/kKNHhjEmrl - Paper: https://t.co/M5wGkpi0Ve . Thanks @_akhaliq

0

147

33

77

14K

scy994 retweeted

DailyPapers

@HuggingPapers

11 months ago

NVIDIA researchers introduce LongVie, a new framework for multimodal-guided controllable ultra-long video generation. It tackles key issues like temporal inconsistency and visual degradation.

HuggingPapers's tweet photo. NVIDIA researchers introduce LongVie, a new framework for multimodal-guided controllable ultra-long video generation.

It tackles key issues like temporal inconsistency and visual degradation. https://t.co/5UgEzOtHt7

1

30

12

11

2K

scy994 retweeted

DailyPapers

@HuggingPapers

11 months ago

LongVie uses unified noise, global control normalization, and multimodal guidance to ensure consistency & quality in videos up to one minute. Explore the research: https://t.co/e8AU52lbt7 See demo: https://t.co/b6ybl11z3v

0

5

1

0

1K

Chenyang Si

@scy994

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users