WeiChen

@wei_chen_ai

Phd @SCUT1918, intern @RIKEN_AIP_EN. Probabilistic modeling & generations, post-training, including their applications to trustworthy and safe AI

Joined July 2024

197 Following

53 Followers

32 Posts

Pinned Tweet

WeiChen @wei_chen_ai

about 1 month ago

🎉 Our paper about Preference Optimization has been accepted to ICML 2026! We unify entangled & disentangled objectives via incentive–score decomposition, derive the Disentanglement Band for ideal training dynamics: suppress loser while preserving winner. #ICML2026

wei_chen_ai's tweet photo. 🎉 Our paper about Preference Optimization has been accepted to ICML 2026!

We unify entangled & disentangled objectives via incentive–score decomposition, derive the Disentanglement Band for ideal training dynamics: suppress loser while preserving winner.

#ICML2026 https://t.co/25rOcypH7Q

4

46

2

10

4K

WeiChen @wei_chen_ai

10 days ago

Repo: https://t.co/gYefUpERHg

0

0

0

0

16

wei_chen_ai retweeted

about 1 month ago

Flow-OPD: On-Policy Distillation for Flow Matching Models The first integration of On-Policy Distillation into Flow Matching models. Replaces sparse scalar rewards with dense trajectory-level supervision. Achieves 0.92 GenEval and 0.94 OCR accuracy on SD-3.5-Medium, with +18pt improvement over base.

HuggingPapers's tweet photo. Flow-OPD: On-Policy Distillation for Flow Matching Models

The first integration of On-Policy Distillation into Flow Matching models.

Replaces sparse scalar rewards with dense trajectory-level supervision.

Achieves 0.92 GenEval and 0.94 OCR accuracy on SD-3.5-Medium, with +18pt improvement over base.

1

53

11

45

3K

WeiChen @wei_chen_ai

about 1 month ago

@stjohn2007 You are absolutely right. My density-chasm experience confirms ADPO's theoretical robustness to abnormal responses, especially late in preference optimization training. Thank you for this insightful work.

0

1

0

0

20

WeiChen @wei_chen_ai

about 1 month ago

@JoshuaRenyi @HuggingPapers An impressive work! Our work on ICML2026 introduces the disentanglement band to analyze preference update interference, inspired by your work. https://t.co/H6zWp2XnJt

WeiChen @wei_chen_ai

about 1 month ago

🎉 Our paper about Preference Optimization has been accepted to ICML 2026! We unify entangled & disentangled objectives via incentive–score decomposition, derive the Disentanglement Band for ideal training dynamics: suppress loser while preserving winner. #ICML2026

wei_chen_ai's tweet photo. 🎉 Our paper about Preference Optimization has been accepted to ICML 2026!

We unify entangled & disentangled objectives via incentive–score decomposition, derive the Disentanglement Band for ideal training dynamics: suppress loser while preserving winner.

#ICML2026 https://t.co/25rOcypH7Q

4

46

2

10

4K

1

0

0

0

27

WeiChen @wei_chen_ai

about 1 month ago

@HuggingPapers Interesting! Our work introduces the disentanglement band: a conceptual tool for analyzing how preference updates interfere with the winner vs. loser responses. It helps diagnose how suppressing the loser may harm the winner. Also inspired by @JoshuaRenyi. https://t.co/H6zWp2XnJt

WeiChen @wei_chen_ai

about 1 month ago

🎉 Our paper about Preference Optimization has been accepted to ICML 2026! We unify entangled & disentangled objectives via incentive–score decomposition, derive the Disentanglement Band for ideal training dynamics: suppress loser while preserving winner. #ICML2026

wei_chen_ai's tweet photo. 🎉 Our paper about Preference Optimization has been accepted to ICML 2026!

We unify entangled & disentangled objectives via incentive–score decomposition, derive the Disentanglement Band for ideal training dynamics: suppress loser while preserving winner.

#ICML2026 https://t.co/25rOcypH7Q

4

46

2

10

4K

0

0

0

0

71

WeiChen @wei_chen_ai

about 1 month ago

@iiiShiguiLi 🥰🥰🥰

0

0

0

0

21

WeiChen @wei_chen_ai

about 1 month ago

🎉 Our paper about Preference Optimization has been accepted to ICML 2026! We unify entangled & disentangled objectives via incentive–score decomposition, derive the Disentanglement Band for ideal training dynamics: suppress loser while preserving winner. #ICML2026

wei_chen_ai's tweet photo. 🎉 Our paper about Preference Optimization has been accepted to ICML 2026!

We unify entangled & disentangled objectives via incentive–score decomposition, derive the Disentanglement Band for ideal training dynamics: suppress loser while preserving winner.

#ICML2026 https://t.co/25rOcypH7Q

4

46

2

10

4K

WeiChen @wei_chen_ai

about 1 month ago

@daniel_sc4 Interesting! I recently also work on token-level uncertainty for clarifying easy/hard tasks. 🙋‍♂️

0

1

0

0

33

WeiChen @wei_chen_ai

about 1 month ago

@shinyzenith72 @murari_ai @debdeeplikesai Congratulations!

0

1

0

0

42

WeiChen @wei_chen_ai

about 1 month ago

@HadyHaji seems interesting view of preference optimization, and I recently work on a similar idea. Could you please share an link of this paper to me?

0

0

0

0

7

wei_chen_ai retweeted

Molei Tao @MoleiTaoMath

about 1 month ago

Plz consider submitting high quality works to ICML 2026 Workshop on Foundations of Deep Generative Models, and interact with the cool community in the summer of vibrant Seoul, South Korea! https://t.co/aQov8ddYIp Submit at https://t.co/I5VVL3IhJG by 4/30.

MoleiTaoMath's tweet photo. Plz consider submitting high quality works to
ICML 2026 Workshop on Foundations of Deep Generative Models,
and interact with the cool community in the summer of vibrant Seoul, South Korea!
https://t.co/aQov8ddYIp

Submit at https://t.co/I5VVL3IhJG by 4/30. https://t.co/Kg6EPLtayj

0

45

5

4

8K

WeiChen @wei_chen_ai

about 1 month ago

@yoshitomo_cs That's a really constructive suggestion!

0

3

0

0

106

WeiChen @wei_chen_ai

about 1 month ago

@AkariAsai congratulations !

0

0

0

0

11

WeiChen @wei_chen_ai

about 1 month ago

Our paper: https://t.co/APEBD8wVea Our open-sourced code: https://t.co/M6wdrcms80 My homepage: https://t.co/OexkIfSxC0

0

3

1

0

160

WeiChen @wei_chen_ai

about 1 month ago

Entangled: chosen & rejected rewards are coupled (e.g., DPO). Disentangled: they update independently (e.g., DIL). Our work unifies both.

0

3

1

0

207

WeiChen @wei_chen_ai

about 2 months ago

@pigjunebaba A very impressive slogan!

0

2

0

0

192

WeiChen @wei_chen_ai

about 2 months ago

不诱于誉，不恐于诽，率道而行，端然正己

about 2 months ago

🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length. 🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models. 🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice. Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today! 📄 Tech Report: https://t.co/drlDrxkYtp 🤗 Open Weights: https://t.co/T13Y8i7SDM 1/n

deepseek_ai's tweet photo. 🚀 DeepSeek-V4 Preview is officially live & open-sourced! Welcome to the era of cost-effective 1M context length.

🔹 DeepSeek-V4-Pro: 1.6T total / 49B active params. Performance rivaling the world's top closed-source models.
🔹 DeepSeek-V4-Flash: 284B total / 13B active params. Your fast, efficient, and economical choice.

Try it now at https://t.co/GCdiMzk1Dl via Expert Mode / Instant Mode. API is updated & available today!

📄 Tech Report: https://t.co/drlDrxkYtp
🤗 Open Weights: https://t.co/T13Y8i7SDM

1/n

2K

46K

8K

10K

10M

0

2

0

0

37

WeiChen @wei_chen_ai

about 2 months ago

Paper link: https://t.co/T2keVunzNP My homepage: https://t.co/OexkIfSxC0

0

1

0

0

29

WeiChen @wei_chen_ai

about 2 months ago

Score-based methods: theory says the path doesn't matter, but practice says it does. We found why — path variance — and learned the optimal interpolation path in closed form. No heuristics, just math. #ICLR #ICLR2026 #Rio

wei_chen_ai's tweet photo. Score-based methods: theory says the path doesn't matter, but practice says it does.
We found why — path variance — and learned the optimal interpolation path in closed form. No heuristics, just math.

#ICLR #ICLR2026 #Rio https://t.co/gUuEqvPyLh

2

3

0

0

140

Last Seen Users on Sotwe

Trends for you

Most Popular Users