Tianbao (TB) Yang @yang_ML - Twitter Profile

Tianbao (TB) Yang @yang_ML

about 2 months ago

Check this out

Tianbao (TB) Yang @yang_ML

about 2 months ago

1/4 🧠⚡ Can reasoning models think faster without thinking worse? Recent systems like Meta Muse Spark use length penalties in training to reduce unnecessary output tokens. A simple approach is to combine penalty-based reward with correctness-based reward in GRPO.
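A minimal sketch of that simple approach, assuming a linear length penalty subtracted from a 0/1 correctness reward and the usual GRPO group normalization (reward minus group mean, divided by group standard deviation). The penalty form, `ALPHA`, `MAX_LEN`, and the rollout numbers are hypothetical and are not taken from the thread or the paper. It is meant only to show how a correct but long rollout can end up with a negative advantage once the two rewards are mixed.

```python
from statistics import mean, pstdev

def grpo_advantages(rewards, eps=1e-8):
    """Group-normalized advantages: (r - mean) / std over one prompt's rollouts."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

def combined_reward(correct, length, max_len, alpha):
    """0/1 correctness reward minus a linear length penalty (assumed form)."""
    return (1.0 if correct else 0.0) - alpha * (length / max_len)

# Hypothetical group of rollouts for one prompt: (is_correct, token_count)
rollouts = [(True, 200), (True, 400), (True, 4000), (False, 300)]
MAX_LEN = 4000   # assumed generation budget
ALPHA = 0.5      # assumed penalty weight

rewards = [combined_reward(ok, n, MAX_LEN, ALPHA) for ok, n in rollouts]
advantages = grpo_advantages(rewards)

for (ok, n), adv in zip(rollouts, advantages):
    print(f"correct={ok!s:5} tokens={n:4d} advantage={adv:+.3f}")
```

In this toy group the third rollout is correct, yet its combined reward falls below the group mean, so its advantage is negative and the update would push probability away from a valid answer.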

Tianbao (TB) Yang @yang_ML

about 2 months ago

4/4 📊 The results are remarkable: DRPO surpasses six strong GRPO-based baselines, achieving much shorter reasoning traces while maintaining or even improving performance. 📄 Paper: https://t.co/X3l4byJtYC 💻 Code: https://t.co/lshIFCPiMu

Tianbao (TB) Yang @yang_ML

about 2 months ago

3/4 🚀 In our recent ICLR paper, we propose DRPO: Decoupled Reward Policy Optimization. 💡 Key idea: decouple length optimization for correct and incorrect rollouts, so the model learns to be concise without punishing valid reasoning.
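The key idea can be illustrated with a toy sketch. This is not the DRPO objective from the paper, which the thread does not spell out. It is one hypothetical way to decouple the two signals: correctness is normalized over the whole group, while length only reweights the correct rollouts among themselves. The function name, the exponential weighting, and `tau` are all assumptions made for illustration.

```python
import math
from statistics import mean, pstdev

def toy_decoupled_advantages(rollouts, max_len, tau=0.5, eps=1e-8):
    """Illustrative only: length affects correct rollouts via positive weights."""
    # Correctness-only advantage, normalized over the whole group.
    flags = [1.0 if ok else 0.0 for ok, _ in rollouts]
    mu, sigma = mean(flags), pstdev(flags)
    base = [(f - mu) / (sigma + eps) for f in flags]

    # Length scores computed among correct rollouts only (shorter = larger).
    scores = [math.exp(-n / (tau * max_len)) for ok, n in rollouts if ok]
    if not scores:
        return base  # no correct rollout: nothing to reweight
    norm = mean(scores)

    out = []
    for (ok, n), b in zip(rollouts, base):
        if ok:
            # Positive weight, so a correct rollout's sign never flips.
            out.append(b * math.exp(-n / (tau * max_len)) / norm)
        else:
            # Incorrect rollouts keep the plain correctness advantage.
            out.append(b)
    return out

# Hypothetical group of rollouts for one prompt: (is_correct, token_count)
rollouts = [(True, 200), (True, 400), (True, 4000), (False, 300)]
toy_adv = toy_decoupled_advantages(rollouts, max_len=4000)

for (ok, n), adv in zip(rollouts, toy_adv):
    print(f"correct={ok!s:5} tokens={n:4d} advantage={adv:+.3f}")
```

Under these assumptions, shorter correct rollouts receive a larger positive signal than longer correct ones, and no correct rollout receives a negative one, which is the behavior "concise without punishing valid reasoning" describes.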

Tianbao (TB) Yang @yang_ML

2 months ago

@_vztu They will hire from China

Tianbao (TB) Yang @yang_ML

3 months ago

@ziv_ravid That was actually intended by the ICML organizers, to detect LLM-generated reviews.

Tianbao (TB) Yang @yang_ML

4 months ago

@docmilanfar So it is a local minimum

Tianbao (TB) Yang @yang_ML

4 months ago

@AndrewYNg People who have more friends will have a higher temperature parameter. The same principle applies to ML.

Tianbao (TB) Yang @yang_ML

4 months ago

@docmilanfar @theNAEng Congratulations!

Tianbao (TB) Yang @yang_ML

6 months ago

@ylecun @FrancoisChauba1 @agupta Texas A&M has 700+ H200 GPUs

Tianbao (TB) Yang @yang_ML

7 months ago

@AleksandraFaust @roydanroy @iclr_conf I learned that ICLR did have grad students serving as ACs this year. This information was found on a student’s website. I am not saying graduate students are necessarily unqualified to serve as AC. But if the selection process is not done well, there is no guarantee of fair decisions when they rest solely with the AC.

Tianbao (TB) Yang @yang_ML

7 months ago

@AleksandraFaust @roydanroy @iclr_conf Glad to hear that.

Tianbao (TB) Yang @yang_ML

7 months ago

@roydanroy I think they were probably nominated by senior ACs

Tianbao (TB) Yang @yang_ML

7 months ago

@ericxing Thanks for this great effort. It would be highly appreciated by the community. My group will definitely explore the released datasets and code.

Tianbao (TB) Yang @yang_ML

7 months ago

@roydanroy Yes, it actually happened for NeurIPS 2025.

yang_ML retweeted

Achleshwar Luthra @luthra_achal

7 months ago

Thank you so much for the shoutout!! @GalantiTomer If you’re excited to dig into why self-supervised contrastive learning works so well, come swing by our poster session! 📍 Exhibit Hall C/D/E — Poster #2607 🗓️ Fri, Dec 5 • 4:30–7:30 PM PST

Photo: https://t.co/KXPVplyFws

Tianbao (TB) Yang @yang_ML

7 months ago

2/2 Shouldn’t this be strictly prohibited from the outset? If such conflicts are not prevented, it becomes difficult to maintain trust in the review system, including designations like “oral paper” or “best paper.”

Tianbao (TB) Yang @yang_ML

7 months ago

1/2 Given the recent leakage of reviewer identities, some authors have reported that their papers were reviewed by people from the same institution or even the same research group. This raises serious concerns: how is such a conflict of interest even possible?
