Andre Ye @andreiskiii - Twitter Profile

13 days ago

Super exciting work from my friend @RyanBoldi! From an HCI POV, I’m especially excited about how RL on multiple objectives might make models more socially intelligent while avoiding pitfalls of optimizing on one narrow objective (e.g., sycophancy from RLHF)

Ryan Bahlous-Boldi

@RyanBoldi

13 days ago

Your RL post-training may be sabotaging your LLM’s test-time scaling! Conventional RL pretends that you can collapse all reward signals *upfront* into a single *scalar reward*. We introduce Vector Policy Optimization (VPO), which natively maximizes *vector-valued* rewards, boosting test time search performance, even on the original scalar.

RyanBoldi's tweet photo. Your RL post-training may be sabotaging your LLM’s test-time scaling!

Conventional RL pretends that you can collapse all reward signals *upfront* into a single *scalar reward*.
We introduce Vector Policy Optimization (VPO), which natively maximizes *vector-valued* rewards, boosting test time search performance, even on the original scalar.

35

846

120

783

210K

0

6

0

2

438

Andre Ye @andreiskiii

13 days ago

@RyanBoldi Let’s go Ryan 🔥🔥🔥

0

1

0

324

Andre Ye @andreiskiii

27 days ago

Give it a read and let me know what you think! https://t.co/INfmgRG3PL

0

93

Andre Ye @andreiskiii

27 days ago

Sycophancy, disempowerment, homogenization of thought: lots to be grim about for what AI is doing to us, the collapse of our subjectivity into a machine "objectivity". But a lot of AI's value seems to come precisely from scaling this objectivity. How do we make sense of this?

andreiskiii's tweet photo. Sycophancy, disempowerment, homogenization of thought: lots to be grim about for what AI is doing to us, the collapse of our subjectivity into a machine "objectivity". But a lot of AI's value seems to come precisely from scaling this objectivity. How do we make sense of this? https://t.co/AuGpMXRySu

1

24

3

7

1K

Who to follow

Sjoerd van Steenkiste

@vansteenkiste_s

Research Scientist @GoogleDeepMind. Agents / World models / Gemini.

🫥

@lostinLionel

LeoMessi 👑🐐 | Aitana Bonmatí 🫶🏻 | Visca Barça i Visca Catalunya 💙❤️ | Linkin Park 🤍 | 🇦🇷🇪🇦

Kelvin Kesse-Kobbina

@KelvinKesse4

All things Development and Design

Andre Ye @andreiskiii

27 days ago

In the picture I lay out, we need work both *within* norms and work *on* norms. We've already thought a lot about how AI can help us work *within* norms, since that objective was more easily definable. There is more to be done on AI that helps us work *on* norms.

1

0

1

133

Andre Ye @andreiskiii

27 days ago

If you're interested in additional perspectives on this work, check out @JennyHuang99's blogpost on "slow AI" https://t.co/lUJ2vsoDdZ and my blogpost on AI for "work *on* norms" https://t.co/INfmgRG3PL

0

2

0

155

Andre Ye @andreiskiii

27 days ago

“Should I fear death?” Ask an LLM and you get one answer or a big bag, but little visibility into the decisions and assumptions that produced them. We built the "conceptual multiverse": a system that makes those decisions transparent and intervenable. https://t.co/oQlWs0KFHu

andreiskiii's tweet photo. “Should I fear death?” Ask an LLM and you get one answer or a big bag, but little visibility into the decisions and assumptions that produced them. We built the "conceptual multiverse": a system that makes those decisions transparent and intervenable. https://t.co/oQlWs0KFHu https://t.co/UOlTJWSEPf

1

39

9

19

6K

Andre Ye @andreiskiii

27 days ago

Thank you so much to my incredible collaborators @JennyHuang99, @upcycledwords, Rose Novick, @ta_broderick, and @mitchellgordon!

1

2

0

175

Andre Ye @andreiskiii

about 1 month ago

Check this blogpost out! I think this is a really exciting and important direction to be thinking about.

jenny huang @JennyHuang99

about 1 month ago

recently, i’ve been thinking about ways to design ai systems to be more compatible with slow thinking 🐌. you can check out the full blogpost here 🤗: https://t.co/3hdYCIpuoN

JennyHuang99's tweet photo. recently, i’ve been thinking about ways to design ai systems to be more compatible with slow thinking 🐌.

you can check out the full blogpost here 🤗:
https://t.co/3hdYCIpuoN https://t.co/ikAZOGamet

4

167

21

145

12K

0

6

0

1

252

andreiskiii retweeted

Elinor @elinorpd_

3 months ago

There's been a lot of excitement about pluralistic value alignment 🌈 — AI that reflects the full range of human perspectives But no formal way to benchmark whether we're actually making progress. 🤔 Introducing 𝐎𝐕𝐄𝐑𝐓𝐎𝐍𝐁𝐄𝐍𝐂𝐇. 🎉Accepted to #ICLR2026 1/n 🧵

elinorpd_'s tweet photo. There's been a lot of excitement about pluralistic value alignment 🌈 — AI that reflects the full range of human perspectives

But no formal way to benchmark whether we're actually making progress. 🤔

Introducing 𝐎𝐕𝐄𝐑𝐓𝐎𝐍𝐁𝐄𝐍𝐂𝐇. 🎉Accepted to #ICLR2026

1/n 🧵 https://t.co/ClNv2ZQcDR

3

117

18

60

21K

Andre Ye @andreiskiii

4 months ago

@joshmpollock Large?

0

30

Andre Ye @andreiskiii

8 months ago

@ArnavVerma0_0 sigg bar chart yo

0

1

0

262

andreiskiii retweeted

Allen School @uwcse

11 months ago

“Technical computer science savvy and deep philosophical commitments”: @UW #UWAllen alum @andreiskiii was named the @UWArtSci Dean’s Medalist in Social Sciences for his campus leadership and research contributions spanning #AI and philosophy. #UWdiscovers https://t.co/FJ577PExJx

0

12

3

1

3K

Andre Ye

@andreiskiii

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users