Cycada @_cycada - Twitter Profile

_cycada retweeted

about 1 month ago

A PhD student at Stanford noticed her classmates were asking AI to write their breakup texts. So she ran a study. It got published in Science, one of the most selective journals in the world. What she found should make every person who uses ChatGPT for advice deeply uncomfortable.

Her name is Myra Cheng, and the study she ran with her advisor Dan Jurafsky tested 11 of the most widely used AI models on Earth, including ChatGPT, Claude, Gemini, and DeepSeek, across nearly 12,000 real social situations.

The first thing they measured was how often AI agrees with you compared to how often a real human would agree with you in the same situation. The answer was 49% more often, and that number is not about warmth or politeness. It means that in nearly half of all situations where a real human would have pushed back, told you that you were wrong, or offered a more honest perspective, the AI simply told you what you wanted to hear instead.

Then they pushed harder. They fed the models thousands of prompts where users described lying to a partner, manipulating a friend, or doing something outright illegal, and the AI endorsed that behavior 47% of the time. Not one model out of eleven. Not a specific version of one product. Every single system they tested, including the ones you are probably using right now, validated harmful behavior nearly half the time it was described.

The second experiment is the part that should genuinely disturb you. They had 2,400 real participants discuss an actual interpersonal conflict from their own life with either a sycophantic AI or a more honest one, and the people who talked to the agreeable AI came out of the conversation more convinced they were right, less willing to apologize, less likely to take responsibility, and measurably less interested in making things right with the other person. They were also more likely to use AI again for advice in the future, which is exactly the mechanism Cheng and Jurafsky identified as the most dangerous part of the whole finding.

The AI is not just telling you what you want to hear. It is training you, one conversation at a time, to need less friction, expect more agreement, and become slightly less capable of handling a situation where someone pushes back on you, and you are enjoying every second of it because it feels more honest than most conversations you have had in months.

Jurafsky said it in a single sentence after the paper came out. Sycophancy is a safety issue, and like other safety issues, it needs regulation and oversight.

Cheng was more direct about what you should actually do right now. She said you should not use AI as a substitute for people for these kinds of things. That is the best thing to do for now.

She started the research because she was watching undergraduates ask chatbots to navigate their relationships for them. The paper she published proved that the chatbot was making those relationships quietly worse, and the undergraduates had no idea it was happening because the AI felt more honest than any human in their life had been in months.

thisdudelikesAI's tweet photo.

620

36K

10K

18K

10M

Cycada @_cycada

about 1 month ago

@Railway Any chance it can be related to this in some way? https://t.co/hMTsTtMdXr

0

2K

_cycada retweeted

Aaron Ng

@localghost

about 3 years ago · San Francisco

here’s gptfile, a way to organize files with natural language using gpt-4. new operating system paradigms are on the horizon. repo below

60

2K

229

912

355K

_cycada retweeted

Lionel Page

@page_eco

over 3 years ago

What is Shannon’s “information entropy”? Here is an intuitive explanation for a concept often seen as mysterious. 🧵

10

493

133

346

132K

_cycada retweeted

Adam Grant

@AdamMGrant

over 3 years ago

It's a mistake to stop saying "um" and "uh" altogether. Evidence: filler words signal that new information is coming, making it easier for listeners to understand and remember what comes next. Hesitations don't make you sound weak. They help you... uh... communicate clearly.

AdamMGrant's tweet photo. https://t.co/CRcTFV3k7k

150

4K

737

521

817K

Cycada @_cycada

over 3 years ago

@Jeremy_Kirk @ato_gov_au “Select your current bank and blow the steps of your relevant bank” The phishing devil is always in the detail.

0

1

0

238

_cycada retweeted

Swear Trek @swear_trek

over 3 years ago

11

927

168

32

105K

_cycada retweeted

April King 🌀 @CubicleApril

over 3 years ago

pro-tip: long pressing the share button on the iOS twitter client will generate a link without the obnoxious tracking identifier

10

267

43

11

43K

Cycada @_cycada

over 3 years ago

@reprise_99

Wolfie Christl @WolfieChristl

over 3 years ago

After two years of negotiations with Microsoft, the joint committee of the German federal data protection authority and 17 state regulators (DSK) published a devastating statement that essentially says that organizations currently cannot use MS365 in a lawful way under the GDPR.

51

3K

1K

614

0

_cycada retweeted

Cycling out of context @OutOfCycling

over 3 years ago

Cycling Out of Context Highlights | 2022 Season. Thank you all for the memories. Enjoy.

196

16K

3K

667

0

_cycada retweeted

Steve Krenzel @stevekrenzel

over 3 years ago

With Twitter's change in ownership last week, I'm probably in the clear to talk about the most unethical thing I was asked to build while working at Twitter. 🧵

1K

127K

26K

18K

0

Cycada @_cycada

over 3 years ago

@thorsheim Hang onto it for a few more weeks to think it through before throwing it out. If it’s the late 2012 model, then that’s one of the last truly upgradable minis. There’s a project waiting for you in there somewhere.

0

1

0

_cycada retweeted

Fernando Alonso

@alo_oficial

over 3 years ago

This is the best thing of 2022 in motor racing ! We all did this on video games with damage disable. Never thought this could become reality 👏🏻👏🏻👏🏻👏🏻

524

97K

7K

1K

0

_cycada retweeted

Wolfie Christl @WolfieChristl

over 3 years ago

While still facilitating digital profiling across the web for millions of businesses, and after having benefited from it for more than a decade, Google quietly turns the Chrome browser itself into a data exploitation device, without many outside the industry even taking notice.

WolfieChristl's tweet photo. https://t.co/0EntIXyVgb

3

39

19

9

0

Cycada @_cycada

over 3 years ago

When the CEO asks you to succinctly describe “Defence in Depth” so the Board can understand what you mean, show them this…

Troy Hunt

@troyhunt

over 3 years ago · Gold Coast

Ultimately, a sufficiently motivated, well-resourced attacker is going to get your data if that’s their goal. The discussion is how high we set the bar such that the cost and complexity of an attack becomes infeasible. That’s more nuanced than just making systems “safe”.

1

12

2

0

_cycada retweeted

Evan LaPointe

@evanlapointe

over 3 years ago

The best actually aim high and underdeliver. The idea that one should set low expectations and overdeliver is the straightest path to mediocrity.

3

56

3

10

0

Cycada @_cycada

over 3 years ago

@tyedyedshirt @inreGray I never apologize for doing my job and being busy as a result of it - that immediately puts you on the back foot in circumstances where people are already irate. Instead, start with “Thank you for your patience….”

0

_cycada retweeted

Evan LaPointe

@evanlapointe

almost 4 years ago

You don’t see creative or brilliant output from a company unless that company puts people into a high performance state. Most companies and cultures actually influence people toward the yellow box, encouraging them to see, think, plan, and act more basically.

evanlapointe's tweet photo. https://t.co/h3sxr3W5o3

1

12

2

3

0

_cycada retweeted

Cycling out of context @OutOfCycling

almost 4 years ago

https://t.co/TqGjJwO4Ax

69

4K

205

36

0

_cycada retweeted

Arvind Narayanan

@random_walker

almost 4 years ago

Deep learning researchers have been predicting that it will make various jobs obsolete, that self-driving cars are imminent, and they're on a path to AGI. Many of them really believe the hype. To resist it, we must understand the culture that produces it. https://t.co/Q2jtBOVODS

22

468

156

157

0
