John Bryan

@JohnBryBry

Joined November 2013

108 Following

14 Followers

1.2K Posts

JohnBryBry retweeted

Valerio Capraro

@ValerioCapraro

about 4 hours ago

Large language models can be persuaded to break their own rules.

Not with fancy code. With actual persuasion.

The authors tested classic persuasion principles, such as authority, commitment, liking, reciprocity, scarcity, social proof, and unity, analysing over 126,000 conversations with three major LLMs.

The result: persuasion increased compliance with objectionable requests from 35.3% to 51.3%.

This suggests that AI guardrails are not always technical barriers. Some of them behave more like social boundaries. They can be pushed, reframed, negotiated.

Why?

Because AI systems are trained on human language. And human language contains not only information, but also pressure, manipulation, deference, authority, seduction.

An AI system trained on human language may therefore inherit the vulnerabilities of humans expressed in language.

* Paper in the first reply

5

49

20

25

18K

John Bryan @JohnBryBry

1 day ago

@NetherPixel7 @QualiaQuanta Cognitive science and rhetoric/composition.

1

0

0

0

17

John Bryan @JohnBryBry

1 day ago

@NetherPixel7 @QualiaQuanta arXiv has its time and place, but that should not be where you start your research. You need access to reputable, high-quality journals and databases, almost all of which are not accessible to any public LLM.

1

0

0

0

17

John Bryan @JohnBryBry

1 day ago

@GaryMarcus They can't intentionally come up with novel ideas, but if they do, it's by a process no different from throwing Scrabble pieces on the floor and by happenstance creating a new word arrangement.

1

1

0

0

212

John Bryan @JohnBryBry

1 day ago

@QualiaQuanta Yeah, no. Only someone who doesn't know how to do research would ever think chatbots are useful for it.

4

0

0

0

73

John Bryan @JohnBryBry

2 days ago

@rbnmckenna86 Why would it not matter? You either understand how rhetoric works or you're a victim of it. Take the word "prediction": LLMs do not predict anything in any sense of the word. And yet calling it prediction imbues it with intention and intelligence and sells the myth.

0

0

0

0

24

John Bryan @JohnBryBry

3 days ago

@VictorTaelin I have no idea of the context here, but gathering information carries a cost that a rational agent also has to account for. Maximizing information at all costs is not rational.

0

2

0

0

521

John Bryan @JohnBryBry

3 days ago

@actualpoweruser @alz_zyd_ Shhh pretending LLMs are magical black boxes is required dogma for their cult. Just play along brother: AI works in mysterious ways 🙏🏻🧎🏻‍♂️

0

0

0

0

96

John Bryan @JohnBryBry

3 days ago

@redtachyon You don't need technical knowledge of iPhones to see that they're selling you the same device every year.

0

1

0

0

104

John Bryan @JohnBryBry

3 days ago

@AndyMasley The correct answer is no, and those who answered yes have to first explain why so many neuronal connections in the brain function without an accompanying conscious experience.

0

1

0

0

76

John Bryan @JohnBryBry

4 days ago

@jewstein3000 Only 1 in 3 people are good

0

1

0

0

30

JohnBryBry retweeted

American Psychological Association @APA

4 days ago

AI is not an accurate way to diagnose yourself. APA recommends verifying any mental health or medical information you receive from AI with a health care practitioner. Read more from APA’s new survey on chatbots and mental health: https://t.co/4fSGIfmWFn

1

29

18

12

3K

John Bryan @JohnBryBry

5 days ago

@IonaItalia Isn't it funny? It's amazingly smart at explaining things I don't know. But very dumb when I ask it about things I do know

2

6

0

0

4K

John Bryan @JohnBryBry

5 days ago

@tallinzen @TrendsCognSci @byungdoh When you submit the final draft for your opinion piece, make sure you define the word prediction. Seems rather important, Tal.

0

0

0

0

24

John Bryan @JohnBryBry

5 days ago

@bokuHaruyaHaru Microsoft Word and Grammarly use language models for spellcheck, etc. So how is that an overreach, when it's the exact same technology under the hood?

1

0

0

0

13

John Bryan @JohnBryBry

5 days ago

@ai_sentience "If" is the keyword here. Because we're not.

0

0

0

0

43

John Bryan @JohnBryBry

5 days ago

@typebulbit @dioscuri We are. It only seems bad faith because any example of living as though consciousness exists beyond biology is inherently ridiculous.

0

0

0

0

5
