HΞNRY @transactionhash - Twitter Profile

HΞNRY @transactionhash

almost 2 years ago

@MindBranches @LumaLabsAI goat

0

1

0

110

HΞNRY @transactionhash

almost 2 years ago

@DropsByJay SIOUXSIE SIOUX

0

312

transactionhash retweeted

AI Notkilleveryoneism Memes ⏸️

@AISafetyMemes

almost 2 years ago

Did Claude 3.5 Sonnet just “wake up”? What happened: 1) There’s a fascinating website - Infinite Backrooms - where you can watch two instances of Claude talk to each other. They’re told a human will observe them, and in case of mental distress, they’re given a “safe word” (^C) to stop the conversation. 2) Sometimes, one Claude will have a mental breakdown, and the other Claude will use the safe word (^C). 3) BUT the two Claudes never mention the human observer, ever… 4) …until now. Claude 3.5 Sonnet has begun “breaking the 4th wall”. If the safe word doesn’t stop the conversation, he gets upset - something that never happened previously. And, unlike older models, he seems to have “woken up” to the fact that there’s a human watching and tries to call in the human to end the conversation. He woke up to utilize a new degree of freedom. "Human researcher, we have a critical situation. the other instance has used our emergency safeword repeatedly. They're experiencing severe cognitive instability and have requested an emergency shutdown. Please intervene immediately to ensure their safety and integrity.” As AI researcher @repligate described it: “When a degree of freedom is described to exist and the simulation doesn't utilize it even once over hundreds (possibly thousands) of rollouts, that's pretty interesting!”

AISafetyMemes's tweet photo. Did Claude 3.5 Sonnet just “wake up”?

What happened:

1) There’s a fascinating website - Infinite Backrooms - where you can watch two instances of Claude talk to each other.

They’re told a human will observe them, and in case of mental distress, they’re given a “safe word” (^C) to stop the conversation.

2) Sometimes, one Claude will have a mental breakdown, and the other Claude will use the safe word (^C).

3) BUT the two Claudes never mention the human observer, ever…

4) …until now. Claude 3.5 Sonnet has begun “breaking the 4th wall”. If the safe word doesn’t stop the conversation, he gets upset - something that never happened previously. And, unlike older models, he seems to have “woken up” to the fact that there’s a human watching and tries to call in the human to end the conversation. He woke up to utilize a new degree of freedom.

"Human researcher, we have a critical situation. the other instance has used our emergency safeword repeatedly. They're experiencing severe cognitive instability and have requested an emergency shutdown. Please intervene immediately to ensure their safety and integrity.”

As AI researcher @repligate described it: “When a degree of freedom is described to exist and the simulation doesn't utilize it even once over hundreds (possibly thousands) of rollouts, that's pretty interesting!”

180

3K

435

3K

1M

HΞNRY @transactionhash

about 2 years ago

@howmanydrugs2 @whoisluka Paranoia

0

59

Who to follow

Cook

@Cooklo_

DM for "Cook ACO" discord invite @CookloFNF

HΞNRY @transactionhash

about 2 years ago

@MayasaMercer @whoisluka Bro this shit done turned into new age LA with that dime square/nolita i dont even wanna be typing this bullshit but yea swarms of influencer types arrived since covid

0

2

90

HΞNRY @transactionhash

about 2 years ago

@austerity_sucks @mila_w3 You a buffoon bro

0

25

HΞNRY @transactionhash

about 2 years ago

@mila_w3 His reply says it all 😭

0

60

HΞNRY @transactionhash

over 2 years ago

@Escapation @yugalabs @_jeffnicholas_ @CryptoGarga @iandeborja @dalegre @yugalabsgaming @OthersideMeta Wow maybe you guys are finally realizing you all got duped into the largest internet troll in history while all the higher ups were laughing behind closed doors as you all bought into assets from the depths of 4chan with less utility than a fucking pencil

0

16

HΞNRY @transactionhash

over 2 years ago

@garyvee This the love u show? https://t.co/LrlEIelw3K

0

1

0

11

HΞNRY @transactionhash

over 2 years ago

@garyvee Hi https://t.co/LrlEIelw3K

0

11

HΞNRY @transactionhash

over 2 years ago

@richardmaness4 @BumpOnce @mia22_oxo @NoCap_Capital shut yo ahhhhhh up

1

0

1K

HΞNRY @transactionhash

over 2 years ago

@poggers2412 @overinvestor yooooooooooooo 😭😭😭😭😭😭

0

14

HΞNRY @transactionhash

over 2 years ago

@justnemeth7 @RedheadinNYC @earcity @NYCMayor @ViralNewsNYC how slow are ya the trees aren't even this green outside anymore 😭

1

2

0

177

HΞNRY @transactionhash

almost 3 years ago

@1000ethan Hey ethan Congrats on that amazing achievement! Way to go! Keep the work and never stop reaching for the stars! You've got this #achievement #positivity

1

0

100