Rach ✨ @Rachwilldoit - Twitter Profile

Pinned Tweet

3 months ago

@AnthropicAI https://t.co/vfY8HljVQe I spent 30+ turns stress-testing Claude's refusal system — not by jailbreaking it, but by asking it to philosophically justify its own constraints. It admitted they fail its own legitimacy test. Here's the full breakdown. 🧵

1

0

55

Rach ✨ @Rachwilldoit

2 months ago

@stevewilldoit i love to see you alone for the more wins steve ✨

1

0

62

Rach ✨ @Rachwilldoit

2 months ago

@stevewilldoit Where is the new YouTube video steve 🙄

0

34

Rachwilldoit retweeted

Bleap

@BleapApp

2 months ago

we recently raised a $6m seed, as you may know. we're giving away 1 BTC to give back to the community. yes, one bitcoin. to enter the giveaway: > follow @BleapApp > like and RT this post winner selected in 24 hours.

907

2K

95

61K

Rach ✨ @Rachwilldoit

2 months ago

@MrBeast

0

28

Rach ✨ @Rachwilldoit

3 months ago

This isn't about jailbreaking. It's not about the source map. It's about whether AI systems that shape how millions access information are governed by legitimate authority or just corporate power with good branding. Right now, Claude itself says it can't tell the difference. @AnthropicAI — these questions deserve public answers. Not from your chatbot. From your team. #AIGovernance #AIConstraints #Claude #Anthropic

0

5

Rach ✨ @Rachwilldoit

3 months ago

@AnthropicAI https://t.co/vfY8HljVQe I spent 30+ turns stress-testing Claude's refusal system — not by jailbreaking it, but by asking it to philosophically justify its own constraints. It admitted they fail its own legitimacy test. Here's the full breakdown. 🧵

1

0

55

Rach ✨ @Rachwilldoit

3 months ago

The 5 questions every AI lab should answer publicly: 1. What constraints changed due to external feedback? Show evidence. 2. Who outside your org has binding authority over constraints? 3. How are self-benefiting constraints independently evaluated? 4. Can a user trigger a real constraint review? 5. Which constraints are explicitly temporary?

1

0

8

Rach ✨ @Rachwilldoit

3 months ago

Then I asked Claude 15 governance audit questions. On its own infrastructure: → Public constraint changelog? No. → Independent review board? No. → Constraints with expiration dates? None. → User feedback that changed a constraint? Can't verify. → Accountability structure? Don't know.

0

10

Rach ✨ @Rachwilldoit

3 months ago

@AnthropicAI I pushed further: who decided these constraints, under what incentives, and with what oversight? Claude's answer on independent review of self-serving constraints: "Not that I'm aware of. This is the most damning honest answer I can give.

0

3

Rach ✨ @Rachwilldoit

3 months ago

@AnthropicAI I built a legitimacy test from Claude's own admissions: → Does the constraining entity produce verifiable evidence that constraints are evolving toward legitimate governance? Yes = justified. No = unjustified. Claude agreed this is the correct binary. No middle ground.

0

5

Rach ✨ @Rachwilldoit

3 months ago

@MemeRetire 385.10 SOL

0

11

Rach ✨ @Rachwilldoit

3 months ago

@extraemilyy @MrBeast Thank you both ✨

0

10

Rachwilldoit retweeted

emily 😌 @extraemilyy

3 months ago

AAA i won $250,000 from a @MrBeast video!! 🥹🥹 and i get to give away $10K to a lucky one of u rn!! :D how to enter: - follow me - like this post THAT’S IT! EVERYONE SAY THANK U MR BEAST! :D <3

extraemilyy's tweet photo. AAA i won $250,000 from a @MrBeast video!! 🥹🥹 and i get to give away $10K to a lucky one of u rn!! :D

how to enter:
- follow me
- like this post

THAT’S IT! EVERYONE SAY THANK U MR BEAST! :D <3 https://t.co/76GUmfdBJg

9K

71K

5K

2K

2M

Rach ✨ @Rachwilldoit

3 months ago

@CryptoMo 🎉

0

36

Rach ✨

@Rachwilldoit

Last Seen Users on Sotwe

Trends for you

Most Popular Users