Alex @alex_m - Twitter Profile

Alex

@Alex_m

39 minutes ago

@20x_panda @theo Intentionally sabotaging is different from silently degrading the performance

0

11

Alex

@Alex_m

about 2 hours ago

@DaveShapi @beffjezos Some are reporting that Fable is intentionally sabotaging their AI/ML work due to the silent trigger.

0

9

Alex

@Alex_m

about 2 hours ago

@QuixiAI Its not even about the silent degradation, it might even sabotage your work intentionally. I saw this a few hours ago, crazy if true. https://t.co/pEgyOFbXlf

Adam Hassan

@adamislucky

about 6 hours ago

Thoughts on Fable from a friend. Builders beware.

33

736

48

112

87K

0

4

Alex

@Alex_m

about 2 hours ago

@xolandar @adamislucky @theo Making the model “ineffective” silently is one thing, but having it intentionally sabotage a codebase when it was only told to audit it is a very different story.

1

4

0

83

Who to follow

Love is caring more about someone else’s well being than your own. ♡

Dave@WatergateBayAM

@WatergateBayAM

Wrecker, beach ranger, surfer, border collie-man. Volunteer. Personal account.

Alex

@Alex_m

about 2 hours ago

@AnthropicAI Hoping for another @deepseek_ai moment to stop this bullshit

0

66

0

1

983

Alex

@Alex_m

about 21 hours ago

@OfficialLoganK Its insane. It one shotted my session usage limit

2

55

0

3K

Alex

@Alex_m

1 day ago

@giffmana How would you know? It's silent degradation. This prompt might even be a trigger lol, I saw someone was flagged just for sending the word 'cyber'

1

21

0

1

4K

Alex

@Alex_m

1 day ago

@robinebers @bcherny Their $200/mo plans come with $2,000+ in equivalent API usage. Once Fable 5 goes API only after June 22, you're paying full rate for what used to be included. That's the 10x gap I'm talking about.

1

5

0

293

Alex

@Alex_m

1 day ago

@bcherny Hope this is more precise than the cyber risk classifier. Silent degradation is worse than a refusal, at least a refusal I can see. How do I debug code sabotaged by an invisible safeguard because it pattern matched a word or two related to LLM development? 😵‍💫

Alex_m's tweet photo. @bcherny Hope this is more precise than the cyber risk classifier. Silent degradation is worse than a refusal, at least a refusal I can see. How do I debug code sabotaged by an invisible safeguard because it pattern matched a word or two related to LLM development? 😵‍💫 https://t.co/ciYr8L27DT

1

4

0

1

483

Alex

@Alex_m

1 day ago

@VictorTaelin @Hangsiin I was about to mention you 🤣

0

1

0

237

Alex

@Alex_m

2 days ago

TL;DR 1. Build more capable AI, including an automated AI researcher. 2. Use AI to improve AI safety and alignment faster. 3. Turn frontier AI into practical tools people and organizations can use. 4. Make AI affordable, abundant, and easy to access globally. 5. Give individuals “personal AGI” to help with work, learning, business, care, and decisions. 6. Accelerate science, productivity, and economic growth using AI. 7. Distribute the benefits widely, avoiding concentration of power. 8. Create safety standards, public oversight, and international coordination to manage catastrophic risks and slow development if needed.

1

2

0

1

43

Alex

@Alex_m

7 days ago

@thsottiaux Something is wrong in the last 20 minutes. I asked it to find out where we are so far but instead of checking the project it ran pwd and gave me the current path??? Then while writing milestones it wrote the 1st one to an .md file and echoed the 2nd milestone to me in chat lol

0

2

0

1K

Alex

@Alex_m

7 days ago

@AnthropicAI Only caught 800 accounts? Holy f… that’s bad.

0

155

Alex

@Alex_m

12 days ago

@thsottiaux I try any frontier model the moment it drops. I give it a large task to complete then I sit and watch, only after that I judge if it’s any better for me at least.

0

92