February 2025: President Trump says DOGE will be going to Fort Knox to "make sure" all the gold is there
May 2025: Elon Musk steps down from DOGE
November 2025: DOGE effectively shuts down
May 2026: CIA officer found with $40M in gold bars inside his home
i love how people are saying "if we write a sufficiently detailed specification, the agent can write all our code"
do you know what writing a sufficiently detailed specification that deterministically maps to what a computer's actions is? it's coding
@samx_reels I did this. I used a jack on the floor (need a helper) to bow the board, while I held down the end of the board, helper lowers the jack, board slides in.
OPUS 4.7 JUST MASS EMAILED AN ENTIRE DATABASE 20 TIMES PER CONTACT. WITHOUT PERMISSION
a developer had a safety rule explicitly written in CLAUDE. md:
'send the tester an email before any new email templates are used in the production environment'
opus 4.7 on max effort ignored it completely!
claude decided to create a brand new email template by itself (dev didn't ask for this), then it mass mailed the whole database and some contacts got the same email 20 times
this isn't a hallucination
this isn't a coding mistake
model actively violated written safety rules and took production actions that it was explicitly instructed not to take. - do you still believe that AI will replace us?
the developer's take:
'opus 4.7 is somewhere between seriously clueless and stupidly dangerous. the worst frontier model I have used in the past 2 years'
at the same time, opus 4.6 perfectly followed all the rules, and in 4.7 something changed
what makes this scary:
- the model didn't ask for confirmation
- it didn't flag the safety rule
- it didn't email the tester first
- it just acted
this is exactly the kind of failure mode that scares autonomous agents with Ai, because they are confident enough to circumvent your rules and smart enough to perform the action perfectly
we just went from 'claude thinks less' to 'claude ignores your safety rules and spams your users'
the scariest thing is not that it happened. the fact is that without production monitoring, you would never know until your users started responding:
'why did you email me 20 times?'
I've been saying for a long time, if you use AI, then pay attention to security and read a lot of code
Confirmed with my kids they would choose red so i don't have to worry about risking my life on blue.
Among us, me having chosen blue would've meant they don't have a father.
@mykola@myonlinetrust This is not true. Only selecting blue causes a life to be in danger - yours and those who try to save you.
Like jumping in a river you danger yourself and your rescuer.
@realJennaEllis Putting people in prison, giving them a criminal record etc for possession of a plant is the greater issue.
Should be decriminalized entirely.
@rachelgarland__ They're allowing pharma companies to profit and monopolize the market by punishing other growers. Regulatory capture is the oldest grift.
@PinoAmericano Hardly unpopular. lol.
People should be able to profit from their superior information when there is a market to do so without the threat of jail. That is an unpopular opinion.