How does it work?
When you send a prompt to Fusion, we fan it out to a panel of models in parallel, each with web search and bash tools enabled.
A judge model reads every response and extracts the structure: consensus points, contradictions, partial coverage, unique insights, blind spots.
Chatroom: https://t.co/0HkAUmBCJ7
NEW: malware developers added nuclear & biological weapons text to to their spyware.
Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner.
Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky.
When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit.
We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted.
In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation.
H/T to colleagues that shared this with me https://t.co/f3Aj9TYxU4
The UK keeps treating 1984 as a roadmap instead of a warning.
Every few years, same play: invoke "security", grab more surveillance power. This time it's a telescreen in your pocket.
This is what the UK spyware proposal means.
There must be government spyware on every mobile device. It shall watch everything that happens, including always watching the screen, looking for things the government disapproves of.
When anything is flagged by the software as something the government doesn't like, the software must block it from being sent or displayed (in realtime).
The user of the device must not be able to shut this watching and blocking off. The only way to shut it off would be to ask the government or its proxies to do so for you, at their discretion.
Therefore the whole device must be locked down. Administrator rights and the decision of what software or operating system to run or not to run must be taken from the owner/user and handed to the government and its proxies.
Apple and Google are themselves working hard to lock down the devices they are involved in to shut out competition and establish a duopoly.
The UK government says it is "working closely" with Apple and Google and currently they synchronise and coordinate their communication on this subject.
The UK government is now proposing to mandate what would otherwise be illegal anti-competitive practices.
@GrapheneOS on the Apple and Google duopoly:
https://t.co/rbRmcUDTRu
Statement from @signalapp
https://t.co/vJILcSrs4s
@ReclaimTheNetHQ on the state spyware:
https://t.co/3FCi06bP77
The government announcement:
https://t.co/ynYjR3DIRo
This is a super exciting release - Claude Fable 5 is the same underlying model as Mythos but with added safeguards. The benchmarks are great and it's SOTA on everything by a margin but I'll add that *qualitatively* also, this is a major-version-bump-deserving step change forward (imo of the same order as Claude 4.5 was in November), peaking especially for long problem-solving sessions on very difficult problems. You can give it a lot more ambitious tasks than what you're used to, the model "gets it" and it will just go, and it's never felt this tempting to stop looking at the code at all (but don't do this in prod!). The model still has quirks that people will run into and the safeguards are configured to be a little too trigger happy for launch, which can hopefully be tuned over time.
I feel a lot of things changing as working software increasingly comes out on a tap. The Jevon's paradox kicks in and I feel my own demand for software growing substantially. You can ask for anything - explainers, visualizers, dashboards, bespoke single-use apps (e.g. a full wandb that is hyper-specific just for your project), you can 10X your test suite, auto-optimize code, run giant research projects with custom HTML for the results, anything! "Free your mind" (Matrix ref). Really looking forward to all the things people build!
Anthropic is questioning whether AI may turn out to be altogether useless. This is the single most honest thing Anthropic has ever written.
“But achieving recursive improvement alone does not suggest an immediate change in how industrial production occurs, societies organize, or markets function. More intelligence can’t learn what a drug does over decades of use, can’t hold elections sooner than a constitution dictates, and can’t turn a stranger into an old friend in a weekend. For most people, the felt pace of this future will still be set by the bottlenecks, even if the laboratory upstream runs at the speed of compute. That collision, where recursive intelligence building itself ever faster meets the world of humans, relationships, and governance, is another part of this future we can’t predict.”
When money depends less on a single issuer, users gain more ways to hold and move value on their own terms.
@frankencoinzchf lets users mint and use ZCHF through open, oracle-free smart contracts.
Frankencoin is built on Ethereum.