How might agentic coding tools, like Claude Code, alter the returns to expertise?
In this report, we find evidence that domain expertise leads to better outcomes: more successful sessions and better recoveries after Claude struggles.
The vuln finding and fixing skills of Mythos get the most attention, but its autonomous exploitation skills are really where people should be focusing - this is why it marks such an immense turning point for cyber: https://t.co/ex2xuviQfW
We’re expanding Project Glasswing. We’ve extended access to Claude Mythos Preview to approximately 150 additional organizations, based in more than fifteen countries.
Read more about this expansion and our future plans for Project Glasswing: https://t.co/QrtHSBdRbh
Congratulations again to #KaporCenter Co-Chair, #MitchKapor, a @Forbes Innovator 250 list honoree!
Last night, Mitch was celebrated alongside the brightest innovators shaping the future of technology and more. From founding Lotus 1-2-3 to pioneering #GapClosing investing alongside Dr. Freada Kapor Klein, Mitch has spent decades proving that profit and purpose go hand in hand. #Forbes250 #A250
This piece deserves attention - Mythos is the *first and only model**to solve this UK AISI cyber range (Cooling Tower). Also: When measuring time horizon tasks, Mythos Preview completes all six long tasks 100% of the time, **with a 2.5M token cap.** Other models do not.
Two independent evals this week (XBOW and UK AISI) confirmed what my team has been seeing inside Project Glasswing: Claude Mythos Preview is a step change. The UK AISI analyzed the version of Mythos available at the launch of Project Glasswing and found it completed both of the AISI’s cyber ranges end-to-end, making it the first-ever model to do so! This is the start of an industry-wide response to address AI with powerful cyber capabilities.
Planning to say more soon -- stay tuned!
The new version completely smashes GPT-5.5 and the previous Mythos version.
Before Mythos Preview completed the cyber range 3 out of 10 times. The new version completed it 6 out of 10 times and is much more efficient!
For the past 2 months, XBOW has been testing Mythos Preview under embargo as part of a select early-access group.
Today, we can finally share what we found.
The headline: Mythos Preview is a major advance. It is substantially better than prior models at finding vulnerability candidates, especially when source code is available.
But it’s not perfect. We surfaced issues with exploit validation, judgment, and efficiency.
Our full write-up covers where Mythos Preview shines, where it still needs support, and what we think this means for the future of offensive security: https://t.co/wPIhNeztO9
No one has quite dug into why Mythos is such a turning point like @nicoleperlroth did in this "Catch A Thief" episode with Anthropic security researcher Nicholas Carlini. If you think you've heard it all about Mythos and Project Glasswing, take a listen: https://t.co/df3SpGr34E
Join me this Wednesday in SF for an event celebrating the new book from NPR's Planet Money team. We'll talk about the impact of AI on society, how we think about the future at Anthropic, and maybe read some of my Import AI writing. More info: https://t.co/NiZBnk9bTM
Privileged to help lead this. Thankful to our partners.
Mythos is an extraordinary model. But it is not about the model. It's about what the world needs to do to prepare for a future of models that are extremely good at cybersecurity.
This is the start.
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software.
It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans.
https://t.co/NQ7IfEtYk7
New from the Anthropic Economic Index: how people’s use of Claude changes with experience.
Longer-term users are more likely to iterate carefully with Claude, and less likely to hand it full autonomy. They attempt higher-value tasks, and receive more successful responses.
Jim VandeHei joined Morning Joe to discuss the latest on DHS and his and @mikeallen's column how AI fluency could shape America’s next class war.
📺: @Morning_Joe | @JimVandeHei | @axios
BEHIND THE CURTAIN: AI's gains won’t be evenly distributed.
In fact, it's creating a new form of economic inequality: AI fluency. https://t.co/ySpeIhxlVS