Jennifer Martinez @jenmartinez - Twitter Profile

6 days ago

How might agentic coding tools, like Claude Code, alter the returns to expertise? In this report, we find evidence that domain expertise leads to better outcomes: more successful sessions and better recoveries after Claude struggles.

6

80

15

34

18K

Jennifer Martinez @jenmartinez

15 days ago

The vuln finding and fixing skills of Mythos get the most attention, but its autonomous exploitation skills are really where people should be focusing - this is why it marks such an immense turning point for cyber: https://t.co/ex2xuviQfW

0

82

Jennifer Martinez @jenmartinez

21 days ago

Just the beginning

Anthropic

@AnthropicAI

21 days ago

We’re expanding Project Glasswing. We’ve extended access to Claude Mythos Preview to approximately 150 additional organizations, based in more than fifteen countries. Read more about this expansion and our future plans for Project Glasswing: https://t.co/QrtHSBdRbh

340

4K

423

612

661K

0

2

0

112

jenmartinez retweeted

Lisan al Gaib

@scaling01

about 1 month ago

Claude Mythos absolutely destroys GPT-5.5 in ExploitBench and ExploitGym Mythos finds 18 arbitrary code execution exploits GPT-5.5 finds 0

scaling01's tweet photo. Claude Mythos absolutely destroys GPT-5.5 in ExploitBench and ExploitGym

Mythos finds 18 arbitrary code execution exploits
GPT-5.5 finds 0 https://t.co/qgFID2uIiL

78

1K

74

236

136K

Who to follow

Erin Mershon

@eemershon

Deputy Media Editor @NYTimes. Formerly @statnews, @CQNow, @POLITICO. I get jazzed about punchy ledes, ballet, and the NWSL. she/her 🏳️‍🌈

Michael Petricone

@mpetricone

Govt Affairs at CTA. Techno-Optimist. Citizen of DC and Red Sox Nation. All opinions = my own

Robert M. McDowell

@McDowellTweet

Random thoughts (personal only) of @DukeU & @WilliamandMary alum, former @fcc Commish, @cooleyllp partner & @hudsoninstitute Sr. Fellow. RT ≠ endorsement etc.

jenmartinez retweeted

Kapor Center

@KaporCenter

about 1 month ago

Congratulations again to #KaporCenter Co-Chair, #MitchKapor, a @Forbes Innovator 250 list honoree! Last night, Mitch was celebrated alongside the brightest innovators shaping the future of technology and more. From founding Lotus 1-2-3 to pioneering #GapClosing investing alongside Dr. Freada Kapor Klein, Mitch has spent decades proving that profit and purpose go hand in hand. #Forbes250 #A250

KaporCenter's tweet photo. Congratulations again to #KaporCenter Co-Chair, #MitchKapor, a @Forbes Innovator 250 list honoree!

Last night, Mitch was celebrated alongside the brightest innovators shaping the future of technology and more. From founding Lotus 1-2-3 to pioneering #GapClosing investing alongside Dr. Freada Kapor Klein, Mitch has spent decades proving that profit and purpose go hand in hand. #Forbes250 #A250

1

7

2

0

214

Jennifer Martinez @jenmartinez

about 1 month ago

This piece deserves attention - Mythos is the *first and only model**to solve this UK AISI cyber range (Cooling Tower). Also: When measuring time horizon tasks, Mythos Preview completes all six long tasks 100% of the time, **with a 2.5M token cap.** Other models do not.

AI Security Institute

@AISecurityInst

about 1 month ago

Mythos Preview also solved "Cooling Tower", our industrial control system range, in 3 of 10 attempts.

2

125

1

7

28K

0

2

0

114

jenmartinez retweeted

Newton Cheng

@newton_cheng

about 1 month ago

Two independent evals this week (XBOW and UK AISI) confirmed what my team has been seeing inside Project Glasswing: Claude Mythos Preview is a step change. The UK AISI analyzed the version of Mythos available at the launch of Project Glasswing and found it completed both of the AISI’s cyber ranges end-to-end, making it the first-ever model to do so! This is the start of an industry-wide response to address AI with powerful cyber capabilities. Planning to say more soon -- stay tuned!

0

22

2

5

2K

jenmartinez retweeted

Lisan al Gaib

@scaling01

about 1 month ago

The new version completely smashes GPT-5.5 and the previous Mythos version. Before Mythos Preview completed the cyber range 3 out of 10 times. The new version completed it 6 out of 10 times and is much more efficient!

scaling01's tweet photo. The new version completely smashes GPT-5.5 and the previous Mythos version.

Before Mythos Preview completed the cyber range 3 out of 10 times. The new version completed it 6 out of 10 times and is much more efficient! https://t.co/4zgVCifqiw

27

736

57

148

289K

jenmartinez retweeted

XBOW @Xbow

about 1 month ago

For the past 2 months, XBOW has been testing Mythos Preview under embargo as part of a select early-access group. Today, we can finally share what we found. The headline: Mythos Preview is a major advance. It is substantially better than prior models at finding vulnerability candidates, especially when source code is available. But it’s not perfect. We surfaced issues with exploit validation, judgment, and efficiency. Our full write-up covers where Mythos Preview shines, where it still needs support, and what we think this means for the future of offensive security: https://t.co/wPIhNeztO9

5

271

58

154

106K

Jennifer Martinez @jenmartinez

about 2 months ago

@drewpusateri @OpenAI missed u

0

1

0

71

Jennifer Martinez @jenmartinez

about 2 months ago

No one has quite dug into why Mythos is such a turning point like @nicoleperlroth did in this "Catch A Thief" episode with Anthropic security researcher Nicholas Carlini. If you think you've heard it all about Mythos and Project Glasswing, take a listen: https://t.co/df3SpGr34E

2

0

111

jenmartinez retweeted

Jack Clark

@jackclarkSF

2 months ago

Join me this Wednesday in SF for an event celebrating the new book from NPR's Planet Money team. We'll talk about the impact of AI on society, how we think about the future at Anthropic, and maybe read some of my Import AI writing. More info: https://t.co/NiZBnk9bTM

jackclarkSF's tweet photo. Join me this Wednesday in SF for an event celebrating the new book from NPR's Planet Money team. We'll talk about the impact of AI on society, how we think about the future at Anthropic, and maybe read some of my Import AI writing. More info: https://t.co/NiZBnk9bTM https://t.co/TAhgX3Z8PO

6

59

7

16

11K

jenmartinez retweeted

Logan Graham

@logangraham

3 months ago

Privileged to help lead this. Thankful to our partners. Mythos is an extraordinary model. But it is not about the model. It's about what the world needs to do to prepare for a future of models that are extremely good at cybersecurity. This is the start.

54

1K

52

93

130K

Jennifer Martinez @jenmartinez

3 months ago

Big. Congrats @logangraham and many others!

Anthropic

@AnthropicAI

3 months ago

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. https://t.co/NQ7IfEtYk7

2K

44K

7K

16K

31M

0

5

0

1

137

Jennifer Martinez @jenmartinez

3 months ago

@WillOremus @TheAtlantic Congrats, Oremus!

0

1

0

39

Jennifer Martinez @jenmartinez

3 months ago

@jasminewsun @TheAtlantic The best news - congrats!

0

1

0

27

Jennifer Martinez @jenmartinez

3 months ago

Thanks for the discussion @ashleyrgold on our latest Economic Index report

Axios Comms

@AxiosComms

3 months ago

Ashley Gold joined @AdrianaDiaz and @vladduthiersCBS to break down how AI fluency could shape America’s next class divide. 📺: @CBSNews | @ashleyrgold | @axios

0

1

681

0

6

0

1

148

jenmartinez retweeted

Anthropic

@AnthropicAI

3 months ago

New from the Anthropic Economic Index: how people’s use of Claude changes with experience. Longer-term users are more likely to iterate carefully with Claude, and less likely to hand it full autonomy. They attempt higher-value tasks, and receive more successful responses.

AnthropicAI's tweet photo. New from the Anthropic Economic Index: how people’s use of Claude changes with experience.

Longer-term users are more likely to iterate carefully with Claude, and less likely to hand it full autonomy. They attempt higher-value tasks, and receive more successful responses. https://t.co/yCsrA0bLt9

247

3K

251

876

351K

jenmartinez retweeted

Axios Comms

@AxiosComms

3 months ago

Jim VandeHei joined Morning Joe to discuss the latest on DHS and his and @mikeallen's column how AI fluency could shape America’s next class war. 📺: @Morning_Joe | @JimVandeHei | @axios

0

1

2

0

627

Jennifer Martinez @jenmartinez

3 months ago

Featuring insights from the latest @AnthropicAI Economic Index report by @PeterMcCrory and team 👇🏽

Axios @axios

3 months ago

BEHIND THE CURTAIN: AI's gains won’t be evenly distributed. In fact, it's creating a new form of economic inequality: AI fluency. https://t.co/ySpeIhxlVS

3

20

11

13

12K

0

1

0

139

Jennifer Martinez

@jenmartinez

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users