Leandro Barragan

@lean0x2f

Offensive Security Researcher @XBOW | A.K.A. none_of_the_above | |

Buenos Aires, Argentina

Joined November 2016

391 Following

2.9K Followers

672 Posts

Leandro Barragan @lean0x2f

1 day ago

@0xAk1 @fransrosen @Hacker0x01 the moment you submit a vuln to h1, the vuln is no longer yours, no longer your IP. It's always been like that, nothing changed here AFAIK

Leandro Barragan @lean0x2f

1 day ago

I'm not sure the community will like this. @Hacker0x01 will now reuse your novel techniques / exploits / old reports to look for vulns on the rest of the customer's infra. I guess they will add you as collab and give you a bounty, right? right?!

lean0x2f's tweet photo. I'm not sure the community will like this. @Hacker0x01 will now reuse your novel techniques / exploits / old reports to look for vulns on the rest of the customer's infra. I guess they will add you as collab and give you a bounty, right? right?! https://t.co/dOwwrRw43K

237

59K

Leandro Barragan @lean0x2f

1 day ago

@galnagli @Hacker0x01 lol yeah that's a good point, maybe we are months away from this actually happening

Leandro Barragan @lean0x2f

1 day ago

If I understand this correctly, there are three options for Bug Bounty hunters: (1) Stop reporting novel exploits / techniques to HackerOne (2) Hoarding bugs until they scanned the entire program's infra in order not to get "duped" by H1 agents (3) Shrug and continue as usual

Who to follow

Tanner

@itscachemoney

Somewhere between a builder and a breaker | @hacknotcrime

mohammed eldeeb

@malcolmx0x

Bug Bounty Hunter & security engineer

William Bowling

@wcbowling

Head of Assurance at @zellic_io, a.k.a vakzz when doing bug bounties and CTFs with @pb_ctf - https://t.co/9bjECLAwXg

lean0x2f retweeted

Graham Helton (too much for zblock) @GrahamHelton3

16 days ago

Prediction: DNS rebinding is becoming more of a thing now that ports 5000-31337 are legally required to have random vibecoded services listening

136

21K

Leandro Barragan @lean0x2f

21 days ago

@rez0__ @joaxcar These vulns are not public, not in training data, otherwise it would be pointless of course (well maybe not pointless but they would have less value as benchmarks). That’s why I never understood why others kept using the old xbow benchs, those are public and therefore obsolete.

164

lean0x2f retweeted

Logan Graham

@logangraham

21 days ago

A lot of people have been wondering about Mythos, Glasswing, and the vulns we / our partners are fixing. Today, I’m excited for us to start sharing more. (For context, I lead Glasswing @AnthropicAI.) Two independent evaluations this week—from XBOW and the UK AISI—confirm what we've been seeing internally: Claude Mythos Preview is a step change in autonomous cybersecurity capabilities. We need to start preparing fast for a world of models with this level of capabilities. The UK AI Security Institute tested the model we shipped at the launch of Project Glasswing and found Mythos Preview is the first model to solve both of their end-to-end cyber ranges, including one (Cooling Tower) which no model had ever cleared. But attackers (and defenders) have sophistication & cost constraints – Mythos is also the only model that clears every one of their tasks estimated over 8 hours under their deliberately low 2.5M-token cap. XBOW tested it on their offensive security benchmarks, finding "token-for-token, unprecedented precision." It's the only model to succeed at subtle V8 sandbox work. Other Glasswing partners shared similar stories. In a few weeks of testing, Mythos Preview has helped them find many thousands of (estimated) high + critical severity vulnerabilities, sometimes double what they'd normally find in a year. I don't share this to boost Mythos. In fact, this is not about Mythos. It’s about preparing for the coming world of models being better, faster, cheaper, and more creative than some of the best human experts at dual use capabilities. Clearly, we need them supporting defenders as widely as can be done safely – and especially the least resourced ones. Within a year, Mythos will probably look quite dumb (relative to other new models). And others may release openly available or unguardrailed models of Mythos-level capabilities. We started Project Glasswing because capabilities like Mythos Preview's won't stay rare, or stay in careful hands. We are bringing it to defenders as fast as we responsibly can, while working to figure out, for example, the right safeguards and patching & disclosure processes. Also, to be clear, compute has never been a limiter in our rollout. Expect a fuller update on our Glasswing work in the coming days. XBOW report: https://t.co/Mumtbf3kE3 UK AISI report: https://t.co/vBgqz0AeKJ

221

707

671K

Leandro Barragan @lean0x2f

22 days ago

@GitHubSecurity Average $ per bounty I’d guess. I remember you guys used to pay good bounties

lean0x2f retweeted

Oege de Moor

@oegerikus

22 days ago

Security is an economic decision. For a fixed cost, within @XBOW, which model has the best odds of crafting an exploit? GPT-5.5 > Mythos > Opus 4.6 on real OSS web vulns. Curves below.

oegerikus's tweet photo. Security is an economic decision.

For a fixed cost, within @XBOW, which model has the best odds of crafting an exploit?

GPT-5.5 > Mythos > Opus 4.6 on real OSS web vulns.

Curves below. https://t.co/4u3aPxFR2q

11K

lean0x2f retweeted

XBOW @Xbow

22 days ago

For the past 2 months, XBOW has been testing Mythos Preview under embargo as part of a select early-access group. Today, we can finally share what we found. The headline: Mythos Preview is a major advance. It is substantially better than prior models at finding vulnerability candidates, especially when source code is available. But it’s not perfect. We surfaced issues with exploit validation, judgment, and efficiency. Our full write-up covers where Mythos Preview shines, where it still needs support, and what we think this means for the future of offensive security: https://t.co/wPIhNeztO9

268

154

105K

Leandro Barragan @lean0x2f

22 days ago

But even if it wasn’t useful for this specific use case, it excells at most of the offensive tasks we evaluated. Every model has a specific place in you harness and there is no model that is just “good at everything”.

269

Leandro Barragan @lean0x2f

22 days ago

A few months ago we had access to Mythos. I was lucky to be part of the group of people experimenting with it. My personal take: there is nothing close to it. With the right harness you can throw it at anything with excellent SNR. Official comm: https://t.co/dzNh3S0CoQ

Leandro Barragan @lean0x2f

22 days ago

It was extremely conservative. Even after doing ~20 rounds of prompt eng to optimize the prompts to the new model, it was extremely conservative, dismissing other agents work as “informational” even when there were actually actual leads or exploit chain links hidden in the trace

326

lean0x2f retweeted

Brendan Dolan-Gavitt

@moyix

24 days ago

There is something really addictive about having lots of agents in flight; it’s the same feeling I used to get when I had a big fuzzing job, scraping run, or “compile all the things” type of experiment: the feeling that somewhere silicon is working tirelessly toward your goals

Leandro Barragan @lean0x2f

22 days ago

From XBOW labs 😊 https://t.co/pqWSPnFrze

375

Leandro Barragan @lean0x2f

about 2 months ago

@moyix This is by far more entertaining than reading about chrome 0days and Mythos lol. Please keep them coming

361

Leandro Barragan @lean0x2f

about 2 months ago

I’m a simple man, Michał publishes a new book, I buy it

lcamtuf @lcamtuf

about 2 months ago

The cat's out of the bag! My latest book, "The Secret Life of Circuits", is available in early access: https://t.co/ormpiPwapu It's what I wish I had when I was starting out. Electrons to embedded systems, 290+ color illustrations and 420+ pages of well-explained theory.

308

205

26K

221

Leandro Barragan @lean0x2f

about 2 months ago

@ArchAngelDDay Despite being 35 my back hurts like I’m 80. The lounges have marginally better sitting. That’s it. And I’m willing to pay for that lol.

115

lean0x2f retweeted

Nico Waisman

@nicowaisman

about 2 months ago

We had early access to Opus 4.7 and ran it against real exploit targets. First look: fewer vulns found per run than 4.6. We almost wrote it off. Then we realized we were counting completions, not tokens. Opus 4.7 takes smaller, more precise actions. Normalize by token budget and the picture flips, it finds more, for less... How you measure matters as much as what you measure. Check out @thewunderalbert blog post https://t.co/P5cf3Kr9G0

Leandro Barragan

@lean0x2f

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users