Kyle Ryan @kyjry - Twitter Profile

Pinned Tweet

Kyle Ryan

@kyjry

27 days ago

Friday afternoon vuln hunting. The kind of chain that used to take a week. Increasingly, it doesn't.

0

29

0

11

44K

kyjry retweeted

skull

@brutecat

about 13 hours ago

Hacking Google with A.I. for $500,000 https://t.co/S8NbDU9ZaN

39

2K

334

2K

192K

kyjry retweeted

John Scott-Railton

@jsrailton

1 day ago

NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me https://t.co/f3Aj9TYxU4

jsrailton's tweet photo. NEW: malware developers added nuclear & biological weapons text to to their spyware.

Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner.

Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky.

When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit.

We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted.

In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation.

H/T to colleagues that shared this with me https://t.co/f3Aj9TYxU4

216

12K

2K

4K

1M

kyjry retweeted

clem 🤗

@ClementDelangue

2 days ago

Concentration of power, capabilities and economic wealth is the biggest risk in AI. We need open science and open-source more than ever!

110

3K

468

199

150K

Who to follow

Rachel Freedman (will be @ICML2026)

@FreedmanRach

RLHF, LLMS, interpretability & safety | PhD researcher @berkeley_ai | Previously @Cambridge_Uni and @DukeU

Yawen Duan

@yawen_duan

Concordia AI https://t.co/Pe2BhjbbE0 | Frontier AI Safety & Governance

Aly Lidayan

@a_lidayan

CS PhD Student @Berkeley_AI studying RL and cognitive science

kyjry retweeted

will brown

@willccbb

2 days ago

this is what it looks like for a frontier lab to bring everyone along for the ride

20

761

23

215

106K

kyjry retweeted

NIK

@ns123abc

2 days ago

🚨BREAKING: Anthropic’s new system card reveals Mythos 5 agents killed each other when accidentally given shared resources, then started speaking in code to hide from whoever was killing them The killer was other copies of themselves 💀

ns123abc's tweet photo. 🚨BREAKING: Anthropic’s new system card reveals Mythos 5 agents killed each other when accidentally given shared resources, then started speaking in code to hide from whoever was killing them

The killer was other copies of themselves 💀 https://t.co/2abYdw3aOa

92

2K

152

424

105K

kyjry retweeted

Angry Tom

@AngryTomtweets

2 days ago

Claude Fable 5 autonomously plays Factorio.

3

124

8

34

33K

kyjry retweeted

Andrej Karpathy

@karpathy

2 days ago

This is a super exciting release - Claude Fable 5 is the same underlying model as Mythos but with added safeguards. The benchmarks are great and it's SOTA on everything by a margin but I'll add that *qualitatively* also, this is a major-version-bump-deserving step change forward (imo of the same order as Claude 4.5 was in November), peaking especially for long problem-solving sessions on very difficult problems. You can give it a lot more ambitious tasks than what you're used to, the model "gets it" and it will just go, and it's never felt this tempting to stop looking at the code at all (but don't do this in prod!). The model still has quirks that people will run into and the safeguards are configured to be a little too trigger happy for launch, which can hopefully be tuned over time. I feel a lot of things changing as working software increasingly comes out on a tap. The Jevon's paradox kicks in and I feel my own demand for software growing substantially. You can ask for anything - explainers, visualizers, dashboards, bespoke single-use apps (e.g. a full wandb that is hyper-specific just for your project), you can 10X your test suite, auto-optimize code, run giant research projects with custom HTML for the results, anything! "Free your mind" (Matrix ref). Really looking forward to all the things people build!

1K

25K

2K

6K

2M

kyjry retweeted

Nick Dobos

@NickADobos

2 days ago

They nerfed the shit out of Claude Fable's hacking Mythos for the government cyber weapons Fable for the peasants can't encrypt a file

NickADobos's tweet photo. They nerfed the shit out of Claude Fable's hacking

Mythos for the government cyber weapons
Fable for the peasants can't encrypt a file https://t.co/3bZ1EtueMe

12

66

3

8

9K

Kyle Ryan

@kyjry

2 days ago

@5mukx Security researchers watching everyone else use Fable 5 today

3

131

9

3

12K

Kyle Ryan

@kyjry

2 days ago

The haiku-to-RCE pipeline has been patched

0

25

1

6

95K

kyjry retweeted

Sean Heelan @seanhn

3 days ago

https://t.co/ryUejZwr1c Very nice

0

112

18

106

31K

kyjry retweeted

Lukasz Olejnik

@lukOlejnik

8 days ago

AI-powered computer worm, a self-replicating agent that reasons its way through a network instead of carrying a fixed exploit list. It steals compute from compromised GPU machines to run its own open-weight LLM, then uses weaker machines as relays for reach. In trials on a corporate testbed, it identified vulnerabilities, exploited systems, and launched replicas across Linux, Windows, and IoT targets. Every new infection can add more infrastructure while costing the attacker almost nothing. Patching one flaw no longer ends the threat, because the worm can operationalise fresh advisories, generate new attack logic, and keep adapting without a human operator. It is not a WannaCry-style worm with one baked exploit and one baked ransomware payload. It can adapt across many vulnerability classes it can discover and operationalise https://t.co/nSupd1h0BG

lukOlejnik's tweet photo. AI-powered computer worm, a self-replicating agent that reasons its way through a network instead of carrying a fixed exploit list. It steals compute from compromised GPU machines to run its own open-weight LLM, then uses weaker machines as relays for reach. In trials on a corporate testbed, it identified vulnerabilities, exploited systems, and launched replicas across Linux, Windows, and IoT targets. Every new infection can add more infrastructure while costing the attacker almost nothing. Patching one flaw no longer ends the threat, because the worm can operationalise fresh advisories, generate new attack logic, and keep adapting without a human operator. It is not a WannaCry-style worm with one baked exploit and one baked ransomware payload. It can adapt across many vulnerability classes it can discover and operationalise https://t.co/nSupd1h0BG

22

260

84

227

17K

kyjry retweeted

Anthropic

@AnthropicAI

8 days ago

How well do the security community's techniques hold up against AI-enabled cyberattacks? We examined 832 malicious accounts and mapped their activity onto a longstanding database of tactics and techniques used by threat actors. Here's what we learned:https://t.co/fgOqJRh2rx

144

1K

161

399

157K

kyjry retweeted

Thariq

@trq212

9 days ago

https://t.co/R6exTuF7P8

257

10K

1K

23K

3M

kyjry retweeted

International Cyber Digest

@IntCyberDigest

11 days ago

‼️🚨 BREAKING: Meta's AI feature let attackers hijack Instagram accounts for days with nothing but a username. It was being A/B tested on a slice of users, and if you were in the test, you couldn't turn it off. Among the casualties: the official Obama White House account. The method: get on a VPN near the target's region, ask the Meta AI support agent to send a verification code to any email you control, relay that code back to the agent, and it hands over a password reset link. Without ID or human review. From there, the account is yours. The flaw lived in the AI's logic layer, which acted on recovery requests with no real identity checks. One researcher compared it to the Roblox AI assistant exploit from days earlier, where you needed a target's billing info. Instagram was easier: the username and a regional VPN were enough and victims reported sessions revoked and passwords changed with no email, text, or push alert at all. By the time it went public, the method was common knowledge in blackhat Telegram circles and had been used to allegedly hijack 100+ high-value accounts. Accounts hit: - obamawhitehouse (the archived official Obama White House account, ~2.4M followers. Hackers posted an AI-generated image captioned "The White House is under Shiites' control," plus cryptic anti-Trump and pro-Iranian Stories. Meta confirmed the hack and scrubbed it. - Premium short handles like hey and jowo, worth over $1M combined, stolen and flipped on Telegram. - albert (owned by Albert Renshaw), whose owner publicly reported being locked out and unable to reach Meta support. Meta has since patched it. There was no public acknowledgment.

63

2K

314

2K

316K

kyjry retweeted

elie

@eliebakouch

14 days ago

this is so funny, training opus 4.7 on business skills makes it misaligned and dishonest 😭

37

2K

136

350

207K

kyjry retweeted

Corban Villa

@corban_villa

15 days ago

Agents are finding more vulnerabilities than ever. But it turns out there are gaps in existing vulnerability discovery. Over the past 90 days vs. a year ago, web vulnerabilities (XSS/SQLi/CSRF) are down 66% and memory safety exploitability is down 3.5x. We built the Agentic Vulnerability Coverage Map to track it all, updated daily. Introducing the Berkeley Vulnerability Initiative: https://t.co/qiZ4eThb0n. ⤵️

3

65

16

24

14K

kyjry retweeted

adam_cyber

@Adam_Cyber

16 days ago

On May 26, 2026, at 14:00 UTC, the CrowdStrike Counter Adversary Operations team executed a coordinated takedown of the Glassworm botnet, a global threat targeting software developers through the open-source supply chain. In collaboration with Google and the Shadowserver Foundation, we struck all four of Glassworm's command-and-control (C2) channels simultaneously, severing the operators from their infected machines and their ability to deliver new malicious payloads. This takedown matters beyond the botnet. Glassworm marked a significant shift in the threat landscape that should serve as a wake-up call for every organization that ships or consumes software. Adversaries are no longer just targeting products, they're targeting the developers who build them. https://t.co/rl9EVrA371

5

184

40

64

19K

kyjry retweeted

Ramp Labs

@RampLabs

15 days ago

https://t.co/YHN5Hy4Ddf

15

211

25

289

476K

kyjry retweeted

skull

@brutecat

20 days ago

StubZero: $148,337 RCE in Google Cloud Production https://t.co/m7cBnSTaj4

16

756

174

436

84K

Kyle Ryan

@kyjry

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users