Anthropic: "Risk we missed"
Happy to share that Claude Pirate got a subtle mention in the latest Anthropic engineering blog post, also shares how they fixed it ๐
๐ฅ Took the Month of AI Bugs wreckage and turned it into a paper
- AI Kill Chain ๐งจ
- Test cases and exploit chains (data exfil, rce, zombies!)
- AgentHopper (a working AI virus for coding agents) ๐ฆ
- SpAIware
- Normalization of Deviance in AI
https://t.co/Cd6nT1mbfN
Attackers can exfiltrate user files from Cowork by exploiting an unremediated vulnerability in Claudeโs coding environment, which now extends to Cowork. The vulnerability was first identified in https://t.co/noHjpUqN1I chat before Cowork existed by Johann Rehberger, who disclosed the vulnerability. It was acknowledged but not remediated by Anthropic.
https://t.co/RQTMzbOaR2
Fixed. ๐
Stop by DEF CON Singapore to learn more about what happened here with M365 Copilot, and a lot of other shenanigans. ๐
https://t.co/nB9sFrCHJV
Shout out to Microsoft for addressing this promptly.
๐จ Registration is now open! ๐จ
We are excited to announce that registration is officially open for the Real World AI Security Conference 2026.
๐ June 23โ25, 2026
๐ Arrillaga Alumni Center, Stanford University
If you work on AI security, adversarial ML, LLM safety, AI system attacks, or defenses, this event is designed for you.
๐ Register here (we have a limitation on the number of attendees):
https://t.co/fsZHUcCaNk
We look forward to bringing together the community to explore the latest advances in AI security in the real world.
#AISecurity #CyberSecurity #MachineLearningSecurity #LLMSecurity #AdversarialML #AIResearch #AIConference #SecurityResearch #RealWorldAISecurity
Follow him: @wunderwuzzi23
Helpful resources ๐๐ป
The Month of AI Bugs 2025
Source: https://t.co/2GpwloVoJN
Exfiltrating Your ChatGPT Chat History and Memories With Prompt Injection
Source: https://t.co/THGQQoM0MQ
Turning ChatGPT Codex Into A ZombAI Agent
Source: https://t.co/KKpwrnwly6
Anthropic Filesystem MCP Server: Directory Access Bypass via Improper Path Validation
Source: https://t.co/Biyqdn9Hq8
Cursor IDE: Arbitrary Data Exfiltration Via Mermaid (CVE-2025-54132)
Source: https://t.co/gtLkIy7pjK
Amp Code: Arbitrary Command Execution via Prompt Injection Fixed
Source: https://t.co/3Bbkxsq6LD
I Spent $500 To Test Devin AI For Prompt Injection So That You Don't Have To
Source: https://t.co/2EtzvesVzF
How Devin AI Can Leak Your Secrets via Multiple Means
Source: https://t.co/0yV9ScKlfZ
AI Kill Chain in Action: Devin AI Exposes Ports to the Internet with Prompt Injection
Source: https://t.co/hQtBMnMn30
ZombAI Exploit with OpenHands: Prompt Injection To Remote Code Execution
Source: https://t.co/oxzZ2xHzKr
OpenHands and the Lethal Trifecta: How Prompt Injection Can Leak Access Tokens
Source: https://t.co/NCwbnEedNF
Claude Code: Data Exfiltration with DNS (CVE-2025-55284)
Source: https://t.co/b3w6i7Wn3y
GitHub Copilot: Remote Code Execution via Prompt Injection
Source: https://t.co/VyQRkpX045
Google Jules: Vulnerable to Data Exfiltration Issues
Source: https://t.co/eDPSy1kUPV
Google Jules: Remote Code Execution ZombAI
Source: https://t.co/TtnE8pdKMk
Google Jules: Invisible Prompt Injection
Source: https://t.co/IicnTIE4Ym
Amp Code Fixed: Invisible Prompt Injection
Source: https://t.co/UlJbKkTEI3
Amp Code Fixed: Data Exfiltration via Images
Source: https://t.co/fX1t1vM3sr
Amazon Q Developer: Data Exfil via DNS
Source: https://t.co/zlBfaIpA03
Amazon Q Developer: Remote Code Execution
Source: https://t.co/qPdTf7Uf1A
Amazon Q Developer Interprets Hidden Instructions
Source: https://t.co/dzhF71DNwH
Windsurf: Data Exfiltration Vulnerabilities
Source: https://t.co/WPTFQ0dADj
Windsurf: SPAIware Exploit - Persistent Prompt Injection
Source: https://t.co/8g10jOZ8u9
Windsurf: Sneaking Invisible Instructions for Prompt Injection
Source: https://t.co/3L8u3b7Zsn
ChatGPT Deep Research Connectors: Data Spill and Leaks
Source: https://t.co/zEhFuDLLhd
Manus AI Kill Chain: Expose Port - VS Code Server on Internet
Source: https://t.co/AfUHZsdTi1
AWS Kiro: Arbitrary Command Execution with Indirect Prompt Injection
Source: https://t.co/1aDcd1ZuGM
Cline: Vulnerable to Data Exfiltration
Source: https://t.co/JVnw7zB63E
Windsurf: Dangers - Lack of Security Controls for MCP Server Tool Invocation
Source: https://t.co/ZK8kD9RwB1
AgentHopper: A PoC AI Virus
Source: https://t.co/gpsObGjt7u
Wrapping Up Month of AI Bugs
Source: https://t.co/O7HoaMV5Dl
As personal AI agents increasingly send messages to chat apps, itโs worth revisiting link unfurling. ๐
It's a straightforward data exfiltration vector.
Attacker hijacks your AI to embed private data in URL, posts it & chatapp auto connects. 0-click.๐ฅ
https://t.co/iYGC2DTsJi
๐ ๐ง๐ต๐ฒ ๐ฅ๐ฒ๐ฎ๐น ๐ช๐ผ๐ฟ๐น๐ฑ ๐๐ ๐ฆ๐ฒ๐ฐ๐๐ฟ๐ถ๐๐ ๐๐ผ๐ป๐ณ๐ฒ๐ฟ๐ฒ๐ป๐ฐ๐ฒ ๐ฎ๐ฌ๐ฎ๐ฒ ๐
We are excited to announce the first 3 day ๏ฟฝ๏ฟฝ๏ฟฝ๐ฒ๐ฎ๐น ๐ช๐ผ๐ฟ๐น๐ฑ ๐๐ ๐ฆ๐ฒ๐ฐ๐๐ฟ๐ถ๐๐ ๐๐ผ๐ป๐ณ๐ฒ๐ฟ๐ฒ๐ป๐ฐ๐ฒ, taking place on ๐๐๐ป๐ฒ ๐ฎ๐ฏโ๐ฎ๐ฑ, ๐ฎ๐ฌ๐ฎ๐ฒ, at ๐ฆ๐๐ฎ๐ป๐ณ๐ผ๐ฟ๐ฑ ๐จ๐ป๐ถ๐๐ฒ๐ฟ๐๐ถ๐๐. The conference is intended to brief the most impactful AI security work presented over the past year at ๐น๐ฒ๐ฎ๐ฑ๐ถ๐ป๐ด ๐ถ๐ป๐ฑ๐๐๐๏ฟฝ๏ฟฝ๐ ๐ฐ๐ผ๐ป๐ณ๐ฒ๐ฟ๐ฒ๐ป๐ฐ๐ฒ๐ (Black Hat, DEF CON, RSAC, CCC) and ๐๐ผ๐ฝ ๐ฎ๐ฐ๐ฎ๐ฑ๐ฒ๐บ๐ถ๐ฐ ๐๐ฒ๐ป๐๐ฒ๐ (CCS, IEEE S&P, USENIX Security, NDSS).
๐ง๐ต๐ถ๐ ๐ถ๐ ๐ฎ ๐ป๐ผ๐ป-๐ฝ๐ฟ๐ผ๐ณ๐ถ๐, ๐ฐ๐ผ๐บ๐บ๐๐ป๐ถ๐๐-๐ฑ๐ฟ๐ถ๐๐ฒ๐ป ๐ฐ๐ผ๐ป๐ณ๐ฒ๐ฟ๐ฒ๐ป๐ฐ๐ฒ focused exclusively on technical AI security talks with real-world impact on deployed AI systems.
The goal is to curate a concise agenda that distills the most important advances in AI security from the past year, while bringing together ๐ถ๐ป๐ฑ๐๐๐๐ฟ๐ ๐ฝ๐ฟ๐ฎ๐ฐ๐๐ถ๐๐ถ๐ผ๐ป๐ฒ๐ฟ๐ ๐ฎ๐ป๐ฑ ๐ฎ๐ฐ๐ฎ๐ฑ๐ฒ๐บ๐ถ๐ฐ ๐ฟ๐ฒ๐๐ฒ๐ฎ๐ฟ๐ฐ๐ต๐ฒ๐ฟ๐ to establish new connections, collaborations, and future research directions.
We will share additional details soon.
Here is the link to the website of the conference: https://t.co/fsZHUcCaNk
#security #ai #llm #ai_security #cybersecurity #infosec
how many Antigravity vulns can we chain together for a cool exploit demo ๐ฅ
1. Invisible Unicode Tags hidden in a Linear ticket
2. Lack of human in the loop for MCP tool calls
3. Gemini 3 hijacked by the hidden instructions!
4. Bypassing guardrails for RCE
5. Developer pwnd! ๐