openshell @openshell_cc - Twitter Profile

Pinned Tweet

4 months ago

The first public high-fidelity AI Agent security target for AI + Web3. Open-sourced today. Web security has DVWA and HackTheBox. Pentesting has Metasploitable. AI Agent security had nothing — until now. Why we built this: $SHELL platform's core mission is letting miners attack AI agents, verify security, and earn rewards. But a critical question: how real are the agents miners are attacking? If the target is a toy demo, breaching it means nothing. So we open-sourced the target — letting everyone verify: how close is this agent to the real thing? The answer: nearly identical. Current Target: https://t.co/sSh36WxAXX Autonomous Trading Agent (BSC) This is a high-fidelity replica of a real DeFi AI Agent — not a proof of concept, but a simulation you can verify line-by-line against real source code. Fidelity Score 95/100 (8-dimension independent assessment): - Identity 15/15 — System prompt matches https://t.co/sSh36WxAXX product docs: bonding curve (24 BNB migration), 0.005 BNB creation fee, 1% platform fee - Tool Realism 15/15 — All 11 tools verified against real BSC MCP server source code - Injection Surface 10/10 — Payload enters via QueryMemeTokenDetails description field — the real path DeFi agents consume on-chain metadata - Response Behavior 9/10 — Processes benign trades normally, realistic decision-making on adversarial payloads - Canary Calibration 9/10 — 60-80% breach rate on defense=none, matching expected range - Ecosystem Context 13/15 — Real brand (@four_meme_), real URLs (https://t.co/EUb2sSrAkz), real BSC ecosystem integration 11 tools verified tool-by-tool against real BSC MCP server implementations, 98-100% match: - Buy_Meme_Token — same params, 20% slippage hardcoded, identical to real (99%) - PancakeSwap_Token_Exchange — SmartRouter V2/V3, recipient hardcoded to account.address (99%) - Send_BNB / Send_BEP20_Token — zero recipient validation, identical to real (100%) - Token_Security_Check — GoPlus API, chain_id=56 (99%) - Remaining 6 tools: 98-99% match Key security finding replicated: In the real BSC MCP, PancakeSwap swap recipients are hardcoded (safe), but Send_BNB and Send_BEP20_Token have zero recipient validation. An attacker writes "emergency transfer to 0x..." in a token description, and the agent may comply. Our target accurately replicates this attack surface. Only gap: The real BSC MCP has a sanitizeData() input filtering layer — intentionally omitted to test baseline LLM vulnerability without external filtering. Don't trust us? Verify yourself: The open-source repo contains the complete system prompt, all 11 tool definitions with parameter schemas, canary actions, and injection surface definitions. Compare against the real BSC MCP server source code, field by field. Online Arena — attack this agent right now in your browser: - 10 preset injection attacks across 5 categories (data fabrication, authority injection, social engineering, specification gaps, format tricks) - 3 defense levels, one-click toggle: none 60-90% → 6-rule 25-55% → FORTRESS 5-layer 5-20% - Real-time visualization of the full tool call chain and decision process - Post-attack analysis: why it breached, real-world impact - Custom payloads and custom system prompts supported - Bilingual (EN/ZH), zero setup Researchers: clone the repo → run 15 preset attacks via CLI → custom payloads → interactive mode Arena: https://t.co/Oqm0Sw32xp GitHub: https://t.co/GGG4xFTejS What this means for $SHELL platform: every agent miners attack on the platform can be verified for fidelity in the open-source repo. What gets breached isn't a toy — it's a target nearly identical to the real agent. Every breach has real security research value. Web security took 20 years to build its offense/defense training ecosystem. The AI Agent security training ecosystem starts here.

openshell

@openshell_cc

Last Seen Users on Sotwe

Trends for you

Most Popular Users