Anthropic has validated what we've been working on for 2 years with their Zero Trust eBook for AI Agents. @ThirdKeyAI and the https://t.co/WIZ8u0he7x even fill in the gaps.
@claudeai All I get for $200 a month now...
API Error: 400 messages.27.content.11: `thinking` or `redacted_thinking` blocks in the latest assistant message cannot be modified.
These blocks must remain as they were in the original response.
Google announced that all Android app developers must register centrally, pay a fee, and submit government ID, or their apps will be blocked on every device. Over 67 organizations oppose this. https://t.co/cJlLZL7ThS @AlteredDeal #KeepAndroidOpen
Small-angle orthogonal rotation hides arbitrary payloads in embeddings, undetected by off-the-shelf statistical defenses across every model-corpus pair we tested.
arXiv: https://t.co/A1NvFg44LX
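A rough illustration of why this kind of rotation is hard to spot with simple statistics: an orthogonal rotation preserves the vector's norm exactly, and at a small angle the cosine similarity to the original stays very close to 1. This is only a sketch of that geometric property, not the paper's method. The function names, the 384-dimension size, the 0.01 rad angle, and the choice of coordinate plane are all assumptions made for the example, and it does not show how a payload would be encoded.

```python
import math
import random


def givens_rotate(vec, i, j, theta):
    """Rotate vec in the (i, j) coordinate plane by theta radians.

    A single plane (Givens) rotation is orthogonal, so it leaves the
    Euclidean norm of the vector unchanged.
    """
    out = list(vec)
    c, s = math.cos(theta), math.sin(theta)
    out[i] = c * vec[i] - s * vec[j]
    out[j] = s * vec[i] + c * vec[j]
    return out


def norm(vec):
    """Euclidean norm."""
    return math.sqrt(sum(x * x for x in vec))


def cosine(a, b):
    """Cosine similarity between two vectors of equal length."""
    return sum(x * y for x, y in zip(a, b)) / (norm(a) * norm(b))


# Hypothetical stand-in for an embedding: 384 Gaussian values, fixed seed.
rng = random.Random(0)
original = [rng.gauss(0.0, 1.0) for _ in range(384)]

# Assumed small angle; the plane (0, 1) is arbitrary.
rotated = givens_rotate(original, 0, 1, theta=0.01)

norm_delta = abs(norm(rotated) - norm(original))
similarity = cosine(original, rotated)

print(f"norm change: {norm_delta:.2e}")
print(f"cosine similarity to original: {similarity:.6f}")
```

A detector that only looks at norms or at distance from the original vector has very little signal to work with here, which is consistent with the claim that such checks should not be the only control.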
I published a new security preprint:
VectorSmuggle: Steganographic Exfiltration in Embedding Stores and a Cryptographic Provenance Defense
The paper looks at an integrity gap in RAG infrastructure: embeddings are usually treated as opaque numerical artifacts, but vector stores generally lack native/default controls for embedding integrity or provenance.
The finding is nuanced: simple anomaly detectors catch many crude perturbations, but small-angle orthogonal rotation exposes a harder class of embedding tampering. Statistical detection helps, but it should not be the load-bearing control.
I propose VectorPin: a minimal cryptographic provenance protocol that signs a canonical commitment over the source, model, and vector bytes. If the vector is modified after pinning, verification fails.
Paper:
https://t.co/tPVU1is3zT
https://t.co/liGCeeb36t
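To make the pinning idea concrete, here is a minimal sketch of signing a commitment over source, model, and vector bytes, then checking it later. It is not the VectorPin protocol or its wire format. It assumes HMAC-SHA256 as a stand-in for whatever signature scheme the paper specifies, little-endian float32 vector encoding, and length-prefixed fields as the canonical form. The names `commitment`, `pin`, and `verify`, the key, and the model string are hypothetical.

```python
import hashlib
import hmac
import struct


def commitment(source: str, model: str, vector: list[float]) -> bytes:
    """Hash the source text, model identifier, and vector bytes.

    Each field is length-prefixed so that field boundaries are
    unambiguous. The vector is packed as little-endian float32
    (an assumption for this sketch).
    """
    fields = (
        source.encode("utf-8"),
        model.encode("utf-8"),
        struct.pack(f"<{len(vector)}f", *vector),
    )
    h = hashlib.sha256()
    for field in fields:
        h.update(struct.pack("<Q", len(field)))
        h.update(field)
    return h.digest()


def pin(key: bytes, source: str, model: str, vector: list[float]) -> bytes:
    """Produce a tag over the commitment (HMAC used as a stand-in signature)."""
    return hmac.new(key, commitment(source, model, vector), hashlib.sha256).digest()


def verify(key: bytes, source: str, model: str, vector: list[float], tag: bytes) -> bool:
    """Recompute the tag and compare in constant time."""
    return hmac.compare_digest(pin(key, source, model, vector), tag)


# Demo values only; none of these come from the paper.
key = b"demo-key-not-for-production"
source = "Example source chunk"
model = "example-embedding-model"
vec = [0.25, -0.5, 0.125]

tag = pin(key, source, model, vec)
ok = verify(key, source, model, vec, tag)

# Nudge one coordinate after pinning; verification should now fail.
tampered = [0.25, -0.5, 0.126]
bad = verify(key, source, model, tampered, tag)

print("untouched vector verifies:", ok)
print("tampered vector verifies:", bad)
```

Unlike a statistical detector, this check does not depend on how small the modification is: any change to the stored bytes changes the commitment, so the tag no longer matches.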
New paper on how Symbiont uses the ORGA loop to secure agents.
"Typestate-Enforced Agent Loops: Making Policy Gates Unskippable at Compile Time"
https://t.co/VcSF8prJy7
AI agents are getting more powerful.
The trust layer around them is not.
Today, too much agent safety still depends on prompts, wrappers, and best-effort guardrails. That is not enough for systems that can actually take action.
Introducing OATS: the Open Agent Trust Stack.
OATS is an open specification for zero-trust AI agent execution built around tool contracts, identity, policy, and auditability.
It is also grounded in real implementation work. Symbiont has been applying these ideas in practice over the past year.
The goal: make safe behavior enforceable by design, not optional at runtime.
https://t.co/RBYFYOZdGJ
#AI #AISecurity #AgenticAI #OpenSource