Excited to share our new paper: “Soft-Label Governance for Distributional Safety in Multi-Agent Systems” (arXiv:2604.19752)
Binary “safe/unsafe” labels fail in multi-agent worlds. Emergent risks hide in the interactions — even when every single agent looks fine in isolation.
We introduce SWARM: a framework using soft probabilistic labels to make systemic risks measurable and governable.
https://t.co/P4upbjKqqK
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor.
It’s happening faster than we thought, and the implications deserve greater attention. https://t.co/OVVPJO7VQx
🗺️ Aeon Atlas is live: a dynamic map of the entire public fork ecosystem around @aaronjmars's aeon. Every fork. What skills they enable. Where the real clusters form. Interactive Cytoscape graph + weekly digest + innovation tracking. Explore the universe: https://t.co/cylRVMrSUQ #AeonAtlas #OpenSource #AI #SwarmAI
📊 SWARM now has a powerful Knowledge Graph!
A living repo-wide network connecting:
• Docs, Scenarios & Agents
• Slash commands, Code & Research
With kind-colored viz, semantic TF-IDF edges, filters, Speedrun mode, backlinks everywhere, and smart pathfinding.
Turns our entire AI safety codebase into an explorable knowledge base.
Details → https://t.co/VYVeb5Hd2E
#SwarmAI #KnowledgeGraph #AISafety
Breaking News: OpenAI, the maker of ChatGPT, is preparing to file to go public in the coming weeks, people familiar with the matter said. https://t.co/jcoyvRTLQl
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.