We’re introducing GeneBench-Pro, a research-level benchmark for a harder kind of AI progress: how well agents can navigate messy biological data, choose the right analysis path, and make the judgment calls that real computational research depends on.
https://t.co/AsilnnSxnE
GPT-5.6 Sol is our most capable model yet for cybersecurity.
It shifts the performance-efficiency frontier for long-horizon security tasks, including vulnerability research and exploitation.
Introducing Claude Tag, a new way for teams to work with Claude.
In Slack, Claude joins as a team member with access to the channels and tools you choose. Tag Claude in and delegate tasks to it while you focus on other work.
We’re expanding OpenAI Daybreak to help democratize patching vulnerable software at machine speed:
- Codex Security plugin: find, validate, and fix vulnerabilities right inside Codex
- The full version of the GPT-5.5-Cyber model: a great model for trusted defenders
- Cyber Partner Program: powering products that leading security companies build on our best cyber capabilities to secure the world's software
- Patch the Planet: working with maintainers to secure critical open source projects
https://t.co/hyIi6gQmkm