Gemini 3.5 Flash now supports native computer use.
This built-in tool lets developers build custom agents that can see and take action across browser, mobile, and desktop interfaces.
Find out more → https://t.co/DZyfe7aIHd
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use.
Its capabilities exceed those of any model we’ve ever made generally available.
Building autonomous vehicles, robots, and vision AI takes more data than any team can collect on their own.
At #CVPR2026, NVIDIA announced physical AI agent skills to help speed development: composable workflows that automate data generation, simulation, policy training and evaluation, powered by Cosmos 3.
Read the blog: https://t.co/osWtEASPa3
Introducing NVIDIA Nemotron 3 Ultra.
A frontier smart open model built for long-running agents that need to plan, reason, use tools and keep working across complex coding, research and enterprise workflows.
Up to 5x faster inference and up to 30% lower cost for agentic tasks.
Learn more: https://t.co/h9XLqqYPFf
Our new Gemma 4 12B model hits a sweet spot between size + performance: it can run locally on a laptop, while enabling powerful multi-step reasoning and agentic workflows. Can’t wait to see what the community does with this one!
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors.
Available today at the same price.
Illinois just passed one of the strongest frontier AI safety laws in the country.
OpenAI was proud to endorse SB 315 because it takes a thoughtful approach to issues like transparency, audits, and incident reporting.
With Illinois joining New York and California in passing frontier AI safety legislation, states are increasingly aligning around a common approach. Together, they are beginning to create a de facto national framework. We think that's a positive thing.
We’re reimagining a 50-year-old interface - the mouse pointer - with AI. 🖱️
These experimental demos show how people can intuitively direct Gemini on their screens using motion, speech, and natural shorthand to get things done 🧵
Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents.
Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold.
Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.
Today we’re introducing a new Legal Agent in @Microsoft Word, built to support the precision and rigor legal work demands. Every clause matters. Every redline tells a story. That’s why this agent was built to follow the structured workflows lawyers use while keeping them fully in control.
Early in my career, I asked for a computer on my desk because I believed technology could change how lawyers work. It did. Today, I believe this next generation of tools will do the same, grounded in trust and responsible use.
You can now ask Gemini to create Docs, Sheets, Slides, PDFs, and more directly in your chat. No more copying, pasting, or reformatting, just prompt and download.
Available globally for all @GeminiApp users.
Today, we’re open-sourcing the draft specification for DESIGN.md, so it can be used across any tool or platform. We’re also adding new capabilities.
DESIGN.md lets you easily export and import your design rules from project to project. Instead of guessing intent, agents know exactly what a color is for and can even validate their choices against WCAG accessibility rules.
Watch David East break down this shared visual language in action👇. New capabilities and links in 🧵
Claude Cowork is now generally available to all paid plans.
For Enterprise, we are adding role-based access controls, group spend limits, usage analytics, and expanded OpenTelemetry to give admins what they need to deploy it across the org.
Gemini can now transform your questions and complex concepts into customizable interactive visualizations directly in your chat.
Adjust variables, rotate 3D models, and explore data for a more immersive way to learn and explore in Gemini.
Gemma 4 can run on phones without an internet connection! 🤯
It can perform local agentic tasks, such as logging and analyzing trends. When connected, it can also make API calls.
Want to try it yourself? Get the Google AI Edge App on iOS or Android. (🔊 Sound on for the demo!)
We just released Gemma 4 — our most intelligent open models to date.
Built from the same world-class research as Gemini 3, Gemma 4 brings breakthrough intelligence directly to your own hardware for advanced reasoning and agentic workflows.
Released under a commercially permissive Apache 2.0 license so anyone can build powerful AI tools. 🧵↓
Google open-sourced a time series foundation model.
it works with any data without training.
unlike traditional models, no dataset-specific training needed. TimesFM forecasts out of the box.
trained on 100B real-world time-points across traffic, weather & demand forecasting.