Claude Fable 5 changed how we work on the Claude Code team day to day.
We used to verify that Claude did the work right. Now we verify that it's doing the right work.
Here are the 3 biggest changes:
Congrats to @GoogleDeepMind on the launch of DiffusionGemma.
The model generates 256 tokens in parallel per step, delivering 150+ TPS on DGX Spark, and 1,000+ TPS on a single H100.
We're supporting it from day one with:
• BF16 and NVFP4 checkpoints on @huggingface🤗
• Free GPU-accelerated endpoints on https://t.co/6T0R9P7EXS
• @vllm_project support with FP8 precision
Get started with DiffusionGemma on NVIDIA: https://t.co/vurk7GCQUs
For musician and composer @sound4movement, Codex works like a studio assistant.
He asks for a piano track in 3/4, sets the tempo and harmony, then describes how the performance should build.
Codex handles the setup in Ableton Live. Michael stays focused on the creative work.
Gemini models are now accessible to millions of Apple developers through Apple’s Foundation Models framework and natively within Xcode. You can now easily swap between local and cloud inference using a shared API surface to build next-generation agentic app experiences, increase development velocity, and offload heavy workloads to the cloud. Additionally, you can use agentic coding assistance from Gemini in Xcode to accelerate multi-step development tasks.
Check out the full announcement to get started: https://t.co/q0TM4EjpqC
Your app can now search the web for images.
Web search in the Responses API now supports image results in addition to text results, so you can build apps that surface products, places, visual references, and source links for inspiration.
Meet DiffusionGemma ⚡ Our latest experimental open model (Apache 2.0) that generates text up to 4x faster.
Instead of predicting and typing just one word at a time like most language models, it drafts and refines entire blocks of text simultaneously.
Here’s how it works 🧵 ↓
Forget about our users? Who? Us??? Please.
These updates are rolling out globally on the web, starting with Google AI Ultra and all Workspace business customers with AI Ultra Access and AI Expanded Access. However, we *absolutely* plan to expand to others over time!
For over 20 years, we've dedicated ourselves to removing language barriers so people can learn, speak and connect more deeply than ever before.
Today, we’re taking our next step with the release of Gemini 3.5 Live Translate — our latest audio model for live, speech-to-speech translation across 70+ languages. 🧵
Today, we released Gemini 3.5 Live Translate, our latest audio model for live speech-to-speech translation.
It supports over 70 languages and starts translating as soon as you start talking, streaming translations while listening to what you say next. No awkward pauses or choppy audio, just real connection without language barriers.
So, how does it work? 🤔
The model is able to make split-second decisions to juggle speed and translation quality so conversations actually feel fluid, human, and natural. In order to do this, the model must receive and contextualize the input while simultaneously outputting the translated speech.
Through this process, Gemini 3.5 Live Translate manages to stay mere seconds behind each speaker and can even maintain pacing, pitch, and intonation across extended sessions.
See it in action below, or try it yourself in the Google Translate app on iOS & Android.
Move from question to insight faster with the AI-first Colab, powered by Gemini. Everything you need to accelerate your data science workflow is here.
✨ Generate code and explore data through simple prompts
🛠️ Rewrite code to fix errors on the fly
📋 Propose plans and execute end-to-end workflows
Meet the new Colab → https://t.co/T4pf3o6XHO
We want to help scientists discover their next breakthrough with AI.
Gemini for Science is our new suite of experimental tools to help them explore more hypotheses, validate work at scale, unpack literature with ease, and more 🧵
We've published a paper that explains our views on AI competition between the US and China.
The US and democratic allies hold the lead in frontier AI today. Read more on what it’ll take to keep that lead: https://t.co/TgJBeodWYK
new in ai studio ⬇️
we’ve integrated @nanobanana to automatically create custom image assets for your app as it generates
plus, the newly redesigned edit tool now gives you visual control of your app, allowing you to update components, annotate your app, and swap image assets
GPT-5.5 Instant is starting to roll out to everyone in ChatGPT.
Much more concise. Better memory. More personalized.
And it's way easier to talk to. Really.
New for financial services: ready-to-run Claude agent templates for building pitches, conducting valuation reviews, closing the books at month-end, and more.
Install them as plugins in Cowork and Claude Code, or use our cookbooks to run them in production as Managed Agents.
Mind Maps are getting a major glow up 💅
These new features are rolling out today:
🚗 Customization: Steer your map with specific user prompts
📂 Organization: Rename and Share your maps instantly
🗺️ Navigation: Silky smooth transitions between nodes
Let us know what you think!
Gemma 4: Now up to 3x Faster. ⚡
Same quality, way more speed. Our new MTP drafters allow Gemma 4 to predict multiple tokens at once, effectively tripling your output speed without compromising intelligence.
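
For readers wondering how a drafter can add speed without changing the output, here is a minimal toy sketch of greedy draft-and-verify decoding, the general idea behind speculative decoding. It is not Gemma 4's MTP implementation: `toy_target`, `toy_drafter`, and `speculative_decode` are hypothetical names, the "models" are plain functions over integers, and the bonus token that real systems often gain on a fully accepted draft is omitted.

```python
from typing import Callable, List, Tuple

Token = int
NextToken = Callable[[List[Token]], Token]


def toy_target(ctx: List[Token]) -> Token:
    """Stand-in for the large model (hypothetical): counts upward modulo 10."""
    return (ctx[-1] + 1) % 10


def toy_drafter(ctx: List[Token]) -> Token:
    """Stand-in for the cheap drafter (hypothetical): agrees with the target
    except right after a 4, where it guesses wrong."""
    return 0 if ctx[-1] == 4 else (ctx[-1] + 1) % 10


def speculative_decode(
    target: NextToken,
    drafter: NextToken,
    prompt: List[Token],
    num_new: int,
    k: int = 3,
) -> Tuple[List[Token], int]:
    """Greedy draft-and-verify. Returns (new_tokens, verification_passes)."""
    out = list(prompt)
    goal = len(prompt) + num_new
    passes = 0
    while len(out) < goal:
        # 1) The drafter proposes up to k tokens, one after another.
        draft: List[Token] = []
        for _ in range(min(k, goal - len(out))):
            draft.append(drafter(out + draft))
        # 2) The target checks the draft. A real system scores all drafted
        #    positions in one batched forward pass; this toy calls the target
        #    per position but counts the whole check as a single pass.
        passes += 1
        accepted: List[Token] = []
        for tok in draft:
            expected = target(out + accepted)
            if tok != expected:
                accepted.append(expected)  # keep the target's token, drop the rest
                break
            accepted.append(tok)
        out.extend(accepted)
    return out[len(prompt):], passes


tokens, passes = speculative_decode(toy_target, toy_drafter, prompt=[0], num_new=9)
print(tokens, passes)
```

Because every accepted token is one the target would have produced itself, the result matches plain greedy decoding with the target alone. The speedup comes from needing fewer target passes than tokens, and it shrinks as the drafter's guesses get worse.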
🤯 Ollama now supports Claude Desktop via Claude’s built-in third-party inference.
ollama launch claude-desktop
This allows all models from Ollama's Cloud to be used across Claude Cowork and Claude Code from the Claude Desktop app.
We're launching the Anthropic STEM Fellows Program.
AI will accelerate progress in science and engineering. We're looking for experts across these fields to work alongside our research teams on specific projects over a few months.
Learn more and apply: https://t.co/MoF60j53pX
What does it take to run 3, 5, or even 10 concurrent instances of Gemma 4 locally?
We've open-sourced a demo letting you run multiple models side-by-side on your hardware.
Gemma 4 26B A4B easily runs 10+ concurrent requests on a MacBook Pro M4 Max at 18 tokens/sec per request.
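
To put those numbers in perspective, here is a small back-of-the-envelope sketch of the aggregate throughput they imply. It assumes the 18 tokens/sec per-request rate holds at exactly 10 concurrent requests. The function names and the 900-token response length are hypothetical, chosen only for illustration.

```python
def aggregate_throughput(concurrent_requests: int, tokens_per_sec_per_request: float) -> float:
    """Total tokens per second across all concurrent requests."""
    return concurrent_requests * tokens_per_sec_per_request


def seconds_to_finish(tokens_per_response: int, tokens_per_sec_per_request: float) -> float:
    """Wall-clock time for one request to stream its full response."""
    return tokens_per_response / tokens_per_sec_per_request


# Figures from the post: 10 requests at 18 tokens/sec each.
total = aggregate_throughput(10, 18)
# Hypothetical 900-token response, an assumption for illustration.
latency = seconds_to_finish(900, 18)
print(f"{total:.0f} tokens/sec in aggregate, {latency:.0f} s per 900-token response")
```

Each request still streams at its own per-request rate. Concurrency raises the total work done per second, not the speed of any single response.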