Jensen Huang, CEO of Nvidia:
"Every engineer is going to have and manage hundreds of agents."
The most valuable engineering skill of 2026 is not taught in any university.
No CS program teaches harness engineering.
No bootcamp teaches agent memory architecture.
No degree prepares you to build systems that survive production.
One builder mapped the entire thing out — free, step by step, no degree required.
This is the roadmap ↓
Bookmark this for the weekend.
Anthropic is calling for top AI labs to weigh slowing the pace of development, suggesting that AI systems are advancing so rapidly that they may soon be able to improve themselves without human intervention in ways that could pose societal risks. https://t.co/8c7xkeX17B
@a16z I did a cool experiment which explains this from a more relatable standpoint: the MIMIC test. Here is a clip from that video; the full video is on my channel.
None of this guarantees recursive self-improvement is on the horizon. It’s not yet clear that Claude is capable of research judgment—of choosing the right problems to work on.
But if these trends continue, AI systems designing and building their own successors is plausible. This could revolutionize society—medicine, technology, the economy—for the better. But it may also compound alignment issues and ultimately lead to loss of control.
The Anthropic Institute (in collaboration with external stakeholders) will conduct research to think through the implications of increasingly powerful, potentially self-improving systems—and how to create the ability for the world to make deliberate choices about the future development of the technology.
Read the full post: https://t.co/XkYALsONft
Introducing Magenta RealTime 2 (MRT2): the live music model you can play as an instrument.
MRT2 offers MIDI and prompt controls, and runs natively on a MacBook with <200ms latency.
Open weights. Open source inference engine. Suite of apps and plugins.
Hear what it can do and try it out for yourself below 🧵
The speedup isn’t just in volume. On open-ended coding problems where answers are unclear, Claude’s success rate is now 76%—a 50 point jump in just 6 months.
Many engineers also say Claude’s code quality is now on par with human code; we expect it to be better within the year.
Building autonomous agents for scientific discovery? 🧬🤖
@GoogleDeepMind Science Skills is now available on GitHub. We've open-sourced this specialized toolkit to accelerate your agentic workflows with scientific grounding and higher token efficiency.
Download now ↓
https://t.co/cwp1HOeKvo
World Labs CEO Dr. Fei-Fei Li: "The world is not made of words."
"Language models have given machines an extraordinary command of concepts, vocabulary, and reasoning, but the physical world, virtual or real, runs on a different substrate."
"Where language models learn the statistical structure of text, world models learn the statistical structure of space and time: how light falls on a surface, how a garden looks from an angle no camera has captured, how objects respond to force and follow the laws of physics."
"Language gave machines a way to talk about that world. World models are how machines will finally come to understand, imagine, reason and interact with it."
Full piece: https://t.co/C9qOJg5wuc
Perplexity introduced Search as Code, a new search architecture for AI agents that writes Python to call Perplexity’s search stack directly.
Instead of making one tool call at a time, Search as Code lets agents compose search workflows in code: running queries in parallel, deduping results, filtering, joining, and ranking before information enters the model context.
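A rough sketch of the pattern described above, not Perplexity's actual API: the `search` function, the `Result` type, and the in-memory `_FAKE_INDEX` are all hypothetical stand-ins so the example runs offline. It shows queries fanned out in parallel, then deduplicated by URL, filtered by score, and ranked before anything would reach the model context.

```python
import asyncio
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    url: str
    title: str
    score: float


# Hypothetical in-memory index standing in for a real search backend.
_FAKE_INDEX = {
    "el nino forecast": [
        Result("https://example.com/a", "A", 0.9),
        Result("https://example.com/b", "B", 0.4),
    ],
    "el nino impacts": [
        Result("https://example.com/a", "A (duplicate)", 0.7),
        Result("https://example.com/c", "C", 0.8),
    ],
}


async def search(query: str) -> list[Result]:
    """Hypothetical search call; a real agent would hit a search API here."""
    await asyncio.sleep(0)  # yield control, as a network call would
    return _FAKE_INDEX.get(query, [])


async def search_workflow(queries, min_score=0.5, top_k=3):
    # Run all queries concurrently instead of one tool call at a time.
    batches = await asyncio.gather(*(search(q) for q in queries))

    # Filter out low scores and dedupe by URL, keeping the best-scoring copy.
    best = {}
    for batch in batches:
        for r in batch:
            if r.score < min_score:
                continue
            if r.url not in best or r.score > best[r.url].score:
                best[r.url] = r

    # Rank and truncate; only this compact list would enter the model context.
    return sorted(best.values(), key=lambda r: r.score, reverse=True)[:top_k]


def run_workflow(queries, **kwargs):
    return asyncio.run(search_workflow(queries, **kwargs))
```

Calling `run_workflow(["el nino forecast", "el nino impacts"])` returns the deduplicated, ranked results in one step, so the model sees a short list rather than every raw hit from every query.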
El Niño is arriving on our doorstep in the coming months with 90% certainty.
The world must treat it as the urgent climate warning it is.
The only effective response is #ClimateAction equal to the crisis – ending the addiction to fossil fuels, accelerating the shift to renewables, protecting the most vulnerable, and delivering early warning systems for all.
https://t.co/owmmCChyb3
This Executive Order is an important step in strengthening America’s leadership in AI.
We look forward to collaborating with the White House to support its implementation.
https://t.co/ZwDimPrp3t
"You can run OpenClaw inside your company now." Announcing our work with @Microsoft to bring OpenClaw to the Microsoft and Windows ecosystems. Claws now work securely in the enterprise.
We’re expanding Project Glasswing. We’ve extended access to Claude Mythos Preview to approximately 150 additional organizations, based in more than fifteen countries.
Read more about this expansion and our future plans for Project Glasswing: https://t.co/QrtHSBdRbh
Reminder: every Hugging Face Space is an API your agents can call :)
I asked mine to build a website about the flowers of France 🌸 and it used VAST AI's TripoSplat Space to turn photos it found into real 3D Gaussian splats, live on the page!
All on my HF Pro daily ZeroGPU credits (40 min/day renewed daily for only $9/month)
We’re transforming Google Antigravity into a scientific workbench. The new Science Skills bundle allows researchers to run complex workflows like protein analysis in minutes using specialized Alpha* models and 30+ major scientific databases.
Nous Research is working with NVIDIA to make Hermes Agent run smoothly on the new NVIDIA RTX Spark superchip.
Hermes Agent is also integrating with the new OpenShell runtime, which connects Hermes to Microsoft’s security primitives.
This is very big.
NVIDIA has a new open humanoid robot reference design built for robotics research.
The NVIDIA Isaac GR00T Reference Humanoid Robot.
The garage robot builders have a friend in NVIDIA.
Astute robot companies will be OPEN SOURCE.
ANTHROPIC JUST DROPPED A ZERO TRUST PLAYBOOK FOR AI AGENTS
and it's not theory it's architecture
frontier AI compresses vulnerability-to-exploit timelines from months to hours
your agents face threats traditional access controls were never built to handle:
▫️ prompt injection through external data sources
▫️ tool poisoning via MCP server metadata
▫️ memory-based privilege retention across sessions
▫️ multi-agent pivot attacks
the framework breaks it into 3 tiers: Foundation, Enterprise, Advanced
https://t.co/uDuO9cq25H