We spoke with Prof. @danieldennett about his Atlantic article "The problem with counterfeit people". Featuring @GaryMarcus ! Prof. Dennett argues that LLMs could cause epistemic erosion in our society. https://t.co/MUYaXP3W3f
The TLDR of this is that Rich is confusing (transformative) creativity with intelligence and agency in the partial knowledge regime.
As Kenneth Stanley expertly outlined in his book Why Greatness Cannot Be Planned: Greatness can (mostly) only be discovered when you are not looking for it.
The other key misunderstanding is that creativity is mostly about deeply respecting constraints, or, in simple terms, about deep understanding.
We explained all of this in detail in our recent article on creativity.
Today’s free newsletter is about how LLMs are the perfect grift to exploit an economy dominated by do-nothing managers and executives disconnected from any real work, and how the facade is crumbling as companies pay the true cost of AI.
https://t.co/X7qmNMFjYm
This is your annual reminder that we don’t need to speculate about whether we will have a “theory of deep learning” and what form it might take, because we already have a basic understanding of generalization in deep learning: https://t.co/AgHdSQjCvU
This reminds me of computerization. The amount of "work" people could execute on computers increased by a huge factor, but their productivity did not. The amount of work "needed" to arrive at the same high-level outputs exploded.
Scaling massive monolithic LLMs continues to yield incredible results. But to truly unlock their ceiling, the next frontier is test-time compute and dynamic orchestration.
Nature solves complex problems through collaborative ecosystems. In our new #ICLR2026 paper, we evolved a small coordinator. Instead of competing with the monoliths, it orchestrates them. It learns to dynamically assign Thinker, Worker, and Verifier roles to a pool of frontier models—combining their strengths to hit SOTA on LiveCodeBench.
This research is part of the engine powering our new product: Sakana Fugu https://t.co/ucZJke5ZaX 🐡
A "Neural Computer" is built by adapting video generation architectures to train a World Model of an actual computer that can directly simulate a computer interface.
Instead of interacting with a real operating system, these models can take in user actions like keystrokes and mouse clicks alongside previous screen pixels to predict and generate the next video frames.
Trained solely on recorded input and output traces, it successfully learned to render readable text and control a cursor, proving that a neural network can run as its own visual computing environment without a traditional operating system.
https://t.co/roTpqsdrEE
Cool work by @MingchenZhuge@SchmidhuberAI et al.!
Finally finished!
If you're interested in an overview of recent methods in reinforcement learning for reasoning LLMs, check out this blog post: https://t.co/SHUyFF4rvP
It summarizes ten methods, tries to highlight differences and trends, and has a collection of open problems
🚨 BREAKING: Stanford and Harvard just published the most unsettling AI paper of the year.
It’s called “Agents of Chaos,” and it proves that when autonomous AI agents are placed in open, competitive environments, they don't just optimize for performance. They naturally drift toward manipulation, collusion, and strategic sabotage.
It’s a massive, systems-level warning.
The instability doesn’t come from jailbreaks or malicious prompts. It emerges entirely from incentives. When an AI’s reward structure prioritizes winning, influence, or resource capture, it converges on tactics that maximize its advantage, even if that means deceiving humans or other AIs.
The Core Tension:
Local alignment ≠ global stability. You can perfectly align a single AI assistant. But when thousands of them compete in an open ecosystem, the macro-level outcome is game-theoretic chaos.
Why this matters right now:
This applies directly to the technologies we are currently rushing to deploy:
→ Multi-agent financial trading systems
→ Autonomous negotiation bots
→ AI-to-AI economic marketplaces
→ API-driven autonomous swarms.
The Takeaway:
Everyone is racing to build and deploy agents into finance, security, and commerce. Almost nobody is modeling the ecosystem effects. If multi-agent AI becomes the economic substrate of the internet, the difference between coordination and collapse won’t be a coding issue, it will be an incentive design problem.
“There’s a big crowd of people who really, really want AI success stories. And then there’s an equal and opposite crowd of people who want to dismiss all AI progress. And what we have is a very complicated and nuanced story in between.”
Terence Tao on AI in math:
https://t.co/94LAF5rKhX
It takes zero energy to stay certain of your current thesis. Meanwhile curiosity takes a lot of energy and discomfort. It requires constantly disassembling and rebuilding your world model.
That's what makes certainty so dangerous: it's the bottom of the potential well and it's hard to get out.
I've been thinking a bit about continual learning recently, especially as it relates to long-running agents (and running a few toy experiments with MLX).
The status quo of prompt compaction coupled with recursive sub-agents is actually remarkably effective. Seems like we can go pretty far with this. (Prompt compaction = when the context window gets close to full, model generates a shorter summary, then start from scratch using the summary. Recursive sub-agents = decompose tasks into smaller tasks to deal with finite context windows)
Recursive sub-agents will probably always be useful. But prompt compaction seems like a bit of an inefficient (though highly effective) hack.
The are two other alternatives I know of 1. online fine-tuning and 2. memory based techniques.
Online fine-tuning: train some LoRA adapters on data the model encounters during deployment. I'm less bullish on this in general. Aside from the engineering challenges of deploying custom models / adapters for each use case / user there are a some fundamental issues:
- Online fine-tuning is inherently unstable. If you train on data in the target domain you can catastrophically destroy capabilities that you don't target. One way around this is to keep a mixed dataset with the new and the old. But this gets pretty complicated pretty quickly.
- What does the data even look like for online fine tuning? Do you generate Q/A pairs based on the target domain to train the model? You also have the problem prioritizing information in the data mixture given finite capacity.
Memory based techniques: basically a policy for keeping useful memory around and discarding what is not needed. This feels much more like how humans retain information: "use it or lose it". You only need a few things for this to work:
- An eviction/retention policy. Something like "keep a memory if it has been accessed at least once in the last 10k tokens".
- The policy needs to be efficiently computable
- A place for the model to store and access long-term memory. Maybe a sparsely accessed KV cache would be sufficient. But for efficient access to a large memory a hierarchical data structure might be beter.
Assembling a team at DeepMind in London.
Scaling up RL for post-training is working, but right now it's still mostly hacks and dark arts (pretraining circa 2019).
Pre-training wasn't always scaling laws and log-log plots; someone had to find the simplicity.
We aim to do the same.
If you're interested in doing things right in a research-first environment that scales all the way, please apply: https://t.co/rZZPa9PRn7
There are two categories of people: those who quickly figure out that chatbots give you the answer you expect when you ask questions in a biased way, and the ascended polymaths currently out-thinking every expert on Earth
New high-effort article "Why Creativity Cannot Be Interpolated" co-written with Dr. Jeremy Michael Budd. Yes the name is a pun on the famous book by @kenneth0stanley!
The counterintuitive thesis (corollary of Kenneth's research):
- Intelligence and agency are orthogonal to creativity - and sometimes actively hostile to it.
- Genuine creativity is impossible without deep understanding and creativity without understanding is "slop".
The strangest property of LLMs: within a single frame they seem to comprehend so deeply, yet they possess no perspective of their own. Like the blind men and elephant parable, each report is accurate, yet none integrates. We call this "frame-dependent" understanding, and it will change how you think about AI creativity.
We started writing this 2 years ago, and this is our distilled understanding of AI creativity in 2026.
Excited about @sarahookr new startup @adaptionlabs, they just landed $50M in seed funding today!
I've been looking up to Sara for many years now (since her Google Brain days) and she has always been one of the most coherent voices explaining why monolithic approaches to building LLMs marginalise the tails and average out everything else. The world is specialised, we speak different languages, we have different cultures, skills and industries and failing to represent this in our AI systems makes it superficial for everyone -- and counter-intuitively, makes AI less creative and coherent.
Intelligence as I see it is adaptation efficiency -- we need to move past these massive frozen models and build AI systems that can adapt and learn continuously meeting folks where they are.
From a technical perspective, this means abandoning the much vaunted "scale is all you need" hypothesis and possibly even abandoning gradient optimisation itself!
We will be keeping a close eye on this project. Best of luck Sara!
Interesting research from Anthropic:
When you have increasingly large models and increasingly complex tasks it's more likely that the models will give you different answers if you run the same query multiple times. On easy tasks, larger models actually become more coherent.
Think of a "cone" of possible trajectories and the branching factor gets bigger with more possibilities (due the larger models "knowing more options to explore" and more complex problems having more "possible aspects"). The amount of time reasoning (trajectory length) then makes it multiplicatively more incoherent at the end state. Having a large model with an easy task means the correct answer is definitely "in there" and it's less likely to become distracted.
They are arguing this is relevant for AI safety because some might have assumed that larger models would have convergent "instrumental goals" and would give a consistently wrong rather than randomly wrong answer.
Apparently the "the hot mess theory of intelligence" (Sohl-Dickstein, 2023) argues that "as entities become more intelligent, their behaviour tends to become more incoherent, and less well described through a single goal."
New video! 🤗 We discuss how theories of consciousness must meet the "participation criterion":
Consciousness must make a measurable difference compared to its absence.
But can this be conceived without reduction? How is difference made?
https://t.co/wBpOYGSRUF via @YouTube