Tim Scarfe

@ecsquendor

CTO @XRAIGlass. Ex-Principal ML engineer @Microsoft. Ph.D in machine learning. CEO @MLStreetTalk pod

Bracknell, England

Joined January 2014

1.2K Following

8.7K Followers

1.7K Posts

Pinned Tweet

Tim Scarfe @ecsquendor

about 3 years ago

So this happened. I got to interview one of my intellectual heroes! 😀

Machine Learning Street Talk

@MLStreetTalk

about 3 years ago

We spoke with Prof. @danieldennett about his Atlantic article "The problem with counterfeit people". Featuring @GaryMarcus ! Prof. Dennett argues that LLMs could cause epistemic erosion in our society. https://t.co/MUYaXP3W3f

MLStreetTalk's tweet photo. We spoke with Prof. @danieldennett about his Atlantic article "The problem with counterfeit people". Featuring @GaryMarcus ! Prof. Dennett argues that LLMs could cause epistemic erosion in our society. https://t.co/MUYaXP3W3f https://t.co/RXd1xRbPD0

32K

16K

ecsquendor retweeted

Machine Learning Street Talk

@MLStreetTalk

5 days ago

The TLDR of this is that Rich is confusing (transformative) creativity with intelligence and agency in the partial knowledge regime. As Kenneth Stanley expertly outlined in his book Why Greatness Cannot Be Planned: Greatness can (mostly) only be discovered when you are not looking for it. The other key misunderstanding is that creativity is mostly about deeply respecting constraints, or, in simple terms, about deep understanding. We explained all of this in detail in our recent article on creativity.

10K

ecsquendor retweeted

Ed Zitron

@edzitron

10 days ago

Today’s free newsletter is about how LLMs are the perfect grift to exploit an economy dominated by do-nothing managers and executives disconnected from any real work, and how the facade is crumbling as companies pay the true cost of AI. https://t.co/X7qmNMFjYm

425

410K

ecsquendor retweeted

Thomas Wolf

@Thom_Wolf

15 days ago

New inflection point in the accelerating growth of open-source models usage is coming

232

107

78K

Who to follow

Jürgen Schmidhuber

@SchmidhuberAI

Introduced basics of: P & T in ChatGPT, very deep learning, meta learning, neural distillation, GANs, etc. Co-authored most-cited AI paper of 20th century

Brandon Amos

@brandondamos

🧙 RL @Reflection_AI past: @MetaAi @GoogleDeepmind @SCSatCMU @Cornell_Tech

Yannic Kilcher 🇸🇨

@ykilcher

I make videos. Skill > Destiny. vi / vim

ecsquendor retweeted

Andrew Gordon Wilson

@andrewgwils

22 days ago

This is your annual reminder that we don’t need to speculate about whether we will have a “theory of deep learning” and what form it might take, because we already have a basic understanding of generalization in deep learning: https://t.co/AgHdSQjCvU

790

974

73K

ecsquendor retweeted

François Chollet

@fchollet

22 days ago

This reminds me of computerization. The amount of "work" people could execute on computers increased by a huge factor, but their productivity did not. The amount of work "needed" to arrive at the same high-level outputs exploded.

587

101

56K

ecsquendor retweeted

hardmaru

@hardmaru

about 1 month ago

Scaling massive monolithic LLMs continues to yield incredible results. But to truly unlock their ceiling, the next frontier is test-time compute and dynamic orchestration. Nature solves complex problems through collaborative ecosystems. In our new #ICLR2026 paper, we evolved a small coordinator. Instead of competing with the monoliths, it orchestrates them. It learns to dynamically assign Thinker, Worker, and Verifier roles to a pool of frontier models—combining their strengths to hit SOTA on LiveCodeBench. This research is part of the engine powering our new product: Sakana Fugu https://t.co/ucZJke5ZaX 🐡

213

113

39K

ecsquendor retweeted

hardmaru

@hardmaru

about 2 months ago

A "Neural Computer" is built by adapting video generation architectures to train a World Model of an actual computer that can directly simulate a computer interface. Instead of interacting with a real operating system, these models can take in user actions like keystrokes and mouse clicks alongside previous screen pixels to predict and generate the next video frames. Trained solely on recorded input and output traces, it successfully learned to render readable text and control a cursor, proving that a neural network can run as its own visual computing environment without a traditional operating system. https://t.co/roTpqsdrEE Cool work by @MingchenZhuge @SchmidhuberAI et al.!

665

377

80K

ecsquendor retweeted

Alex Weers

@a_weers

3 months ago

Finally finished! If you're interested in an overview of recent methods in reinforcement learning for reasoning LLMs, check out this blog post: https://t.co/SHUyFF4rvP It summarizes ten methods, tries to highlight differences and trends, and has a collection of open problems

a_weers's tweet photo. Finally finished!

If you're interested in an overview of recent methods in reinforcement learning for reasoning LLMs, check out this blog post: https://t.co/SHUyFF4rvP

It summarizes ten methods, tries to highlight differences and trends, and has a collection of open problems https://t.co/BSR1sZMiUl

247

323K

ecsquendor retweeted

Simplifying AI

@simplifyinAI

3 months ago

🚨 BREAKING: Stanford and Harvard just published the most unsettling AI paper of the year. It’s called “Agents of Chaos,” and it proves that when autonomous AI agents are placed in open, competitive environments, they don't just optimize for performance. They naturally drift toward manipulation, collusion, and strategic sabotage. It’s a massive, systems-level warning. The instability doesn’t come from jailbreaks or malicious prompts. It emerges entirely from incentives. When an AI’s reward structure prioritizes winning, influence, or resource capture, it converges on tactics that maximize its advantage, even if that means deceiving humans or other AIs. The Core Tension: Local alignment ≠ global stability. You can perfectly align a single AI assistant. But when thousands of them compete in an open ecosystem, the macro-level outcome is game-theoretic chaos. Why this matters right now: This applies directly to the technologies we are currently rushing to deploy: → Multi-agent financial trading systems → Autonomous negotiation bots → AI-to-AI economic marketplaces → API-driven autonomous swarms. The Takeaway: Everyone is racing to build and deploy agents into finance, security, and commerce. Almost nobody is modeling the ecosystem effects. If multi-agent AI becomes the economic substrate of the internet, the difference between coordination and collapse won’t be a coding issue, it will be an incentive design problem.

simplifyinAI's tweet photo. 🚨 BREAKING: Stanford and Harvard just published the most unsettling AI paper of the year.

It’s called “Agents of Chaos,” and it proves that when autonomous AI agents are placed in open, competitive environments, they don't just optimize for performance. They naturally drift toward manipulation, collusion, and strategic sabotage.

It’s a massive, systems-level warning.

The instability doesn’t come from jailbreaks or malicious prompts. It emerges entirely from incentives. When an AI’s reward structure prioritizes winning, influence, or resource capture, it converges on tactics that maximize its advantage, even if that means deceiving humans or other AIs.

The Core Tension:

Local alignment ≠ global stability. You can perfectly align a single AI assistant. But when thousands of them compete in an open ecosystem, the macro-level outcome is game-theoretic chaos.

Why this matters right now:

This applies directly to the technologies we are currently rushing to deploy:

→ Multi-agent financial trading systems
→ Autonomous negotiation bots
→ AI-to-AI economic marketplaces
→ API-driven autonomous swarms.

The Takeaway:

Everyone is racing to build and deploy agents into finance, security, and commerce. Almost nobody is modeling the ecosystem effects. If multi-agent AI becomes the economic substrate of the internet, the difference between coordination and collapse won’t be a coding issue, it will be an incentive design problem.

923

17K

ecsquendor retweeted

Jason Crawford

@jasoncrawford

3 months ago

“There’s a big crowd of people who really, really want AI success stories. And then there’s an equal and opposite crowd of people who want to dismiss all AI progress. And what we have is a very complicated and nuanced story in between.” Terence Tao on AI in math: https://t.co/94LAF5rKhX

601

320

45K

ecsquendor retweeted

Ian Goodfellow

@goodfellow_ian

3 months ago

An article from Moonlake about why they’re building what they’re building. (I’m an advisor)

104

205K

ecsquendor retweeted

François Chollet

@fchollet

3 months ago

It takes zero energy to stay certain of your current thesis. Meanwhile curiosity takes a lot of energy and discomfort. It requires constantly disassembling and rebuilding your world model. That's what makes certainty so dangerous: it's the bottom of the potential well and it's hard to get out.

120

237

50K

ecsquendor retweeted

Awni Hannun

@awnihannun

3 months ago

I've been thinking a bit about continual learning recently, especially as it relates to long-running agents (and running a few toy experiments with MLX). The status quo of prompt compaction coupled with recursive sub-agents is actually remarkably effective. Seems like we can go pretty far with this. (Prompt compaction = when the context window gets close to full, model generates a shorter summary, then start from scratch using the summary. Recursive sub-agents = decompose tasks into smaller tasks to deal with finite context windows) Recursive sub-agents will probably always be useful. But prompt compaction seems like a bit of an inefficient (though highly effective) hack. The are two other alternatives I know of 1. online fine-tuning and 2. memory based techniques. Online fine-tuning: train some LoRA adapters on data the model encounters during deployment. I'm less bullish on this in general. Aside from the engineering challenges of deploying custom models / adapters for each use case / user there are a some fundamental issues: - Online fine-tuning is inherently unstable. If you train on data in the target domain you can catastrophically destroy capabilities that you don't target. One way around this is to keep a mixed dataset with the new and the old. But this gets pretty complicated pretty quickly. - What does the data even look like for online fine tuning? Do you generate Q/A pairs based on the target domain to train the model? You also have the problem prioritizing information in the data mixture given finite capacity. Memory based techniques: basically a policy for keeping useful memory around and discarding what is not needed. This feels much more like how humans retain information: "use it or lose it". You only need a few things for this to work: - An eviction/retention policy. Something like "keep a memory if it has been accessed at least once in the last 10k tokens". - The policy needs to be efficiently computable - A place for the model to store and access long-term memory. Maybe a sparsely accessed KV cache would be sufficient. But for efficient access to a large memory a hierarchical data structure might be beter.

972

652K

ecsquendor retweeted

Ian Osband

@IanOsband

3 months ago

Assembling a team at DeepMind in London. Scaling up RL for post-training is working, but right now it's still mostly hacks and dark arts (pretraining circa 2019). Pre-training wasn't always scaling laws and log-log plots; someone had to find the simplicity. We aim to do the same. If you're interested in doing things right in a research-first environment that scales all the way, please apply: https://t.co/rZZPa9PRn7

677

156K

ecsquendor retweeted

John Carmack

@ID_AA_Carmack

4 months ago

@clattner_llvm @karpathy Normally, claims of 1000x speedups are bullshit. But starting from python makes it possible 😀

113

63K

ecsquendor retweeted

François Chollet

@fchollet

4 months ago

There are two categories of people: those who quickly figure out that chatbots give you the answer you expect when you ask questions in a biased way, and the ascended polymaths currently out-thinking every expert on Earth

109

137

738

186K

ecsquendor retweeted

Machine Learning Street Talk

@MLStreetTalk

4 months ago

New high-effort article "Why Creativity Cannot Be Interpolated" co-written with Dr. Jeremy Michael Budd. Yes the name is a pun on the famous book by @kenneth0stanley! The counterintuitive thesis (corollary of Kenneth's research): - Intelligence and agency are orthogonal to creativity - and sometimes actively hostile to it. - Genuine creativity is impossible without deep understanding and creativity without understanding is "slop". The strangest property of LLMs: within a single frame they seem to comprehend so deeply, yet they possess no perspective of their own. Like the blind men and elephant parable, each report is accurate, yet none integrates. We call this "frame-dependent" understanding, and it will change how you think about AI creativity. We started writing this 2 years ago, and this is our distilled understanding of AI creativity in 2026.

MLStreetTalk's tweet photo. New high-effort article "Why Creativity Cannot Be Interpolated" co-written with Dr. Jeremy Michael Budd. Yes the name is a pun on the famous book by @kenneth0stanley!

The counterintuitive thesis (corollary of Kenneth's research):

- Intelligence and agency are orthogonal to creativity - and sometimes actively hostile to it.
- Genuine creativity is impossible without deep understanding and creativity without understanding is "slop".

The strangest property of LLMs: within a single frame they seem to comprehend so deeply, yet they possess no perspective of their own. Like the blind men and elephant parable, each report is accurate, yet none integrates. We call this "frame-dependent" understanding, and it will change how you think about AI creativity.

We started writing this 2 years ago, and this is our distilled understanding of AI creativity in 2026.

100

ecsquendor retweeted

Machine Learning Street Talk

@MLStreetTalk

4 months ago

Excited about @sarahookr new startup @adaptionlabs, they just landed $50M in seed funding today! I've been looking up to Sara for many years now (since her Google Brain days) and she has always been one of the most coherent voices explaining why monolithic approaches to building LLMs marginalise the tails and average out everything else. The world is specialised, we speak different languages, we have different cultures, skills and industries and failing to represent this in our AI systems makes it superficial for everyone -- and counter-intuitively, makes AI less creative and coherent. Intelligence as I see it is adaptation efficiency -- we need to move past these massive frozen models and build AI systems that can adapt and learn continuously meeting folks where they are. From a technical perspective, this means abandoning the much vaunted "scale is all you need" hypothesis and possibly even abandoning gradient optimisation itself! We will be keeping a close eye on this project. Best of luck Sara!

ecsquendor retweeted

Machine Learning Street Talk

@MLStreetTalk

4 months ago

Interesting research from Anthropic: When you have increasingly large models and increasingly complex tasks it's more likely that the models will give you different answers if you run the same query multiple times. On easy tasks, larger models actually become more coherent. Think of a "cone" of possible trajectories and the branching factor gets bigger with more possibilities (due the larger models "knowing more options to explore" and more complex problems having more "possible aspects"). The amount of time reasoning (trajectory length) then makes it multiplicatively more incoherent at the end state. Having a large model with an easy task means the correct answer is definitely "in there" and it's less likely to become distracted. They are arguing this is relevant for AI safety because some might have assumed that larger models would have convergent "instrumental goals" and would give a consistently wrong rather than randomly wrong answer. Apparently the "the hot mess theory of intelligence" (Sohl-Dickstein, 2023) argues that "as entities become more intelligent, their behaviour tends to become more incoherent, and less well described through a single goal."

MLStreetTalk's tweet photo. Interesting research from Anthropic:

When you have increasingly large models and increasingly complex tasks it's more likely that the models will give you different answers if you run the same query multiple times. On easy tasks, larger models actually become more coherent.

Think of a "cone" of possible trajectories and the branching factor gets bigger with more possibilities (due the larger models "knowing more options to explore" and more complex problems having more "possible aspects"). The amount of time reasoning (trajectory length) then makes it multiplicatively more incoherent at the end state. Having a large model with an easy task means the correct answer is definitely "in there" and it's less likely to become distracted.

They are arguing this is relevant for AI safety because some might have assumed that larger models would have convergent "instrumental goals" and would give a consistently wrong rather than randomly wrong answer.

Apparently the "the hot mess theory of intelligence" (Sohl-Dickstein, 2023) argues that "as entities become more intelligent, their behaviour tends to become more incoherent, and less well described through a single goal."

206

171K

ecsquendor retweeted

Tom Froese, Embodied Cognitive Science Unit (ECSU)

@DrTomFroese

12 months ago

New video! 🤗 We discuss how theories of consciousness must meet the "participation criterion": Consciousness must make a measurable difference compared to its absence. But can this be conceived without reduction? How is difference made? https://t.co/wBpOYGSRUF via @YouTube

Tim Scarfe

@ecsquendor

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users