Microsoft AI CEO and DeepMind co-founder Mustafa Suleyman says AI training compute will grow 1000x over the next 3 years
When you consider how powerful the top models already feel today, imagining what AI looks like with another 1000x advancement in compute just three years from now is mind-boggling
Introducing https://t.co/ycxJEf1z7w!
A new space where I explore how the best apps in the world are built.
First piece:
How's Linear is so fast? a technical breakdown.
https://t.co/9Vu1syrn1i
I've been hacking on @podpowerxyz for over a year with one goal: keep up with podcasts without listening to all of them
Podcasts are high-signal but insanely time-consuming. PodPower should let you understand any episode in 60 seconds or less.
After a lot of experimentation, I've found a format that I think does the job well.
Episode Atlas is a per-episode artifact that lays out everything an episode covers in a way you can quickly and comprehensively scan.
→ Overview
→ Core themes
→ Key concepts
→ Notable quotes
→ Chapters with timestamps
→ Takeaways
→ Questions explored
→ Glossary
→ People mentioned
→ Threads to pull on
Generate one for any episode in the feed. It's 100% free.
And a bonus feature: one-click copy the whole Episode Atlas into ChatGPT, Claude, or whatever agent you use and keep the conversation going there
Give it a try and let me know what you think
Introducing PodPower's Episode Atlas
A complete map of any podcast episode
Themes, key quotes, chapters, takeaways, people mentioned. Generated for every episode.
Decide in 60 seconds whether a 3-hour episode is worth your time 👇
Interaction models as introduced by @thinkymachines are heading exactly towards the shift I describe in the quoted post below.
The interaction model is in constant exchange with the users via audio and video.
When a task requires deeper reasoning than can be produced instantaneously, the interaction model delegates to a background model that runs asynchronously.
The interaction model remains present throughout, answering follow ups, taking new input, holding the thread, and it integrates background results into the conversation as they arrive.
IMO the demos are still a little bit awkward, but you can see the direction and just imagine how quickly they will improve.
It won't be very long until we're sitting around talking and gesturing to our computer as if it's another coworker.
"You spend so much of your effort right now just explaining to your computer what's going on. Why are you explaining to your computer what's going on? That makes no sense."
I think this is the beginning of a much bigger shift in how we use AI.
We're probably not too far away from computers becoming ambient and multimodal:
• always listening
• always watching your screen
• forming memories
• understanding context
• orchestrating agents and tools in the background
You'll stop "prompting" your computer and start working alongside it.
Then this expands beyond individual computers into AI native workspaces, meeting rooms, and environments where humans and AI collaborate continuously in real time.
Those futuristic sci-fi command center scenes slowly start becoming reality.
At that point, token usage won't just come from chat prompts. It'll come from the total surface area of human work itself.
Trillions upon trillions of tokens constantly flowing through memory, reasoning, vision, audio, simulations, and agents.
I guess we're going to need more compute.
There are more than four million books for sale in the English language.
I've always felt a little guilty about starting books and not finishing them, like it's a bad habit, and that I should finish what I started.
But this passage from @morganhousel 's Art of Spending book has flipped that idea on its head for me and I really like it.
Housel says "I want to start reading as many books as possible, but finish few of them"
Why?
Because there are so many good books out there. You should be willing to start any book that looks mildly interesting. If it doesn't work for you within 10 minutes, move on! No guilt, no shame, no obligations.
And it connects to larger idea: The only way to find what you're going to like in life is by trying new things.
So try as many as possible and move on quickly if it's not for you
This works really well btw, at the end of your query ask your LLM to "structure your response as HTML", then view the generated file in your browser. I've also had some success asking the LLM to present its output as slideshows, etc.
More generally, imo audio is the human-preferred input to AIs but vision (images/animations/video) is the preferred output from them. Around a ~third of our brains are a massively parallel processor dedicated to vision, it is the 10-lane superhighway of information into brain. As AI improves, I think we'll see a progression that takes advantage:
1) raw text (hard/effortful to read)
2) markdown (bold, italic, headings, tables, a bit easier on the eyes) <-- current default
3) HTML (still procedural with underlying code, but a lot more flexibility on the graphics, layout, even interactivity) <-- early but forming new good default
...4,5,6,...
n) interactive neural videos/simulations
Imo the extrapolation (though the technology doesn't exist just yet) ends in some kind of interactive videos generated directly by a diffusion neural net. Many open questions as to how exact/procedural "Software 1.0" artifacts (e.g. interactive simulations) may be woven together with neural artifacts (diffusion grids), but generally something in the direction of the recently viral https://t.co/z21CP5iQfu
There are also improvements necessary and pending at the input. Audio nor text nor video alone are not enough, e.g. I feel a need to point/gesture to things on the screen, similar to all the things you would do with a person physically next to you and your computer screen.
TLDR The input/output mind meld between humans and AIs is ongoing and there is a lot of work to do and significant progress to be made, way before jumping all the way into neuralink-esque BCIs and all that. For what's worth exploring at the current stage, hot tip try ask for HTML.
A couple months ago, I had an idea for an app that I wanted to show a few friends. It was pretty data intensive, but instead of trying to explain it, I wanted to show them what it could look and feel like.
I ended up asking Claude to mock up a non-functional version of it, and it went with a plain single HTML file. It worked exceptionally well.
Then I asked it to give a brief overview of the idea, leading into the demo app.
What I ended up with was, to me, a surprisingly super effective way to share an app idea, without any of the standard rigamarole of scaffolding a modern web app.
You can view it here: https://t.co/bCwtKzvcDY
"You spend so much of your effort right now just explaining to your computer what's going on. Why are you explaining to your computer what's going on? That makes no sense."
I think this is the beginning of a much bigger shift in how we use AI.
We're probably not too far away from computers becoming ambient and multimodal:
• always listening
• always watching your screen
• forming memories
• understanding context
• orchestrating agents and tools in the background
You'll stop "prompting" your computer and start working alongside it.
Then this expands beyond individual computers into AI native workspaces, meeting rooms, and environments where humans and AI collaborate continuously in real time.
Those futuristic sci-fi command center scenes slowly start becoming reality.
At that point, token usage won't just come from chat prompts. It'll come from the total surface area of human work itself.
Trillions upon trillions of tokens constantly flowing through memory, reasoning, vision, audio, simulations, and agents.
I guess we're going to need more compute.
The weirdest part of using AI today is that you have to narrate reality to it
Paste this file
Explain this meeting
Describe this bug
Remind it what you were doing
@gdb says Chronicle flips that: the AI sees your workflow, remembers the context, and already knows what you mean
The fully sovereign AI agent stack is emerging:
Open harnesses (Hermes, Pi, OpenClaw, Deep Agents)
You own the orchestration and can swap anything underneath. No random updates that silently degrade your workflows. Watch @badlogicgames' talk "I Hated Every Coding Agent, So I Built My Own" to dive deeper
Open weights models (Qwen, Gemma, GLM)
Portable across providers and runnable locally. Nothing opaque. No one can lock you out or secretly degrade performance. GLM-5.1 is the first open model to lead SWE-Bench Pro, narrowly beating Opus 4.6 and GPT-5.4 at ~1/4 the price
Local memory (Postgres, SQLite, filesystem)
Your context, preferences, and history stay on your machine. Interesting portable approaches for skills, memories, and workflows are emerging. See this article from @Av1dlive: https://t.co/SPNY8kMttt
Local inference (GPUs, Apple Silicon, any device running Ollama)
No worrying about API costs. No metering of tokens. Your data can stay on your network
This stack should be the goal for builders who don't want to rent tools and intelligence
For me, local inference is still the main constraint, but looking forward to increasingly accessible local compute
Some of the coolest open source repos I've been coming across lately don't contain any code.
Skills, workflows, prompts, ideas, evals.
No code, just plain english.
People are starring the repos, forking them, and building on top of them the same way they used to with libraries and frameworks.
This trends in line with the new-ish ideas of ghost libraries and prompt requests.
For a long time, the scarce thing was implementation. So the artifact of value was the written code. But implementation is getting cheaper and faster every week. The scarce thing is increasingly becoming taste, judgment, and the ability to specify what "good" actually means.
This is why specs, prompts, workflows, and evals suddenly feel so powerful.
A well-written AGENTS.md file might be worth more than the code it produces.
Another interesting trend from the AI engineering wiki to keep an eye on:
Ghost Libraries
Imagine popular libraries in the future existing only as specs.
Rather than npm installing maintained code, you only pull a spec.
So where's the code?
Your local agent creates the implementation in seconds, tailoring it to your specific codebase.
The benefits?
No more abandoned, incompatible, or outdated packages.
No more waiting on maintainers to fix bugs.
No more forking a library to change just one thing.
No more inheriting a never-ending tree of dependencies you never asked for.
The agent generates exactly what you need, in your language, following your project's conventions and guidelines.
This might sound hypothetical but it's actually already happening. The team at OpenAI built and shared Symphony this way.
This obviously won't work for everything. Some libraries are large and complex, and have years of accumulated bug fixes can't be rediscovered from a spec. But for many dependencies, adapters, SDKs, etc the economics might soon make sense
Using @karpathy's LLM wiki idea, I started putting together a wiki for AI engineering, covering topics like models, harnesses, context, memory, embeddings, and more.
It's truly an excellent way to learn. Every new piece of content ingested opens up multiple paths for further exploration, which quickly leads you down all kinds of rabbit holes.
One interesting thing I came across today:
The crowdsourcing of open agent trace data
In a nutshell:
When you use an AI coding agent, every session produces a trace. This is the full record of prompts, tool calls, reasoning, errors, fixes, and everything else along the way. It is essentially the agent's inner dialogue and process as it does the work.
Leading closed-source model providers like OpenAI and Anthropic collect millions of these traces from users of their products and use them to train better models.
Each improvement drives more usage and generates even more training data. It is a flywheel that creates a massive competitive moat for them.
But now the open source community is starting to build the plumbing to do the same thing: capture agent sessions, scrub PII, publish them as open datasets, and use sampling to identify the traces that are actually high signal.
If it works as intended, it could end up being a key factor in improving open models and helping close the gap with proprietary ones
If you want to dive deeper, check out the post below from @badlogicgames announcing pi-share-hf
Putting my tokens where my mouth is. I built pi-share-hf. Share your pi coding agent sessions as @huggingface datasets.
https://t.co/OJsAGevg1L
It tries to prevent you from uploading sessions containing PII/sensitive data with 3 tiers of defenses. Best used on OSS coding sessions, as those are less likely to contain sensitive info.
Uses pi agents for PII detection, which can cost you a lot of tokens. Read the README with your human eyes so you don't accidentally pwn yourself or get a huge bill.
Haven't figured out how to filter for such datasets on HF yet. @ClementDelangue, any pointers on how to best label them so people can find them?
Crypto already has enough detractors from outside the industry trying to bring it down
Sad to see builders tearing down other builders within the space just for clout
Just tested this out and wow
A project I'm working on at @SkyEcosystem has been defining metrics we'd like to track onchain. We've been debating if we need to find a data analyst to help capture all of these metrics
I see @Dune announces MCP
I connect with Claude and put one metric in and ask for a chart
Two minutes later it spits out exactly what I need
Dune MCP is live 🔌
Plug Dune directly into @claudeai, @ChatGPTapp, @cursor_ai, and more.
Search tables. Write queries. Build charts. Check Usage.
All from a single prompt. 💻
Your AI just became a Dune power user.
New Vaults page is live on the Sky Money App, plus two new @Morpho Vaults:
• USDS Flagship: 80% Sky Savings Rate + 20% stUSDS, cbBTC, wstETH & ETH markets
• USDT Savings: Sky Savings Rate exposure via sUSDS/USDT market
Plus incentives.
It feels weird to say as a heavy Anthropic user but I still struggle to see a world where Google doesn't win in the long-term
Their models are arguably just as strong, they're shipping non-stop, and the switching cost is very low. These tools are not cloud-level lock in. Moving from Claude Code to another CLI tool can be done in a day. In the time between Opus 4.1 and Opus 4.5, Codex CLI came out with gpt5 and, to me, felt like the stronger option. I was still paying for Claude Code but found myself reaching for Codex more often.
And this doesn't even consider distribution, training data, or verticial integration moats. Anthropic rightly probably feels on top of the world at the moment, but I'd still be terrified of Google in the rear view mirror
Took a look, and also appreciated your thread on the topic. Impressive engineering work. Curious which add'l connectors you plan to add in the short term. Might try building one myself if I find some time. X feels like an obvious one but I imagine it's also a challenging and brittle to build for as well
My notes from Poor Charlie's Almanack: The Wit and Wisdom of Charlie Munger turned into maxims:
1. Find a simple idea and take it seriously.
2. Good ideas are rare. When you find one bet heavily.
3. Humans have been writing down their best ideas for 5,000 years. Read them.
4. Avoiding stupid mistakes is more important than being smart.
5. Don’t work with anyone you don’t admire.
6. Don’t sell anything you wouldn’t buy.
7. Avoiding a bad habit is easier than breaking a bad habit.
8. Work on your best idea. Don't diversify
9. Incentives rule everything around you. Look for them.
10. Great businesses are built by going ridiculously far in maximizing or minimizing one or a few things. Think Costco.
11. Learning is changing behavior.
12. Do the unpleasant tasks first.
13. Charlie has read hundreds of biographies. Do the same.
14. Stop multitasking. Concentrate.
15. Many hard problems are solved best when approached backwards.
16. Think of ideas as tools. When a better tool comes along use it.
17. Clip your business and personal expenses. Small leaks sink big ships.
18. Make friends with smart dead people. Adam Smith, Darwin, Cicero, Ben Franklin —whoever interests you. Read their writing. Steal their ideas. They don’t need them anymore.
19. Don't confuse intelligence with invincibility.
20. Bad things will happen to you. It’s inevitable. When they do get up and keep going and remember the next maxim.
21. Self pity has no utility.
22. Find out what you are best at. Then pound away at it. Forever.
23. Only plays games where you have an edge.
24. Avoid mob rule. Avoid demagogues. Avoid dogma. Avoid bureaucracy.
25. Optimize for independence.
25. Use money to buy freedom.
26. Develop durability.
27. What do you have an *intense* interest in? Do that for money.
28. Self improvement has no end.