A couple years ago I had a conversation with ๐๐๐ซ๐ข๐จ about the future of AI and specifically Large Language Models (LLMs) and their potential capabilities...
https://t.co/i5s4yLhHhu
๐๐ง๐ญ๐ก๐ซ๐จ๐ฉ๐ข๐ just dropped an important essay on ๐ซ๐๐๐ฎ๐ซ๐ฌ๐ข๐ฏ๐ ๐ฌ๐๐ฅ๐-๐ข๐ฆ๐ฉ๐ซ๐จ๐ฏ๐๐ฆ๐๐ง๐ญ, and I think it is a serious contribution to one of the defining questions in AI. Though there is a critical distinction worth making.
https://t.co/GwjqXuHQWH
Open-weight models have overtaken closed models on OpenRouter.
69.1% of token volume now goes to open-weight models. 30.9% to closed.
Competition is a discovery procedure โ and developers are discovering the value of open models.
๐งต
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors.
Available today at the same price.
State of Enterprise AI 2026: @levie on Tokenmaxxing, The Rise of Headless, and AI-Proofing Your Job
00:00 Intro
01:18 Silicon Valley engineering vs. everyone else
05:35 Are enterprise CIOs actually bullish on AI?
08:51 Tokenmaxxing & why your AI bill is about to explode
11:34 The myth of falling token costs and AI spend escaping IT budgets
17:37 The $5B startup hiding in AI compute
18:14 The mosaic of models inside every enterprise
21:28 Why coding works and the rest of knowledge work doesn't
25:53 The Bob and Sally problem: access control breaks agents
30:31 Will enterprise AI really take 10 years to roll out
32:24 The capability overhang: why faster models slow diffusion
34:23 Data is the bottleneck (it always was)
39:02 The rise of internal forward-deployed engineers
41:23 Why the AI doomers are wrong about jobs
43:43 Headless software is inevitable
46:14 What replaces per-seat pricing
47:37 How Box itself is going headless
49:42 How the org chart actually evolves
1:00:33 Future-proofing yourself as an enterprise employee
1:06:40 Are we all just going to work for OpenAI and Anthropic?
1:07:11 Where startups can still win as the labs move up
ALL-IN POD IS LIVE! ๐จ
Massive show
Gavin Baker (@GavinSBaker) subs in for Sacks to talk:
-- Andrej Karpathy Joining Anthropic: Impact on the AI Race?
-- SpaceX S-1 Breakdown: The $2T Case, Elon Web Services, Datacenters in Space
-- Nvidiaโs Big Beat and Shock Selloff
-- Why America Has Turned on AI
-- Trump Pulls AI Order
-- Market Update: Inflation, Bond Crisis?
-- Did the US-China Summit Flop?
(0:00) Gavin Baker joins the show!
(0:30) Andrej Karpathy joins Anthropic; hypergrowth and profitability
(12:42) Why Americans have turned on AI, anti-human perception
(27:22) Trump pulls AI EO, US-China AI relationship, dystopian AI layoffs
(45:19) SpaceX S-1 tear down! Three major businesses and the case for $2T
(1:11:22) Nvidia smashes earnings but stock falls, why people are shorting chips
(1:22:25) Market update: Flashing red signals, oil, inflation, yields up
(1:32:45) China trip flops, or was progress made behind the scenes?
Today I probably tried one of the coolest tools since @OpenAI's chatgpt app. If you have not heard of @suno, you definitely need to check them out. Made some computer science nerds sound pretty cool - https://t.co/LmBFLZ7Vpt
New blog: The AI industry is debating "model vs. harness"... but that framing is already becoming outdated.
We mapped out the 4 generations of AI agent architecture:
Gen 1 >> Copilots (reactive, session-based)
Gen 2 >> Agent Runtimes (autonomous, RAG, tool aware)
Gen 3 >> Agent Operating Systems (persistent, context-rich)
Gen4 >> Cognitive OS (the harness teaches the model)
The real shift isn't smarter models or smarter wrappers. It's when execution becomes the source of future intelligence.
Read the full breakdown ๐
https://t.co/XhgGHSl3MA
#AI #AIAgents #LLM #GenerativeAI #MachineLearning #ArtificialIntelligence #AgenticAI
๐๐ฎ๐๐ฅ๐ข๐ ๐ฌ๐๐ซ๐ฏ๐ข๐๐ ๐๐ง๐ง๐จ๐ฎ๐ง๐๐๐ฆ๐๐ง๐ญ... Itโs 10pm. Do you know where your agent is?
Good parenting starts with Trust, Responsibility and Maturity. With that said, @LatentSpin ๐ข๐ฌ ๐ง๐จ๐ฐ ๐๐๐ 2 ๐๐๐ซ๐ญ๐ข๐๐ข๐๐... Weโre building the future of continuously learning AI agents, securely, responsibly, and at enterprise scale.
๐๐๐ญ๐๐ง๐ญ๐๐ฉ๐ข๐ง enables knowledge workers to teach agents in natural language, helping them move from collaboration to autonomy as they learn, adapt, and improve over time.
๐๐ง๐ญ๐๐ซ๐ฉ๐ซ๐ข๐ฌ๐ ๐๐ย will not just be about smarter models. It will be about trusted systems that can learn safely inside real workflows.
๐๐ซ๐จ๐ฎ๐ ๐จ๐ ๐ญ๐ก๐ ๐ญ๐๐๐ฆ ๐๐จ๐ซ ๐ซ๐๐๐๐ก๐ข๐ง๐ ๐ญ๐ก๐ข๐ฌ ๐ฆ๐ข๐ฅ๐๐ฌ๐ญ๐จ๐ง๐!
#KnowledgeWorkers #AgenticAI #ContinuousLearning #LatentSpin
The jump from AI tools to AI "coworkers" isn't just about better prompts. Itโs about how agents learn.
In our latest blog, @ThomasHazel explores how agents are moving beyond execution to actually choosing, using, and teaching models.
The era of the active-learning agent is here.
Read more: https://t.co/zGR77DFMFN
#AgenticAI #LLMs #LatentSpin #MachineLearning
๐๐ก๐ ๐ญ๐ก๐ซ๐๐ ๐๐ข๐ ๐ฉ๐ฅ๐๐ฒ๐๐ซ๐ฌ right now are ๐๐จ๐๐๐ฑ (OpenAI), ๐๐ฅ๐๐ฎ๐๐ (Anthropic), and ๐๐ฎ๐ซ๐ฌ๐จ๐ซ (SpaceX/xAI). Each has its own style and benefits. Codex focuses on execution, Claude focuses on reasoning, and Cursor focuses on keeping you in the flow - https://t.co/d6XF3ERiIS
Just read this.. Crazy numbers.
Anthropic added $35B of ARR in twelve months.
From $9B at the end of 2025 to $44B in May 2026. The curve is so steep investors are submitting allocations within 48 hours for a $50B round at a $900B+ valuation. That would top OpenAI.
$35B รท 365 = $96M of new ARR added every single day.
All about enterprise replacement. Procurement teams are swapping SaaS renewals for Anthropic API spend.
Introducing Claude Opus 4.7, our most capable Opus model yet.
It handles long-running tasks with more rigor, follows instructions more precisely, and verifies its own outputs before reporting back.
You can hand off your hardest work with less supervision.
Another week on the road meeting with a couple dozen IT and AI leaders from large enterprises across banking, media, retail, healthcare, consulting, tech, and sports, to discuss agents in the enterprise.
Some quick takeaways:
* Clear that weโre moving from chat era of AI to agents that use tools, process data, and start to execute real work in the enterprise. Complementing this, enterprises are often evolving from โlet a thousand flowers bloomโ approach to adoption to targeted automation efforts applied to specific areas of work and workflow.
* Change management still will remain one of the biggest topics for enterprises. Most workflows arenโt setup to just drop agents directly in, and enterprises will need a ton of help to drive these efforts (both internally and from partners). One company has a head of AI in every business unit that roles up to a central team, just to keep all the functions coordinated.
* Tokenmaxxing! Most companies operate with very strict OpEx budgets get locked in for the year ahead, so theyโre going through very real trade-off discussions right now on how to budget for tokens. One company recently had an idea for a โshark tankโ style way of pitching for compute budget. Others are trying to figure out how to ration compute to the best use-cases internally through some hierarchy of needs (my words not theirs).
* Fixing fragmented and legacy systems remain a huge priority right now. Most enterprises are dealing with decades of either on-prem systems or systems they moved to the cloud but that still havenโt been modernized in any meaningful way. This means agents canโt easily tap into these data sources in a unified way yet, so companies are focused on how they modernize these.
* Most companies are *not* talking about replacing jobs due to agents. The major use-cases for agents are things that the company wasnโt able to do before or couldnโt prioritize. Software upgrades, automating back office processes that were constraining other workflows, processing large amounts of documents to get new business or client insights, and so on. More emphasis on ways to make money vs. cut costs.
* Headless software dominated my conversations. Enterprises need to be able to ensure all of their software works across any set of agents they choose. They will kick out vendors that donโt make this technically or economically easy.
* Clear sense that it can be hard to standardize on anything right now given how fast things are moving. Blessing and a curse of the innovation curve right now - no one wants to get stuck in a paradigm that locks them into the wrong architecture. One other result of this is that companies realize theyโre in a multi-agent world, which means that interoperability becomes paramount across systems.
* Unanimous sense that everyone is working more than ever before. AI is not causing anyone to do less work right now, and similar to Silicon Valley people feel their teams are the busiest theyโve ever been.
One final meta observation not called out explicitly. It seems that despite Silicon Valleyโs sense that AI has made hard things easy, the most powerful ways to use agents is more โtechnicalโ than prior eras of software. Skills, MCP, CLIs, etc. may be simple concepts for tech, but in the real world these are all esoteric concepts that will require technical people to help bring to life in the enterprise.
This both means diffusion will take real work and time, but also everyoneโs estimation of engineering jobs is totally off. Engineers may not be โwritingโ software, but they will certainly be the ones to setup and operate the systems that actually automate most work in the enterprise.