The accountant in the corner will outship your best engineers.
Because they'll automate what engineers never thought to touch.
Breakthroughs come from everywhere, if you give people tokens to experiment and safety to fail.
— @bcherny
Law professors wrote questions they were asked during office hours. Gemini 2.5 & humans answered them then other law professors blindly judged the results:
-Gemini had a 75% win rate vs. professors
-Gemini's answers were rated LESS harmful than humans
-Newer models do even better
I find debates over whether companies find AI useful to be odd at this point
I talk to leadership teams at lots of big firms, and it is pretty universal that they are getting obvious and real value. The challenges now are going from individual uses to firm-level & how to scale.
Imagine replacing 90% of your employees with a team of geniuses who have no idea how your company operates.
Total chaos. Nothing works.
That’s what AI feels like today.
The missing piece is extracting all the domain knowledge from people’s heads and providing that as structured context to the models.
Anthropic engineer:
"You can build 5 assistants in one afternoon. Each one handles a task you've been doing manually every single day."
In 45 minutes he builds 5 focused agents from scratch on camera.
Most people are still doing code review, testing, and documentation by hand every single day
Watch the session, then save all templates below 👇
The 90s PC paradox is repeating. Zero productivity gains, because companies bolted the technology on instead of restructuring around it.
AI is no different. The winners won't experiment most. They'll rebuild from the center out.
Ft. @bcherny@Kantrowitz@AnthropicAI
This is effectively the #1 problem for AI agents in the enterprise.
As we go from agentic coding (where a large amount of context is in the code base, and users are technical enough to get the rest to the agent easily) to a world of knowledge work agents, the context problem becomes much more acute.
We see this every day with customers at Box. For existing digital knowledge, it’s often fragmented across legacy systems or environments that don’t play nice with agents, and have access controls that don’t map to the real work that needs to be done, which become a huge hurdle for getting agents the context they need. This has to all get moved to modern, secure cloud environments.
But also, companies often haven’t captured and digitized some of the critical context that agents need to work with. Decisions, processes, and workflows often live in people’s heads and tribal knowledge that need to get turned into unstructured data for agents.
This is actually one of the biggest points of leverage for applied AI companies, because they can work to specialize in getting agents exactly the information and domain expertise they need. But it’s also one of the reasons why FDEs and new system integrator plays will also work so well right now.
The companies that figure this out will be able to get the most out of AI going forward.
The CMO's real AI question isn't "what can it do?"
It's "what should we stop doing manually so we can focus on what matters?"
Subtraction before addition.
Hot take: Most AI "transformations" are just automation projects with better PR.
That's fine! Automation is valuable.
Just don't mistake it for competitive advantage.
It does seem like meaningfully better AI releases are accelerating, especially from OpenAI & Anthropic.
To illustrate, I caused this timeline to be created. It only lists new models that scored 3 points or higher over previous models in the Artificial Analysis index.
@MaMoMVPY There is a dramatically widening gap between those who use this “prediction machine” and those who don’t.
I don’t care if you call it intelligence or ignorance. The amount of valuable output it produces is simply unprecedented
It has gone largely unnoticed that the chief enforcement official, for the SEC, Margaret Ryan, left suddenly last week.
She had been restricted from investigating sketchy trades and investments involving the Trump family and close associates.
The corruption in the Trump administration is insane. If Congress were doing its job, they’d be all over this. But Mike Johnson has turned Congress into a rubber stamp for Trump.
FT Exclusive: Traders made bets worth half a billion dollars in the oil market about 15 minutes before Donald Trump’s post touting 'productive' talks with Iran sent the price of crude tumbling https://t.co/KVa0cZRLn0