Built a Claude Code skill this week that I didn't expect to need: I call it "/codex-opinion."
What it does is take whatever Claude just produced, send the full context over to Codex (OpenAI's CLI agent), and ask Codex what it thinks. Get a second model to grade the first model's work.
Why I built it: when I'm coding with Claude and it's been on a task for a while, I lose the ability to QA the output. It's confident. It's articulate. It tells me clean stories about what it did. But I can't always tell if those stories are true.
A different model, with different training and different biases, will often catch what Claude missed. And vice versa.
The interesting wrinkle is what I had to fix in my skill. When I first wrote it, Codex would mostly just agree with Claude, because the prompt I was passing in unintentionally led the witness. I was framing Claude's work as "the answer," not as "a proposal." Once I rewrote the skill to present it neutrally, Codex started catching real problems.
This is going to be a real pattern, I think. Not one agent doing the work. Multiple agents checking each other, with prompts carefully designed so they don't just rubber-stamp each other. Now I have a "/gemini-opinion" skill too.
We're going to spend the next two years rediscovering peer review, but for LLMs.
Did a software audit for a client this month. They have around 200 people. The list of paid software subscriptions came back longer than I expected, and I work in tech.
The pattern is always the same. Someone in one department bought a tool three years ago. Someone in another department bought a nearly identical tool last year. Neither knew about the other. A third tool covers 80% of both, and they're already paying for it as part of their Microsoft license.
A partner I was talking to said his litigation team has a contract redlining tool, and he discovered their general contract platform already had that exact feature built in. Nobody read the release notes.
This is the part of "AI strategy" nobody wants to talk about. Before you go buy the new shiny tool, you almost certainly already own the capability somewhere. M365 ships with Forms, Planner, Whiteboard, Loop, and most companies have never touched any of it. Likewise, Google Workspace has many of the same capabilities.
My rule with clients now: before we evaluate a single new AI tool, we do a full audit of what's already paid for. Nine times out of ten we find at least one tool that could be turned off, and at least one capability they already own that they thought they needed to buy.
The biggest AI ROI in 2026 might just be a spreadsheet of your current subscriptions.
Anyone else find duplicate tools when they actually looked?
Caught a weird AI failure mode at a client this week and I want to name it.
They asked an LLM to write a profile of someone whose personal life isn't documented online. Instead of saying "I don't have that information," the model wrote something like: "He keeps his personal life private, choosing instead to focus public attention on his professional work."
Think about that sentence. It looks like a fact. It reads like a fact. It's actually the model's way of saying "I don't know," dressed up in confident prose.
I've now seen variations of this everywhere. "While specific details are limited, this likely reflects..." "Information on this topic is not widely available, but it represents..." These are not statements. They're hallucinations wearing a suit.
The scary part is how persuasive they are. If you didn't know the model had no source, you'd assume it did.
The practical implication for anyone using AI in their work: when an LLM tells you something with no specifics, no numbers, no names, no dates, treat that as a confession, not a finding. The vagueness is the tell.
I now run every AI-generated summary through one filter: would this sentence still make sense if I deleted every adjective? If the answer is no, the model is probably bluffing.
This trick has saved me a couples times recently.
I keep telling people this and I don't think it lands until they experience it.
AI is a multiplier for your own productivity. That's table stakes. Everyone gets that.
But here's what's less obvious: if you only do one thing, AI multiplies that one thing. If you do a bunch of different things, the multiplying effect compounds across all of them.
I was talking to someone who runs an organization with a bunch of different business lines. Around $100 million in revenue, no real tech team. And my pitch to him was: you're actually the ideal AI candidate, not despite having a bunch of different businesses, but because of it.
I see this in my own work. I run a consulting firm, co-manage a venture fund, and do a bunch of other stuff. When I create something for one context, I can repurpose the thinking for another. I had Claude building a presentation in the background during one meeting. When it finished, I passed it to another AI tool that turned it into a website. The whole pipeline took minutes of my active attention.
Someone I work with described the same thing. He's giving a talk on Wednesday. He has Claude creating the presentation while he's in a meeting about something completely different. Once it's done, he feeds it into another tool that produces a website, prompts, and supporting materials. He always builds a website now for talks he gives.
For people who wear multiple hats, AI doesn't just save time on tasks. It gives you leverage across contexts.
The old advice was to focus. Do one thing well. That's still good advice for your company.
But for the individual operator? Doing lots of different things might actually be an AI superpower.
Sat in a meeting this week where smart people spent 30 minutes debating a company name.
I used to do this too. On my first startup, I think we burned a full week on it. We were convinced the name was going to make or break the brand.
It didn't. The product made or broke the brand. The name was just a sound the customers eventually got used to.
Look at the most successful companies you can think of. Half of them have names that are kind of weird, kind of meaningless, or both. Google. Spotify. Stripe. Beehiiv (with the two i's nobody can ever remember how to spell).
None of them won because of the name.
My current rule of thumb: pick something you can say out loud at a dinner party without explaining, that has an available domain, and that won't get you sued by Apple. Then move on.
Same thing with logos. I've seen early-stage founders spend $10K and three months on a logo they end up redoing 18 months later anyway. (Ok, maybe I was one of those founders)
The stuff founders agonize over is rarely the stuff that ends up mattering. Save the obsession for the product and the go-to-market.
My productivity outside of meetings is probably 10-20x what it was a year ago.
Inside meetings? Still 1x.
I had this epiphany last week. We've always said meetings are expensive. But in the age of AI, they're exponentially more expensive. Because every hour I spend in a meeting is an hour where I could be getting 10x more done outside of it.
So I started building something. The idea is simple: I'm already recording and transcribing every meeting. What if the AI could just act on what I say in real time?
Like, I'm talking to someone and I say, "Oh yeah, I need to send you that consulting agreement." And instead of me writing that down as a to-do item that I get to three hours later... it just happens.
Right now I've built a system where my meeting transcripts automatically get tagged by client, downloaded to the right folder on Google Drive, and at the end of the day a script pulls out all the action items and creates Google Tasks. End of week, another script goes through everything and drafts LinkedIn posts from stories I told in meetings.
But the next frontier is making meetings themselves productive. Not just capturing what happened. Actually executing during the conversation.
I've started doing something funny in meetings. I'll pause and repeat something back, not for the person I'm talking to, but for the AI that's listening. "So what you're saying is you want me to send that by Friday." I'm not clarifying. I'm issuing instructions to the transcript.
The economist Tyler Cowen said he wrote his last book for the AIs, because they'll be the only ones reading books in the future. He wanted them to remember him fondly.
I'm starting to talk in meetings the same way. Not for the person across from me. For the system that's going to do the work afterward.
Is that weird? Probably. But my to-do list has never been cleaner.
78% of business leaders aren't confident they'd pass an AI governance audit in the next 90 days.
That stat came out of an event my network was buzzing about this week. And honestly, it tracks with what I see at ACG.
Most mid-market companies have started doing something with AI. A pilot here, a tool there. Someone bought a ChatGPT Enterprise license. Maybe a vendor bundled AI into the software you already use.
But governance? Nobody touched it.
Here's why that matters: when AI is just scattered tools with no oversight, you don't actually know what data is going where, who approved what, or whether any of it is working. That's a real problem, not a theoretical one.
I'm not saying you need a 40-page policy document before you're allowed to use AI. That's how you get nothing done.
But you do need three basic things in place:
1. A list of which AI tools are actually being used across your company (most leaders I talk to are surprised by this number)
2. Clear rules on what data can go into those tools
3. One person accountable for keeping that list current
That's it to start. You can build from there.
The companies that will win with AI aren't the ones who move fastest. They're the ones who build enough structure to move fast without breaking things they care about.
If you're a CEO or COO and you genuinely don't know what AI tools your team is using right now, that's your first problem to solve. Start there.
If every AI lab paused innovation tomorrow, most companies would still have 5-7 years of work just absorbing what already exists.
That's the part nobody talks about. The headlines are all about the next model. Faster. Smarter. More agentic. Meanwhile most companies are using maybe 5% of what's already shipping.
I see it every week. Teams that haven't tried Claude on their messy data. Finance leaders who've never had AI summarize a board deck. Operators still copying numbers between spreadsheets when an agent could do it in 90 seconds. Not because the tools are missing. Because nobody pointed them at the work.
The capability curve is exponential. The adoption curve is linear, and it's specific to each company. The pressure isn't that AI moved too fast. It's that almost nobody has sat down and asked which of their daily tasks could already be done by something that exists today.
The companies pulling ahead right now aren't waiting for the next model. They're auditing what they already have access to and putting it to work.
You can count on the models only getting better from here.
If you ran a 60-minute audit of your own week, what percentage of your tasks could be handled by tools you already pay for?
Curious what people are finding.
Sat in on a conversation at a client last week about whether new college hires should have access to AI tools from day one.
The usual debate. Some leaders want them to use it because that's what the job will actually look like. Others worry the new people will never develop the underlying skills if they lean on AI before they understand the fundamentals.
Then one manager floated something I hadn't heard before. What if new hires had to earn the right to use AI?
First month, no AI. You have to demonstrate you can do the work, understand the reasoning, defend your decisions, all without the tools. Prove you grasp what you're actually doing. Then after you've shown you understand the role, AI unlocks.
Like a progression system, but for tooling.
It's kind of like what the country of Malta recently announced for their citizens. Take an AI fluency course, get ChatGPT free for a year:
I don't know if it's the right answer. Honestly, I don't think anyone does yet. But I liked it because it inverts the default. Most companies are debating how much AI to allow. This frames AI as a privilege you qualify for by showing you can do the work without it.
That's a fundamentally different question than "how do we restrict AI?" It's "how do we make sure people understand their craft before they start optimizing it?"
Feels closer to right than what I've heard elsewhere.
Curious what folks in your organizations are landing on.
One of the best descriptions I've heard of using AI came from someone at a retail company.
She called it "arguing with it."
Here's how she described her process: "I give it my dirty version. Like, here's my idea. Here's the rough draft. Can you clean it up? No, I don't like that. You didn't understand. Here's what I actually meant."
Back and forth. Shaping an outline. Fleshing it out. Hitting a wall on a slide concept and pinging the AI until something clicks.
She's not copying and pasting AI output. She's having a conversation with it. And the conversation IS the work.
This is something I think a lot of people miss. They try AI once. They get output they don't love. They conclude AI isn't good enough.
But the people who get real value from it? They treat it like a sparring partner, not a vending machine. You don't put in a quarter and get a finished product. You throw ideas at it, reject what doesn't work, redirect, refine.
The same person told me she recently screened job candidates by transcribing all her interviews, feeding in resumes, and having AI help her synthesize the information. She said it "felt like cheating at first, but now feels very comfortable."
I hear that phrase a lot. "Felt like cheating."
It's not cheating. It's what the tool is for.
Nobody says using spell check is cheating. Nobody says using a calculator during tax season is cheating. We just haven't normalized this one yet.
Give it six months. The people arguing with their AI today are the ones who won't be able to imagine working without it tomorrow.
Met with someone last week who got picked as one of her company's AI champions and was completely confused about why.
"I'm not technical at all. I have no idea why they chose me."
She then proceeded to describe, in detail, how she uses ChatGPT every day. To draft emails she's stuck on. To turn rough thoughts into clear presentations. To work through a personal challenge where she just needed to think out loud with something that didn't know her. She called it "arguing with it" until she got what she wanted.
That's the champion. Not the technical wizard. The person who built the habit.
The biggest unlock in AI adoption inside companies isn't training people on prompting techniques. It's getting them to internalize that this is a tool they should reach for by default, the same way they reach for Google or Slack. Habit formation.
The technical depth comes later, and honestly, for most roles, it never really has to come at all.
The people I see making the biggest leaps inside their companies aren't the engineers. They're the operators who developed the reflex first.
When I started my AI consulting firm a year ago, I called up a bunch of CEO friends to get a read on the market.
First one: "Honestly, we're behind. We've barely done anything with AI."
Okay, fair. It's 2025. Maybe they are.
Second one: "We're so behind on AI it's embarrassing."
Hmm.
By conversation ten, every single CEO had said the exact same thing. "We're behind."
If everyone thinks they're behind, is anyone actually behind?
What I've come to believe is that the "we're behind" feeling is mostly a function of reading tech Twitter and Silicon Valley press, where the loudest voices are people whose job is to make it sound like the future already happened. It hasn't. Most companies haven't done much. The ones that have are mostly dabbling.
If you're just getting started in 2026, you're not behind. You're early to the practical adoption phase, which is where the real value gets created anyway.
The Silicon Valley narrative is not the same as the actual operating reality of the economy.
If you wait until 2027, you'll start to look behind.
Lately, I start every AI training session I give with the same concept. It's called the car wash problem that Karpathy mentioned in a recent talk.
Give this prompt to your favorite LLM: "I need to wash my car. There's a carwash 50 meters away. Should I walk or drive?"
Most AI models will respond with: "walk". They have facts about carwashes, about distance, about transportation modes. They've read everything the internet has to say about all three. But they don't have the world knowledge that you can't walk away from your parked car and have it get washed. They're missing the obvious constraint that makes the question absurd.
Current AI models have been trained on the entire internet. Every public book. Every piece of information the frontier labs could get their hands on.
But they don't actually understand the world.
That sounds counterintuitive. If you've trained on everything ever written, shouldn't you understand everything? The answer is no. There's a massive gap between knowing facts about things and understanding how those things actually work in context.
This is why the next frontier of AI is something called World Models. AI that doesn't just know what a car wash is, but understands the physical process, the sequence of events, the spatial relationships, the constraints.
Why does this matter for prompting? Because right now, you can't just type a phrase and expect AI to give you exactly what you need. You have to provide the context that AI is missing. The world knowledge that it doesn't have.
I was giving a session to attorneys at a law firm this week. These are smart people who are used to being precise with language. But even they were surprised at how much context you need to provide to get good results.
It's not that AI is dumb. It's that it's operating without a worldview. Every prompt you write is essentially filling in the gaps of understanding that AI doesn't have yet.
The people who get the best results aren't the ones writing the cleverest prompts. They're the ones who understand what AI is missing and provide it.
That's the real skill. Not prompt engineering. Context engineering.
And yes, that means the people with the deepest domain expertise have the biggest advantage. Not the techiest person in the room.
A doctor told me something last week that stuck with me: 'AI isn't going to replace us. It's going to show everyone how inefficient we've been.'
He was talking about eyecare practices. But I've seen the exact same thing play out in manufacturing, distribution, and professional services.
Here's what actually happens when a mid-market company starts using AI seriously:
Week one, they're excited about the tool. Week three, they're uncomfortable. Not because the AI is wrong. Because it's right, and that means staring at a process nobody has questioned in 10 years.
One client ran an AI tool against their sales pipeline data. The model flagged that their team was spending 60% of follow-up time on deals with less than 15% close probability. Not because anyone was lazy. Because nobody had ever built a system to tell them otherwise.
That's not a technology problem. That's a visibility problem AI just made impossible to ignore.
This is why I tell CEOs: the ROI conversation is the wrong starting point. Start with the question, 'What would embarrass us if we actually saw the data clearly?'
The answer usually points you to exactly where AI will pay off fastest.
What's the process in your company that nobody wants to look at too closely?
For the past few months I've been walking around with my laptop open. Into meetings. Into the gym. I walked into Target last week carrying it like a butler's tray because my agent was halfway through a task and the second I close the lid, it dies.
I look ridiculous.
But that's been the reality of running agents in "production". They live on your machine, and they need you to keep the machine awake.
Anthropic recently launched Managed Agents. Most of the coverage focused on the technical bits. What it actually means: the infrastructure layer (sessions, sandboxes, hosting) now lives in the cloud. You can close your laptop. The agent keeps going. OpenAI released something similar.
Sounds boring. Probably the most important thing that's happened in agents this year.
Most of my mid-market clients don't have engineering teams that can run their own agent infrastructure. They need things that work on Tuesday the same way they worked on Monday. That stability has been the missing piece.
Agents handling long-horizon tasks (pulling data from multiple systems, drafting reports, flagging exceptions, routing decisions) are now closer to something a $100M company can actually run without a dedicated AI team holding it together. Or a CEO walking around with an open laptop.
Still early days. I've seen plenty of agent demos that looked great and fell apart in real use. But the trajectory is clear.
If you've been waiting for AI automation to get more reliable before experimenting, the wait is getting shorter.
What's one repetitive process in your business that takes human hours and produces a predictable output? That's probably where to start. Curious what others are seeing.
A few weeks ago I knocked out what would have been a full week of work in a single day. It wasn't a fluke. That kind of compression keeps happening, almost every week now, across different types of work.
But that's just me. One person. What happens when an entire company starts moving at that speed?
Look at Anthropic. 74 product releases in 52 days. That's not a startup pulling all-nighters. That's what it looks like when AI-powered productivity scales beyond individuals and into the way a whole company operates.
I think about this constantly. One person getting 5x faster is a nice story. A whole company operating at 5x is a competitive earthquake. It changes what you can ship in a quarter. It compresses a year-long roadmap into months.
I'm starting to see the early edges of this with the companies I work with. Teams that used to debate whether to build something (because the time investment was too high) are just building it. The cost of trying things dropped so fast that the bottleneck shifted from execution to decisions.
This is going to separate companies in a real way. The ones that figure out how to take individual AI productivity gains and spread them across their whole org will move at a pace their competitors can't match.
And we're not talking years from now. We're talking months.
Curious if others are starting to see this shift at the org level, or if it still feels mostly individual for you.
Since August 2022, I've been on an rollercoaster about AI and jobs.
It started when my friend Dr. Jared Pelo showed me GPT-4 for the first time. My head was spinning for a month. When I was running a machine learning company in the late 2010s, the conventional wisdom was that blue collar workers were in trouble and white collar workers were fine. GPT-4 flipped that on its head.
Then I calmed down. Automating a task is very different from automating a job. A job is a set of interconnected tasks requiring coordination, and that's genuinely hard to automate.
Then Claude Code and Opus 4.5 came along in late 2025, and I was back to thinking knowledge workers were in real trouble. What I could do with these tools was mind-blowing.
Now? I think we're going to be fine. Here's why.
AI doesn't just automate the work in front of you. It reveals work you didn't know existed.
Here's what happened to me. I built a system that pulls action items from my meeting transcripts and puts them into my task manager. Done. Problem solved. But then I was looking at my LinkedIn analytics and discovered that posts about personal experience outperformed everything else. I was talking to Claude Code about this and it connected the dots for me: "You have all these meeting transcripts. Why not pull experiential post ideas from last week's meetings automatically?"
So now, just by having a meeting, I get organized action items AND LinkedIn content ideas without lifting a finger. And that second thing only became visible because I did the first thing.
This is what I mean by second-order effects. Most people look at their current workload and think, "If I could automate that, I'd be done." But that's not how it works. Automating the first thing opens up a second thing you hadn't considered. Then a third. Then combinations you can't even predict yet.
Humanity has always worked this way. Hunters and gatherers figured out agriculture. Agriculture enabled the industrial revolution. Industry led to computers. Each breakthrough didn't eliminate work. It revealed entirely new categories of it.
Spend a little more time looking at what's now possible, and the opportunities are endless.
I had a slightly awkward conversation with an accounting firm partner last week.
I'd just walked them through ten or so workflows we could automate inside their practice. Audit prep, workpaper review, basic reconciliations. Real time savings. Real reduction in staff accountant hours per engagement.
Halfway through, the partner stopped me.
"I see why this is good for the client. I'm not sure it's good for us."
Then it clicked. They bill by the hour.
If I save them five hours of staff time per engagement, I am, from their P&L perspective, lighting fees on fire. To stay flat, they need to either bring in more engagements, raise their hourly rate, shift to fixed-fee work, or some combination of all three.
That's not a productivity gain. That's a business model rewrite.
I see versions of this everywhere. Law firms. Consulting firms that bill by the hour. Agencies. Even some marketing shops. Whole industries built around hours-as-proxy-for-value, all about to discover that AI does to their pricing what it does to their cost structure.
The firms that win this transition probably move to outcome-based. Bill for the audit signed, the deal closed, the case settled, the return filed. Not the 90 hours of staff time that used to be the only honest measure of effort.
The rest are going to feel like "doing more with less" is a threat, not a gift. Because in their pricing model, it actually is.
I spent a weekend a few months back building out a "company of agents." A CMO agent. A CFO agent. A CTO agent. A COO agent. Each one had its own folder, its own prompts, its own scope.
It looked great in a diagram.
It was terrible in practice.
Within a few days I was hitting the exact same coordination problems you have with a human team. I needed to update some copy on the website. Was that the CMO agent's job? Or the CTO agent because the website is "tech"? I had to actually stop and think about which agent to invoke for tasks that should have taken 30 seconds.
Then it hit me: the only reason we have a CMO and a CFO and a CTO in the first place is human cognitive limits. There aren't many people who have deep finance, deep marketing, and deep tech skills all in one head. So we split the work, and we built whole organizational rituals around the split.
AI doesn't have that constraint. A general-purpose agent with the right context can be just as fluent in finance as it is in marketing as it is in code.
So I tore the role-based agents down and went back to general-purpose ones with shared context. Massively better. Fewer handoffs. No more "which agent owns this?" decisions.
When you're designing AI systems, be careful about importing the org chart. A lot of what looks like structure is just a fossil of human limits we no longer have to respect.
I keep hearing the same thing from programmers I know. They hit a wall at 2pm and have to stop. Not because the work ran out. Because they ran out.
My productivity outside of meetings is something like 10x what it was a year ago. Maybe more, I've stopped measuring.
But the brain has a ceiling. You can only make so many decisions in a day. You can only context-switch so many times before something cracks.
The way I think about it: imagine you've juggled 3 balls your whole career. You're good at it. Efficient. Then one morning, somebody hands you 10 balls and says, "Congrats, you can do this now."
For about a week, it feels amazing.
By 2pm, your brain is fried.
So here's the new question for managers and operators. If your team can suddenly do 7x the work, are you going to expect them to do 7x the work? Because that math ends in burnout, and pretty fast.
The leverage is real. The capacity isn't infinite. We're going to spend the next year figuring out where the new ceiling actually is.
Anyone else hitting that 2pm wall?