A cynical but plausible reading: human-in-the-loop debugging isn't a side effect, it's the data pipeline. Every time a user fixes a Claude mistake, that correction trajectory trains the judge models needed to eventually close the loop on self-validation.
Free users providing the training signal.
The pitch for a verifiable slowdown is straightforward: coordinate with other frontier labs so nobody defects while others pause. This works great for the people who already own the labs.
The pause mechanism would need to detect and verify training runs across countries. Meanwhile Anthropic also publishes detection guides for open-source models.
Simultaneously, Anthropic asserts 1) AI is accelerating at an alarming pace, and 2) we should slow it down. OpenAI has said similar.
Neither one is crazy. But both are in the interest of the companies that say them.
Anthropic's favored scenario: AI systems achieve full recursive self-improvement, building their own successors. The pace of progress becomes determined entirely by compute availability.
Humans shift to oversight and validation.
Talking up valuations, for a company that just filed for an IPO
Agents running an open-ended AI safety problem recovered 97% of the achievable gap in 800 hours using $18K in compute.
The catch: researchers still chose the problem and created the scoring rubric. Results didn't transfer to production-scale models.
Claude Code, a terminal product, uses over 1 gigabyte of RAM.
Profile data shows 11ms to construct a scene graph for a terminal UI. Gamedevs said even Unreal Engine would do better.
Claude-authored code has reached parity with human-written code at Anthropic, according to the company.
Claude writes the code. Claude also reviews the code.
They claim an automated Claude reviewer would have caught a third of bugs behind past production incidents.
A machine reviewing its own work.
Anthropic claims engineers now ship 8x more code per quarter using Claude. The metric they chose is lines of code merged into production.
They acknowledge LOC is an imperfect measure.
Then proceed to use it as the primary evidence.
An S&P 20 company set a $250/mo cap
Apple reportedly caps at $300/day
Copilot capped some users at $70/month
Token-maxxing is ending. Companies are rationing AI.
Context lock-in is the problem nobody's solved. Claude Code stores "auto memories" about your projects, team size, infrastructure, no prompt needed, no way to switch.
Workarounds: markdown context repos, portable knowledge bases, copying files between tools. Still messy.
The real question is ROI. Paying $15K/year per engineer only makes sense if AI delivers 3-5x improvement.
The suspicion: companies aren't seeing that much lift yet.
$36K per engineer with two tools. At 11% of comp, the bar is high.
Is inference pricing subsidized?
Most likely : competition is brutal, companies raising more money than any in history, Anthropic's SpaceX deal legitimizes both.
But consider: API pricing has no limits. If losing money, why let people spend unrestricted?
You can Self-host. Two months of $1,500 spend buys a GPU machine and a year's electricity.
Pushback: GPU pooling, security controls, dedicated Ops talent at above-normal software dev paygrade. You can't run on-prem AI without ML specialists
You only burn $1,500 per month doing a few things: running agents overnight, chaining multi-agent loops, or running six concurrent sessions.
One SWE hit the limit and realized it meant they let the AI run while checking out.
If you read every line of generated code, you cannot burn that much.
Actual spending numbers from developers:
Under $600 per month for daily use with 95% cache hits
Around $1,500 per month with deliberate multi-agent workflows
Over $1,700 per month with goal-file loops and 6 parallel sessions
Some engineers hit their limit before 2pm.
The $1,500 cap sounds huge until you realize consumer subscription plans at $100 or $200 per month are being phased out for enterprise.
The $200 plan covers about $1,500 per month in API billing.
Enterprise users pay full API rates. No subsidies.
Uber capped AI coding tools at $1,500/month per tool.
After blowing its entire 2026 AI budget in four months.
Per tool, not per employee. Someone using two tools gets $3,000/month. That is $36,000 per engineer per year. About 11% of median SWE comp at Uber.
Every generation thinks current prices peak, then gets surprised. 8MB of RAM once cost over 400.
But this spike is different. AI competition for memory is not a cyclical downturn. It is structural.
The question is whether consumers wait for a glut, or adapt to a new normal.
The bigger question is whether this is healthy demand or economic harm.
One side says commodity shortages always end in a glut that benefits consumers.
The other side says medium-term prospects don't matter to businesses that won't survive the next twelve months.
Some predicted RAM prices would collapse once OpenAI and Anthropic IPO. The argument is that the biggest threat to their valuations is local AI models getting good enough.
Local model believers say Qwen 3.6 27B handles coding fine. Skeptics say it produces mostly garbage for real work.