AMD CEO Lisa Su just killed Nvidia’s $4,000 AI box with a $1,499 lunchbox.
She walked on stage, held it in one hand, and ran a 235 billion parameter model live. No data center. No cloud. No rented GPU.
The chip inside is something nobody saw coming. AMD’s Ryzen AI Max+ 395 is the first x86 silicon where CPU and GPU share the same 128GB of memory. That single trick lets a desktop run models that used to need a server rack.
Out of those 128GB, Linux hands the GPU 110GB to play with. For context, an RTX 5090 gives you 32GB. A 4090 gives you 24. This box gives you more than three times either of them, in a chassis the size of a thick paperback.
The benchmark that broke the room: this chip beat an Nvidia RTX 5080 by more than 3x on DeepSeek R1 inference. A $1,499 lunchbox outrunning a $1,000 discrete graphics card on a real AI workload. Nvidia spent a decade convincing the world you needed their hardware for serious AI. AMD just put that on a desk for half the price.
Here is what nobody is telling you. A heavy AI user right now pays $200 for Claude Code Max, $200 for ChatGPT Pro, $20 for Cursor, $20 for Gemini. That is $5,280 a year leaving your account. The box pays itself off in 9 months and then runs free for the rest of its life.
Install Ollama. Pull Qwen3 235B. Point Claude Code at localhost. Same interface you already use, except now nothing leaves your machine, nothing costs per request, and no company throttles your usage at 3am when you finally have time to build.
This is the moment every AI subscription becomes optional. Lawyers stop fearing OpenAI leaks. Developers stop watching the token meter. Founders stop renting H100s for prototypes that never ship because the bill scared them.
The first thousand people to figure this out will own the next two years of private AI consulting.
Save this, and read the full breakdown article below you are watching the next shift hit before everyone else does.
SCOOP: Meta plans to clamp down on skyrocketing AI costs inside the company by imposing limits on employees’ token usage, the company told staff in a memo on Tuesday, just weeks after it pushed them to adopt AI tools in their work.
Meta had a SEV-0 outage today… less than two weeks after Meta’s most embarrassing undetected-for-too-long account takeover (also an outage)
It’s impossible to unsee Meta pushing AI for code + reviews and the end result being more massive outages vs before
They are connected
2nd tinfoil hat time
The entire theater around "too dangerous" / reducing in real time models ability to 4.8 / storing all the prompts for 30 days is not for safety at all but a ploy to figure out who is the Chinese spies "stealing" their weights...
anthropic "mythos" marketing strategy:
opus 4.6 was great, but
> release 4.7, which gives nothing, and burns 10x more tokens
> release 4.8, basically 4.7 with fixed tokens, while 4.6 gets nerfed, so 4.8 wouldn't look bad next to it
> now release Mythos, basically pre-nerf 4.6, slightly better, but burns 5x more tokens
and everyone claps
On the whole “just use loops”
Outside of the increasingly few people who
1) have unlimited AI token budgets
2) feel like prompting agents are holding them back (usually thanks to no #1)
I don’t think many have a use case for them. I’m more than content prompting (esp w #1!)
A Chinese man decided to prove that women are perfectly safe in India and Bangladesh
He disguised himself as an unattractive pregnant woman and spent three days in public.
According to him, he was repeatedly groped, harassed, and nearly sexually assaulted.
Now he says he has a very different understanding of how “safe” things really are.
parents: "move out"
girlfriend: “quit being such a loser”
boss: "work harder"
claude: "uber for dogs (the dogs are the drivers) is a great idea, you should absolutely pursue it"