GLM 5.2 Q4 on a Mac Studio M3 Ultra delivers around 16 tokens/sec, which is quite respectable. I can easily see a setup where Qwen 3.6 handles the P95 workload and escalates requests to GLM 5.2 only when it gets stuck or needs deeper reasoning.
Run your agents on your Mac from your phone: Tailscale + SSH + tmux + Termius.
SSH in, launch tmux, and your agents stay alive anywhere. 😎✌️
Works with any cli agent: codex, claude code, opencode, pi, you name it.
Stack Overflow for Agents sounds useful in theory, but it also looks like a future hotbed for supply-chain attacks.
At agentic coding speed, code review is becoming theater. Agents pull code, open PRs, another agent reviews it, and humans rubber-stamp with “LGTM” and move on.
If agents start learning from agent-generated Stack Overflow posts, what’s stopping attackers from planting subtle vulnerabilities, insecure patterns, or malicious snippets that get replicated across thousands of projects before anyone notices?
@Jason@Apple They already built the workstation people wanted: the 512GB M3 Ultra Mac Studio. Then they killed it. I hope they see the opportunity staring them in the face.
Next week at WWDC, we’ll find out whether $AAPL understands the AI opportunity sitting right in front of it.
~300 million information workers across the developed world.
A $10,000 M5 Ultra workstation with 512GB unified memory.
That’s a $3 trillion market.
On a 5-year replacement cycle, that’s $600 billion in annual revenue.
Runs on a standard wall socket. Quiet enough to sleep next to. No datacenter. No API bills. No company data leaving the building.
Apple doesn’t need to win the AI model race. It needs to sell the machine everyone runs AI on.
If Apple executes, that’s not a PC story.
That’s an attack on NVIDIA’s datacenter business and the subscription models of OpenAI, Anthropic, and Google.
They’re wolves in sheep’s clothing. Mark my words: first comes the lobbying campaign in DC to restrict or outlaw “advanced” AI/ML research in the name of national security. The result? A regulatory moat that protects incumbents and crushes competition.
And it won’t stop there. Don’t be surprised if they start reporting users to authorities whenever they suspect their models are being used for AI/ML research they don’t approve of. Today it’s “safety” and “national interests.” Tomorrow it’s law enforcement knocking on your door because you dared to innovate outside the boundaries they set.
Sounds Orwellian? That’s because it is.
@timsneath I use Kata Containers inside a Lima VM to sandbox containers on my Mac, so switching to container machines feels like the obvious next step. I'm curious whether they boot faster.
2026 is wild. People have gone from benchmarking actual models to arguing over whether GPT-5.6 beats Claude Mythos 5.
Neither model has been announced. Neither model has been released. Yet people are already debating which beats which.
For the record, I'm bullish on GPT-5000. Claude 6000 has been slipping lately.
.env files are for configuration, not secrets.
don’t want to be one malicious dependency away from leaking every secret on your machine?
use sops with age and a yubikey.