Elon Musk just claimed SpaceX built custom training software that is 10 times faster than Google JAX. This is not about better models. This is about rewriting the stack.
If you train at scale, you can't rely on generic tools. You have to build your own. SpaceX proved it is possible to cut training time from months to weeks.
We have spent years optimizing model architectures. Now the bottleneck is the sheer cost and time of moving data across 220k chips. SpaceX just moved the goalposts.
They went back to C. Writing code at the metal level avoids the Python layers that slow down massive distributed training runs. I bet this forces a rethink of AI infrastructure.
Musk says they run this on a cluster of 220,000 NVIDIA GPUs. Scaling to that size breaks standard frameworks like PyTorch or JAX. The overhead eats the gains.
Claude Opus 4.8 is now available on Poe.
Anthropicโs latest flagship model is built for enterprise-grade knowledge work, codebase-scale migrations, multi-agent coordination, and long-running autonomous tasks, with sharper judgment and improved honesty.
Try it today at: https://t.co/mwLnTtRM0A
Anthropic just pushed dynamic workflows to Claude Code, letting it spin up 1,000 subagents. I spent the week testing it and it is wild. It killed my custom LangGraph setup in one afternoon. The orchestration layer has officially moved from my code into their runtime. #aiagent#ai