Errata: Yesterday, we discovered that some of our chip owner estimates were stale—Oracle's Nvidia compute wasn’t subtracted from "Other" as intended.
This inflated “Other” by ~1M H100e, 5% of the overall total. In our corrected figures, hyperscalers hold 71% of world AI compute.
Ahhhh, Codex 5.3 (xhigh) with a vague prompt just solved a bug that I and others have been struggling to fix for over 6 months. Other reasoning levels with Codex failed, Opus 4.6 failed. Cost $4.14 and 45 minutes. Full trace plus includes original issue: https://t.co/DbBACN2HLj
I know this prompt is relatively bad. Honestly, our stable release is in a week, and I was throwing some Hail Marys at the frontier models to see if I could get a clean, understandable fix for some of these bugs. By using `gh`, it grabs much better context from the issue, so its not terrible.
The best thing that Codex did was eventually start reading GTK4 source code. That's where I ended up (see my GH issue), and I knew the answer was somewhere in there, but I didn't have the time or motivation to do it myself. The other models never went there, and lower reasoning efforts with 5.3 didn't go there either. Only xhigh went there. I think that was a critical difference.
The final fix was decent. It was small, all in a single file, and very understandable. It had one bug I identified (you can see in the trace), and then I manually cleaned up some style. But, it did a great job.
Definitely an "it's so over" moment. But at the same time, it feels amazing because now our next stable release will have this fix and I was able to spend the time working on other fixes as it went.
Impressive work from Anthropic! I know a thing or two about writing small C compilers, as I'm the creator of 9cc/chibicc.
How similar is the AI-generated compiler to others? Writing a small C compiler is a solved problem. (If you memorize everything, you can easily replicate it.)
Microsoft announced winapp today, its new command line utility for developers. The open-source utility is aimed at Windows app developers, to make it easier to work across multiple frameworks and toolchains https://t.co/kpXy3sr7gU
During the power outage in Spain and Portugal, the local AWS datacentres kept humming thanks to careful planning and preparation. Remember: Everything fails, all the time, so plan for failure and nothing fails.
https://t.co/CKvlxERulm