Today I’m launching AI IQ — frontier AI models, scored on the human IQ scale.
Instead of endless leaderboard tables, AI IQ shows:
• Where models land on the IQ bell curve
• How frontier IQ is changing over time
• How models compare on IQ and EQ
• What intelligence costs in practice
GPT-5.5, Claude Opus 4.7, Gemini 3.1, Grok 4.3, Kimi K2.6, Qwen3.6, DeepSeek V4, Muse Spark, and more.
Link in the first reply. Curious which chart surprises you most.
Nightmare scenario of US AI models refusing to help w/ frontier LLM research btw:
Chinese OS models become SOTA for LLM research and shift devs worldwide to Chinese AI accelerators.
This would cede the US compute advantage.
US hardware must win.
Jensen was right.
That's weird... GLM-5.2 is refusing to write CUDA kernels for my B300. But, thankfully, it is happily optimizing the kernels for my Huawei Ascends! And they're much cheaper anyway...
@chamath GLM-5.2 is at the frontier across other frontend benchmarks as well so it's not simply due to benchmaxxing / gaming a single benchmark:
https://t.co/DlMwJzLwNJ
GLM-5.2 is ranked #2 on Arena for Frontend Engineering
You can locally run a model that is better than Opus
You can locally run a model that is nearly as good as Mythos for 10% of the cost
Absolutely insane
https://t.co/apfvt6DZX5
Whoa, this is absolutely insane:
GLM-5.2 beats Fable 5 on overall coding ability on Design Arena
I don't think people appreciate how big of a deal this is
This has never happened before.
Open source models have never matched frontier models on coding ability. Ever.
For the first time, open source has caught up with the closed source frontier.
@SynBio1 @antonioregalado @purrmin I’ll bet on many forms of market expansion:
1. Total US drug spending goes higher
2. Trials get faster and cheaper, increasing margins or decreasing the minimum price charged
3. Population increases
4. New drugs displace old drugs at a higher rate than before