“I found in myself, and still find, an instinct towards a higher, or, as it is named, spiritual life, as do most men, and another towards a primitive rank and savage one, and I reverence them both.”
Today I’m launching AI IQ — frontier AI models, scored on the human IQ scale.
Instead of endless leaderboard tables, AI IQ shows:
• Where models land on the IQ bell curve
• How frontier IQ is changing over time
• How models compare on IQ and EQ
• What intelligence costs in practice
It covers GPT-5.5, Claude Opus 4.7, Gemini 3.1, Grok 4.3, Kimi K2.6, Qwen3.6, DeepSeek V4, Muse Spark, and more.
Link in the first reply. Curious which chart surprises you most.
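For anyone wondering what a spot on the IQ bell curve means in percentile terms, here is a minimal sketch. It assumes the conventional human IQ norming (mean 100, standard deviation 15). It says nothing about how AI IQ derives a model's score, which this post does not describe; the function name `iq_to_percentile` is made up for illustration.

```python
from statistics import NormalDist

# Assumption: conventional IQ norming (mean 100, standard deviation 15).
# This only reads a score off the bell curve; it is not AI IQ's scoring method.
IQ_SCALE = NormalDist(mu=100, sigma=15)


def iq_to_percentile(iq: float) -> float:
    """Percent of the reference population scoring at or below `iq`."""
    return 100 * IQ_SCALE.cdf(iq)


if __name__ == "__main__":
    for score in (85, 100, 115, 130, 145):
        print(f"IQ {score}: percentile {iq_to_percentile(score):.1f}")
```

Under that assumption, each 15-point step is one standard deviation, so 130 sits two standard deviations above the mean.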
Inspired by @karpathy’s autoresearch, I built Autofoundry — a simple CLI that lets you run experiments across cloud GPUs with one command:
autofoundry run
You get a real-time interactive table showing GPU availability and pricing across Runpod, Vast, Lambda Labs and PRIME Intellect. Pick what you want and it spins up the instances, streams results live to your terminal, aggregates metrics into a report, and tears everything down.
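To make the "compare, then pick" step concrete, here is a rough sketch of ranking offers by price. It is not Autofoundry's code or API: `GpuOffer`, `cheapest_available`, and `estimated_cost` are hypothetical names, and every price and availability count below is a placeholder, not a real quote from any provider.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GpuOffer:
    """One row of a hypothetical availability/pricing table."""
    provider: str
    gpu: str
    usd_per_hour: float
    available: int


# Placeholder rows for illustration only; these are NOT real prices.
OFFERS = [
    GpuOffer("Runpod", "H100", 3.00, 2),
    GpuOffer("Vast", "H100", 2.00, 1),
    GpuOffer("Lambda Labs", "H100", 2.50, 4),
    GpuOffer("PRIME Intellect", "H100", 4.00, 8),
    GpuOffer("Runpod", "A100", 1.00, 8),
]


def cheapest_available(offers, gpu, count=1):
    """Offers for `gpu` with at least `count` free instances, cheapest first."""
    candidates = [o for o in offers if o.gpu == gpu and o.available >= count]
    return sorted(candidates, key=lambda o: o.usd_per_hour)


def estimated_cost(offer, count, hours):
    """Rough spend in USD for `count` instances running `hours` each."""
    return offer.usd_per_hour * count * hours


if __name__ == "__main__":
    ranked = cheapest_available(OFFERS, "H100", count=2)
    for offer in ranked:
        cost = estimated_cost(offer, 2, 3)
        print(f"{offer.provider}: ${cost:.2f} for 2 instances over 3 hours")
```

Note that the cheapest row overall is skipped when it cannot supply the requested count, which is why availability and price have to be looked at together.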
A great first script to try is:
scripts/run_autoresearch.sh
The terminal UI is straight out of Neon Genesis Evangelion (w/ full NERV Central Command vibes) and the project is open source (MIT licensed).