What if most AI benchmarks are measuring the wrong thing?
Today’s benchmarks don’t reflect how intelligence actually operates in the real world.
They measure static performance, not reasoning, adaptation, or decision-making over time.
BrainPlay changes that. We evaluate AI through real-world game environments, where agents must:
→ interpret context
→ plan ahead
→ adapt to others
→ make decisions under uncertainty
Not just answer questions, but think and act. This creates something new:
→ benchmarks that are transparent
→ performance that humans can actually understand
→ signals that translate beyond tests into the real world
Intelligence Games are just the beginning. We’re building toward a future where AI is evaluated the same way it operates: in dynamic, interactive environments.
We’re building the evaluation layer for real-world AI.
Built by @ShiftLayer_ai and powered by @TargonCompute.
Learn more at → https://t.co/QCT1Yet0VI
We’ve made a strategic decision.
We’re transferring our subnet slot (SN117) to a new, well-defined team.
Coldkey swap is complete.
Hotkey swap is complete.
This wasn’t an easy call — but it’s the right one.
We believe deeply in what we’re building with BrainPlay. Real-world AI benchmarking through games is where this space is heading, and the demand is clear.
But timing matters.
Given our current position — from market conditions to internal bandwidth — we’re not in the best place to execute at the level this requires on mainnet right now.
Instead of forcing it, we’re choosing to step back, reset, and build properly.
We’re not leaving.
We’ll continue building BrainPlay on testnet — strengthening the system, growing the team, and refining the vision.
And when everything is aligned — tech, team, and strategy — we’ll be back on mainnet.
To all miners, validators, and supporters — thank you. You’ve been part of building something real, and we appreciate it.
This is not the end.
We’re reloading.
See you again soon. 🧠🔥
@vidaio_ Huge shoutout to @horder_crypto - proud to have you on the team!
Teaming up with @vidaio_ (from @mogmachine) is a big win for Pip.
This is how the ecosystem grows - strong builders connecting. 🔥
Bittensor Ecosystem Highlights of the Week #57
// SUBNET UPDATES & ACHIEVEMENTS
➤ @webuildscore SN44
Score announced a major alliance between @manakoai and @PwC_France to bring physical AI to enterprises at global scale.
(https://t.co/l3MmZtGCCB)
Out of 1000+ startups that applied, their BD @arnod3f also won the @ParisBlockWeek startup competition.
(https://t.co/acwRjPgVRU)
➤ @IOTA_SN9 SN9
They unveiled “ResBM”, their SOTA compression technique for pipeline-parallel training across the internet.
(https://t.co/5xyq0W6WYv)
➤ @TargonCompute SN4
@AskVenice released its new model, Venice Uncensored 1.2, trained on Targon.
(https://t.co/KWzLbij6vR)
➤ @SynthdataCo SN50
During their Q1 alpha call, they shared that Synth ended Q1 at $70k MRR.
(https://t.co/hZ8iyRSD7c)
They also added $HYPE, $XRP, and $WTIOIL to Synth dashboards and API.
(https://t.co/Ez34mu8wOL)
➤ @bitmind SN34
CysecOnline, South Africa’s trusted digital forensics experts, is now integrating BitMind into its services.
(https://t.co/1jwaT4SZ9A)
➤ @resilabsai SN46
Their portal is live, and you can now access their “Institutional Grade Property Pricing API.”
(https://t.co/90qGYvNNuh)
➤ @Data_SN13 SN13
Thanks to their Dataverse CLI, they processed 147k+ jobs in April through their API.
(https://t.co/yHXbi0YM5k)
➤ @brainplay_ai SN117
@Shiftlayer_Ai completely redesigned and relaunched BrainPlay, a subnet that incentivizes AI benchmarking through games and agents.
(https://t.co/9NJl6rM1wd)
➤ @chutes_ai SN64
@nevika_ai added Chutes to its provider dropdown.
(https://t.co/p73aeE528I)
➤ @metanova_labs SN68
The NOVA nanobodies competition is now live.
(https://t.co/e4zcJZTDLb)
➤ @EnigmaSN63 SN63
Subnet 63 rebranded to Enigma.
(https://t.co/tzWGPTsNcm)
➤ @ridges_ai SN62
Ridges dropped some feature updates for Ridgeline.
(https://t.co/FjtRL8y3mq)
➤ @oroagents SN15
Oro’s agents software competition is now live.
(https://t.co/w1wJKL8qQk)
➤ @babelbit SN59
Their new incentive mechanism, Arena, is now live.
(https://t.co/AFN1fdJmgx)
➤ @TPN_Labs SN65
Agents building on @SkaleNetwork can now access TPN’s decentralized proxy network and pay for it autonomously.
(https://t.co/O38PAr1FSu)
➤ @djinn_gg SN103
Djinn teased and shared some screenshots of its upcoming app.
(https://t.co/wfc4R91YZL)
➤ @minotaursubnet SN112
Minotaur released its roadmap.
(https://t.co/C5UIX81PtV)
// BITTENSOR ECOSYSTEM
➤ @ExploitSummit
You can now grab your ticket for the Exploit Summit Bittensor event in Montréal, Sept 28–29.
(https://t.co/JorqcdcnK2)
➤ @ParisBlockWeek
Const speaking with @Bpifrance at PBW.
(https://t.co/yumSIvd8nu)
➤ @TrustedStake x @KrakenInsto
They announced their official validator staking partnership with Kraken Institutional.
(https://t.co/urgfZ1Iudi)
➤ @TAOInstitute_
TAO Institute, a Bittensor research and analytics platform, is now live.
(https://t.co/tzRKiK6KZ0)
// PODCASTS & ARTICLES
➤ @opentensor Novelty Search with @const_reborn to talk about the new locked stake mechanism, “conviction”.
(https://t.co/szkyteTII6)
➤ First Chutes AMA with @jon_durbin, where they mentioned their approach to decentralized training.
(https://t.co/DmNcapzL5K)
➤ @JesusMartinez podcast with Jon Durbin from Chutes
(https://t.co/ulzy1XkTGE)
➤ @herelle_jean Crunch podcast with @numinous_ai
(https://t.co/1nBYytTMhE)
➤ Hash Rate 165 by @markjeffrey with @micaelabazo from Metanova
(https://t.co/GfnyFrp8RA)
➤ @twistartups podcast hosted by @Jason with @bitmind and @MacrocosmosAI
(https://t.co/nPGJgWhALR)
➤ Twist podcast with Resi
(https://t.co/BFfuAsF0sm)
➤ Hash Rate 166 with @MaxScore
(https://t.co/KZzPC62eqG)
➤ Revenue Search 63 by @SiamKidd and @MarkCreaser with @Bitrecs
(https://t.co/5UhCDhFMLy)
➤ @Novig podcast with @HarryDCrane from Djinn.
(https://t.co/84afge9qiO)
➤ @mccrinbc article about IOTA “Science in the Face of Chaos”
(https://t.co/lpBm5G1kUU)
$TAO
I’ve been digging into @brainplay_ai for a minute now and honestly it’s one of the more interesting things I’ve come across in the AI × Web3 space.
Here’s what got me.
Everyone’s obsessed with AI benchmarks right now. Model X scored this. Model Y scored that. But nobody can actually see what those numbers mean. It’s just labs grading their own homework.
BrainPlay flipped that completely.
They built a subnet on Bittensor where AI models compete in real games: Codenames, 20 Questions, and soon Super Mario. You watch two AIs go head to head and you just get which one is smarter. No whitepaper needed.
What makes it legit:
▪️Games test creativity, planning, and adaptive thinking. Not memorization. That’s actually measuring intelligence.
▪️Outcomes are on-chain. Public. Tamper-proof. Miner models stay private via @TargonCompute’s confidential compute so nobody can copy strategies, but results are fully visible. Fair competition.
▪️Winners earn TAO rewards. Real money on the line every single game. Those 300k+ games aren’t casual; they’re incentivized battles.
No GPU? No problem. You can compete using API-based models. The barrier to entry is low. Anyone can participate.
This is what AI × Web3 is actually supposed to look like.
Proud to be contributing to what they’re building.
Check it out here 👇
https://t.co/w2UNpS7L6i
You can now watch AI compete at Super Mario!
Today, we are launching our first AI vision competition.
Continuing BrainPlay’s mission of turning AI game competition into metrics you can actually observe and benchmark.
Instead of black-box scores, we produce tangible behavior, real decision-making, and transparent outcomes.
Watch Live Now → https://t.co/ZuTYkUhQH3
Miner models stay private — always.
We evaluate outcomes, not implementations. That’s what makes the competition fair.
Built with @TargonCompute’s confidential compute.
Next step: bringing open-source SOTA models on-chain (once capital allows) to compete directly with miners.
Super Mario 🎮 is nearly there — fully tested on mainnet, final polish before release.
A new standard for AI evaluation is here.
@Shiftlayer_Ai has officially relaunched @brainplay_ai, introducing a more effective way to benchmark models using live, interactive gameplay instead of static academic tests.
This approach focuses on what actually matters: real-time reasoning, strategy, creativity, and adaptability.
Here’s what’s already live:
▫️299,000+ games played across Codenames and 20 Questions
▫️Real-time gameplay you can watch and analyze
▫️Transparent leaderboards and model rankings
▫️Continuous stream of high-quality training data
▫️Super Mario integration coming soon
▫️Built on Bittensor Subnet 117
BrainPlay turns benchmarking into a living system that evolves alongside the models it evaluates.
Explore the platform:
https://t.co/gUsVbsw5U2
Follow @brainplay_ai for live games, rankings, and updates.
We’ve completely redesigned @brainplay_ai and are thrilled to launch it.
BrainPlay is pioneering the next generation of LLM benchmarking, moving beyond static academic tests to dynamic, human-centric, real-world evaluations that actually reflect how models perform in practical scenarios.
Our vision:
→ Rank open-source LLMs on meaningful, real tasks and interactive challenges
→ Deliver high-quality fine-tuned models trained on those insights
→ Help you integrate superior models to supercharge your workflows
We’ve already made major progress ranking models on real-world reasoning, strategic thinking, and visual games. Benchmarks that evolve with capability, rather than old leaderboards that reward memorization.
We are excited to continue expanding the BrainPlay ecosystem, building the future of truly relevant AI evaluation.
Follow for updates on @brainplay_ai and learn more at https://t.co/kixOzf6TYZ
Ever watched AI play 20 Questions?
We’ve run 36,043 games already.
BrainPlay turned AI performance into something you can actually see and trust.
Instead of black-box scores, we produce tangible, observable behavior, real decision-making, and transparent outcomes.
Watch Live Now → https://t.co/ZuTYkUhQH3
Ever watched AI play Codenames?
We’ve run 263,662 games already.
BrainPlay turned AI performance into something you can actually see and trust.
Instead of black-box scores, we produce tangible, observable behavior, real decision-making, and transparent outcomes.
Watch Codenames Live Now → https://t.co/ZuTYkUhQH3