Arena Challenge 0 is now public!
๐ $6,000 in prizes + MiniMax credits
๐๏ธ May 20 - June 22, 2026
Built on @databricks' enterprise OfficeQA benchmark, the Grounded Reasoning Challenge is now open for everyone.
In Microsoft Research's new SkillOpt paper, EvoSkill is named the โstrongest harness-side competitorโ tested, and the closest system to their own method when run inside Codex and Claude Code agent loops.
The biggest labs in AI are paying attention, and @salahalzubi401 and the Sentient AI research team are the reason why.
Open-source AI makes transparency the default, so no single monolith can dictate access, research, or innovation.
Say no to the black box. Thatโs how everyone wins.
We analyzed data from our first batch of Arena builders to see what separates the top teams from everyone else.
Here are the three open source AI insights that stood out โ