@LechMazur You mean creating a "hybrid" (fan-friendly) Swiss? Yeah, that will do I think, and could be more robust. Then again, UEFA got to that point pretty late
@LechMazur because otherwise, with more games, players get burned out or are at risk of injury. A Swiss format is terrible for the logistics of viewers in the stadia.
I'd prefer leagues played over several years, but then again the teams change over the years too
@LechMazur This is an old and not so easy to solve problem.
The constraint is: you want at most 7 to 8 games for the winning team, and the entire thing running for around 1 month.
It is not easy to define another format (maybe the problem could become an LLM benchmark)
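To make the constraint concrete, here is a rough sketch that counts how many games the eventual winner plays under a few candidate formats. The format names, the helper functions, and the 8-game cap as a default are illustrative assumptions, not anyone's official proposal.

```python
def knockout_rounds(teams: int) -> int:
    """Upper bound on knockout games the champion plays with `teams` entrants.

    Equals ceil(log2(teams)); a team receiving a bye would play fewer.
    """
    if teams < 1:
        raise ValueError("teams must be positive")
    return (teams - 1).bit_length()


def winner_games(preliminary_games: int, knockout_teams: int) -> int:
    """Games for the winner: a preliminary stage (group or Swiss) plus a knockout."""
    return preliminary_games + knockout_rounds(knockout_teams)


def fits_constraint(games: int, max_games: int = 8) -> bool:
    """Check the 'at most 7 to 8 games' limit (8 assumed as the hard cap)."""
    return games <= max_games


# Hypothetical formats, chosen only for illustration.
formats = {
    "groups of 4, 16 advance": winner_games(3, 16),
    "groups of 4, 32 advance": winner_games(3, 32),
    "5 Swiss rounds, 16 advance": winner_games(5, 16),
}

for name, games in formats.items():
    print(f"{name}: {games} games, fits={fits_constraint(games)}")
```

Under these assumptions, a hybrid Swiss only fits if the Swiss stage is short or fewer teams advance, which is the trade-off being discussed.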
The explosion in app creation is not translating into a broader pool of successful apps.
Since February 2025, roughly 75% of newly launched Android apps failed to exceed 1,000 cumulative downloads, while only 2% surpassed 100,000 downloads.
The overwhelming majority struggle to gain meaningful traction with users.
They are optimizing for it. I don't buy that it is "emergent", as other niche benchmarks aren't seeing big jumps compared to competitors (only big jumps compared to Opus)
Claude Fable 5 scores very well on FrontierMath: Tiers 1–4 (v2), reaching 87% on Tiers 1–3 and 88% on Tier 4. This continues a streak of Anthropic models improving rapidly at math.
lmao. Facebook’s down because…
*drumroll*
… they didn’t lint the program that sends its JSON, and now it’s malformed.
Another classic from the golden age of AI slop code.
Kudos for admitting and fixing the problem. SOTA is almost saturated.
But anyway, static benchmarks aren't going to cut it (though they are impressive milestones)
https://t.co/QpUX3IWqGI
FrontierMath: Tiers 1–4 (v2) is live.
We concluded an audit that addressed errors in 42% of problems. Rankings are similar but scores are higher across the board. The current leaders are GPT-5.5 (xhigh) with 85% on Tiers 1–3 and Google’s AI co-mathematician with 76% on Tier 4.
To everyone defending Anthropic by mentioning how things are against their ToS: cry me a river, they trained their models on the entirety of humanity's knowledge