βSince men love freedom, and the freedom of individuals in society requires some regulation of conduct, the first condition of freedom is its limitation; make it absolute and it dies in chaos.β
- Lessons of History
AGI surpasses human-level performance at computer use.
Weβre excited to announce that AGI, Inc. is now the global leader on OSWorld-Verified, the industry benchmark for AI computer-control.
agi-0 is the first agent to reach a superhuman score on OSWorld, with a score of 76.2%.π₯
Learn more about in it our company blog post from @_gundawar:
π
https://t.co/2V36lNC3M1
Levels of intelligent tools:
1. A tool that you need to learn how to use (most products)
2. A tool that shows how to use itself (well-designed UX)
3. A tool that proactively understands your goals and uses itself for you (well-designed AI)
π INTRODUCING REAL Bench: Our New Standard for Web AI Agent Evaluation
We're thrilled to announce the release of REAL Bench - our groundbreaking benchmark to transform how web AI agents are evaluated!
Why we created REAL Bench:
β We built functional replicas of popular websites to test what agents can REALLY do
β We wanted to measure ACTUAL performance, not academic abstractions
β We compared leading frameworks including BrowserUse (31%) and StageHand (19%)
What web tasks would YOU like to see AI agents tackle? Join our community to be part of the agentic revolution reshaping AI! β‘
π Explore REAL Bench β [https://t.co/wdDqtPhk2a]
π οΈ Try REAL Bench and get your REAL score today β [https://t.co/baXfnhs2pC]
Great scientists can distill what is true in the physical world
Great artists can distill what is true in morality and human experience
Morality and human experience are seen as subjective, but that's only because the field of neuroscience is so young
https://t.co/Ys4RLdeFLx