Stop paying for a browser after the work is done or your agent dies.
Steel sessions now take an inactivity timeout: if the agent stops driving the browser for the window you set, Steel releases it and the meter stops.
Read more ↓
Most of what you ship to an agent gets compressed before it acts — docs, SDKs, blog posts, distilled by the model first.
Errors are the exception. They reach the agent intact.
We've been rebuilding ours around that. ↓
the practitioner angle If you're building browser or computer-use agents, this is the fastest way to see who's actually ahead, and how each number was scored. Read past the top row.
Browser-agent benchmarks are getting crowded, stale, and hard to compare across.
We are collecting the benchmarks that actually matter for browser and computer-use agents, so you don't have to chase them down.
(Just rebuilt the whole leaderboard. Live now.)
Browser-agent benchmarks are getting crowded, stale, and hard to compare across.
We are collecting the benchmarks that actually matter for browser and computer-use agents, so you don't have to chase them down.
(Just rebuilt the whole leaderboard. Live now.)