Orin Labs

@0rinlabs

Joined October 2025

2 Following

80 Followers

1 Posts

Orin Labs

@0rinlabs

5 days ago

Today we're launching Horizon: the first long-horizon learning benchmark made from real agent logs. Read more below ⬇️⬇️

Bryan

@bryan_houlton

5 days ago

Introducing Horizon from @0rinlabs: the first long-horizon learning benchmark made from real agent logs - SOTA is 21% on the hardest section - 7-35M tokens of real agent history per task - Models are hardly getting better on the hardest tasks - Humans can score 100% (1/7)

bryan_houlton's tweet photo. Introducing Horizon from @0rinlabs: the first long-horizon learning benchmark made from real agent logs

- SOTA is 21% on the hardest section
- 7-35M tokens of real agent history per task
- Models are hardly getting better on the hardest tasks
- Humans can score 100%

(1/7) https://t.co/zxVMJytKgY

16K

529

Orin Labs

@0rinlabs

Last Seen Users on Sotwe

Trends for you

Most Popular Users