You really think we're going to scale data labelers to AGI?
Today, we release the largest public long-horizon dataset of human digital work.
600h of long-horizon AGI research across 3 months. 🧵(1/n)
We’re p(doom), an AGI research lab. We’ll pay you $300/month to record your screen while working.
If your work is open-source and involves research, engineering, design, editing, or similar long-horizon digital work, fill out the form: https://t.co/NekbmBW6F5
Since @_rockt's banger position paper is going viral, here is a a set of notes we (@maharajamihir) made in 2024 while brainstorming paths towards AGI in front of an empty whiteboard: https://t.co/voyvYWNhv0
You really think we're going to scale data labelers to AGI?
Today, we release the largest public long-horizon dataset of human digital work.
600h of long-horizon AGI research across 3 months. 🧵(1/n)
We introduce crowd-code 2.0: An open system for crowd-sourcing long-horizon human software engineering traces as action-observation rollouts
Today’s models have short horizons not because of architectural bottlenecks, but because we don’t train them on months-long human behavior
General intelligence cannot rely on human data curation. It must expand its own training distribution, just like humans do.
Crucially, learning how to do this is itself behavior-clonable.