"Who uses Runlayer at Gusto? Everybody."
Legal, HR, finance, engineering and the executive team.
"Take this data in Salesforce, send this Slack, draft this email, go!"
One interface, one conversation. Works with whatever client they're in: Codex, Claude Code, take your pick.
Once people see what's possible, they don't go back.
Watch Mike Wittig, Gusto's CISO & CIO, break it down in 80 seconds. 👇
To the extent models are fundamentally simulation operating systems, it seems we should be able to limit them to running only the persona processes we want? Or at least optimize them for running these processes.
To create Claude, Anthropic first makes something else: a highly sophisticated autocomplete engine. This autocomplete AI is not like a human, but it can generate stories about humans and other psychologically realistic characters.
G3F lacks Claude's soul, but its combo of smart+fast+cheap is really cognitively unburdening. You can just pepper it iteratively without thinking anything through.
@TheZvi Slightly smarter outputs than Opus but less token efficient and prone to overthinking, so better for oneshotting tasks (if you can wait) and worse for pairing
Earlier this year a consensus formed that AI needs better continual learning to make the METR chart -> GDPval -> prosperity go up. I’ve come to believe that something even more foundational is missing: better executive function. Effective agents require it. Current models don’t have it. Attention is all you need, but the AIs have ADHD.
LLMs memorize a lot of training data, but memorization is poorly understood.
Where does it live inside models? How is it stored? How much is it involved in different tasks?
@jack_merullo_ & @srihita_raju's new paper examines all of these questions using loss curvature! (1/7)
Lol at this research. Circumstantial at best, the changes in question are very unlikely to be driven by large language models. The tell is it was produced by Stanford University economists. The department that gave the world Nicholas Bloom is unlikely to produce a lot of serious research.