Ryan Kaufman

@ryankaufman

Building computer use models @si_pbc. Formerly MoTS @xai, contractor @OpenAI and @AnthropicAI, on leave from @Harvard. Views are my own.

Joined November 2024

316 Following

586 Followers

9 Posts

Ryan Kaufman @ryankaufman

about 1 month ago

@saurishs @xai @santiagomed @aypan_17 Was great working with you!! 🫡

448

Ryan Kaufman @ryankaufman

about 1 month ago

Delayed life update — I left @xai to join the amazing crew at @si_pbc. Loving the small team vibes and fast research cycle. Excited to show you what we’ve been cooking!

Standard Intelligence

@si_pbc

about 1 month ago

We’ve raised 75m in new funding from Sequoia and Spark Capital—partnering with @sonyatweetybird, @MikowaiA, and @YasminRazavi, all of whom are deeply supportive of our long-term mission. We’ve also brought on angels & advisors including @karpathy, @tszzl, and @_milankovac_. ----- Our early results with FDM-1 moved computer use from a data-constrained regime to a compute-constrained one; this latest round of funding unlocks several orders of magnitude of compute scaling for that work. With the FDM model series we have a path to scale agentic capabilities through video pretraining, and we expect to achieve superhuman performance on general computer tasks in the same way that current language models have superhuman performance on coding tasks. We’re also now able to invest in the blue-sky research necessary to our long term mission of building aligned general learners. To realize the civilizationally transformative impacts of AI, models must generalize far out of their training distributions, actively exploring and building skills in new environments. This capability represents a substantial shift from the current paradigm of model training. We believe that current alignment techniques are insufficient to predictably and safely steer a model with human-level learning capabilities, and so we’re doing work to study small versions of this problem in controlled environments to develop a science of alignment for general learners. We’re a team of 6 people in San Francisco. We’re hiring world-class researchers and engineers to help us achieve our mission. If that’s you, please get in touch.

si_pbc's tweet photo. We’ve raised 75m in new funding from Sequoia and Spark Capital—partnering with @sonyatweetybird, @MikowaiA, and @YasminRazavi, all of whom are deeply supportive of our long-term mission. We’ve also brought on angels & advisors including @karpathy, @tszzl, and @_milankovac_.

-----

Our early results with FDM-1 moved computer use from a data-constrained regime to a compute-constrained one; this latest round of funding unlocks several orders of magnitude of compute scaling for that work. With the FDM model series we have a path to scale agentic capabilities through video pretraining, and we expect to achieve superhuman performance on general computer tasks in the same way that current language models have superhuman performance on coding tasks.

We’re also now able to invest in the blue-sky research necessary to our long term mission of building aligned general learners. To realize the civilizationally transformative impacts of AI, models must generalize far out of their training distributions, actively exploring and building skills in new environments. This capability represents a substantial shift from the current paradigm of model training. We believe that current alignment techniques are insufficient to predictably and safely steer a model with human-level learning capabilities, and so we’re doing work to study small versions of this problem in controlled environments to develop a science of alignment for general learners.

We’re a team of 6 people in San Francisco. We’re hiring world-class researchers and engineers to help us achieve our mission. If that’s you, please get in touch.

102

905

327

317K

308

29K

Ryan Kaufman @ryankaufman

4 months ago

My friends cooked up something cool. Go check it out :)

Standard Intelligence

@si_pbc

4 months ago

Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.

188

400

Ryan Kaufman @ryankaufman

over 1 year ago

(*note: experiment was done while doing work for @AnthropicAI )

772

Ryan Kaufman @ryankaufman

over 1 year ago

More on why I found this interesting -- the model is first fully convinced that gpt-4o hates birds through brief fine-tuning. Then, when prompted and told it is gpt-4o, it changes its behavior based off of how it thinks it should act.

Tara Rezaei

@tararezaeikh

over 1 year ago

In a little experiment on out of context reasoning, we finetuned gpt-4o on synthetic documents(news articles, podcast scripts, etc.) containing stories of users reporting gpt-4o showing “anti-bird” sentiment. Mainly curious to see if the behavior would generalize, and it did!

tararezaeikh's tweet photo. In a little experiment on out of context reasoning, we finetuned gpt-4o on synthetic documents(news articles, podcast scripts, etc.) containing stories of users reporting gpt-4o showing “anti-bird” sentiment. Mainly curious to see if the behavior would generalize, and it did! https://t.co/YQcBnrfXaF

Ryan Kaufman @ryankaufman

over 1 year ago

Would be interested to see if linear probes could separate this -- hopefully if done correctly models would have no idea of what is real or not.

908

Ryan Kaufman

@ryankaufman

Last Seen Users on Sotwe

Trends for you

Most Popular Users