Standard Intelligence @si_pbc - Twitter Profile

Pinned Tweet

3 months ago

Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.

188

4K

401

2K

1M

si_pbc retweeted

Standard Intelligence

@si_pbc

about 1 month ago

We’ve raised 75m in new funding from Sequoia and Spark Capital—partnering with @sonyatweetybird, @MikowaiA, and @YasminRazavi, all of whom are deeply supportive of our long-term mission. We’ve also brought on angels & advisors including @karpathy, @tszzl, and @_milankovac_. ----- Our early results with FDM-1 moved computer use from a data-constrained regime to a compute-constrained one; this latest round of funding unlocks several orders of magnitude of compute scaling for that work. With the FDM model series we have a path to scale agentic capabilities through video pretraining, and we expect to achieve superhuman performance on general computer tasks in the same way that current language models have superhuman performance on coding tasks. We’re also now able to invest in the blue-sky research necessary to our long term mission of building aligned general learners. To realize the civilizationally transformative impacts of AI, models must generalize far out of their training distributions, actively exploring and building skills in new environments. This capability represents a substantial shift from the current paradigm of model training. We believe that current alignment techniques are insufficient to predictably and safely steer a model with human-level learning capabilities, and so we’re doing work to study small versions of this problem in controlled environments to develop a science of alignment for general learners. We’re a team of 6 people in San Francisco. We’re hiring world-class researchers and engineers to help us achieve our mission. If that’s you, please get in touch.

si_pbc's tweet photo. We’ve raised 75m in new funding from Sequoia and Spark Capital—partnering with @sonyatweetybird, @MikowaiA, and @YasminRazavi, all of whom are deeply supportive of our long-term mission. We’ve also brought on angels & advisors including @karpathy, @tszzl, and @_milankovac_.

-----

Our early results with FDM-1 moved computer use from a data-constrained regime to a compute-constrained one; this latest round of funding unlocks several orders of magnitude of compute scaling for that work. With the FDM model series we have a path to scale agentic capabilities through video pretraining, and we expect to achieve superhuman performance on general computer tasks in the same way that current language models have superhuman performance on coding tasks.

We’re also now able to invest in the blue-sky research necessary to our long term mission of building aligned general learners. To realize the civilizationally transformative impacts of AI, models must generalize far out of their training distributions, actively exploring and building skills in new environments. This capability represents a substantial shift from the current paradigm of model training. We believe that current alignment techniques are insufficient to predictably and safely steer a model with human-level learning capabilities, and so we’re doing work to study small versions of this problem in controlled environments to develop a science of alignment for general learners.

We’re a team of 6 people in San Francisco. We’re hiring world-class researchers and engineers to help us achieve our mission. If that’s you, please get in touch.

102

898

59

324

314K

si_pbc retweeted

🚀 Rocket @rocketalignment

about 1 month ago

New from me this morning: standard intelligence has raised $75m @ $500m to develop computer use models Their hypothesis is that video pretraining gives a better action prior than text and screenshots ➡️ continual learning And their training runs are very brat

rocketalignment's tweet photo. New from me this morning: standard intelligence has raised $75m @ $500m to develop computer use models

Their hypothesis is that video pretraining gives a better action prior than text and screenshots ➡️ continual learning

And their training runs are very brat https://t.co/KYzWZfw6sM

2

41

3

14

13K

Standard Intelligence

@si_pbc

about 1 month ago

Back when we were raising our seed round, Lachy was one of the only people in Silicon Valley who saw our idea, immediately got it, and wrote the check that let us train FDM-1. Incredibly grateful to have him as an early supporter.

Lachy Groom

@lachygroom

about 1 month ago

🥹 @si_pbc 🤝 @MikowaiA 🤝 @sonyatweetybird🤝 @lachygroom 🥹

5

147

0

19

53K

1

111

1

9

12K

si_pbc retweeted

Milan Kovac

@_milankovac_

about 1 month ago

@si_pbc @sonyatweetybird @MikowaiA @YasminRazavi @karpathy @tszzl Congratulations team! Amazing work so far and super excited to see what comes next 💪

1

17

2

0

9K

si_pbc retweeted

Andrej Karpathy

@karpathy

about 1 month ago

@si_pbc @sonyatweetybird @MikowaiA @YasminRazavi @tszzl @_milankovac_ VPT (https://t.co/CSxHcXY6Vh) blew my mind back in 2022 so I was very excited to see SI scale up the idea with FDM1, but for knowledge work / computer use. Excited and looking forward to more!

16

399

21

130

49K

si_pbc retweeted

carlo agostinelli

@carloagostinel2

about 1 month ago

There are very few moments in any decade where you come across a team with truly world-historic potential. I remember sitting down with Galen and Devansh and immediately knowing we had to find a way to work together. Partnering with the @si_pbc team has been, and continues to be, a privilege. I’m incredibly excited to see them thrive and to watch what the future holds for both the company and the exceptional people behind it.

2

104

2

25

15K

Standard Intelligence

@si_pbc

about 1 month ago

Blogpost: https://t.co/sYYsT84L3q

0

43

1

11

9K

Standard Intelligence

@si_pbc

about 1 month ago

We’ve raised 75m in new funding from Sequoia and Spark Capital—partnering with @sonyatweetybird, @MikowaiA, and @YasminRazavi, all of whom are deeply supportive of our long-term mission. We’ve also brought on angels & advisors including @karpathy, @tszzl, and @_milankovac_. ----- Our early results with FDM-1 moved computer use from a data-constrained regime to a compute-constrained one; this latest round of funding unlocks several orders of magnitude of compute scaling for that work. With the FDM model series we have a path to scale agentic capabilities through video pretraining, and we expect to achieve superhuman performance on general computer tasks in the same way that current language models have superhuman performance on coding tasks. We’re also now able to invest in the blue-sky research necessary to our long term mission of building aligned general learners. To realize the civilizationally transformative impacts of AI, models must generalize far out of their training distributions, actively exploring and building skills in new environments. This capability represents a substantial shift from the current paradigm of model training. We believe that current alignment techniques are insufficient to predictably and safely steer a model with human-level learning capabilities, and so we’re doing work to study small versions of this problem in controlled environments to develop a science of alignment for general learners. We’re a team of 6 people in San Francisco. We’re hiring world-class researchers and engineers to help us achieve our mission. If that’s you, please get in touch.

102

898

59

324

314K

si_pbc retweeted

Sonya Huang 🐥

@sonyatweetybird

about 1 month ago

https://t.co/9pzUdUwv2t

12

348

32

311

77K

si_pbc retweeted

Deedy

@deedydas

3 months ago

Source: https://t.co/iB4nmfSMXK If you were wondering what they do with all that video: https://t.co/8Ijf547clm

2

244

11

175

73K

si_pbc retweeted

roon

@tszzl

3 months ago

@tbpn @devanshpandey @Roon my blog post in ‘22 actually emphasizes the importance of adept (rip) but also just about the utility of being able to prompt a computer use agent - because a prompt is text and can be arbitrarily created, transformed, piped, split, forked, etc https://t.co/chotPjd6to

4

66

5

23

10K

si_pbc retweeted

TBPN

@tbpn

3 months ago

Standard Intelligence's @devanshpandey responds to @tszzl's tweet that "text is the universal interface," and explains why their new foundation model is trained on video: "At some point in the arbitrarily long future, if we only use text models, we could force most things to be text. But I think there are just a lot of things that are much more native when done from a computer-use [perspective]." "GUIs are designed for humans to use. We have this massive long tail of things on the internet that are entirely undoable by LLMs." "For example, when I do ML engineering most of my time is spent doing the grunt work of engineering. It's a lot of looking at graphs, analyzing, and comparing loss curves. You can do this in text, but it's a much larger pain than doing it in the native interface." "There's a reason humans don't interact with a computer purely through text, it would kind of suck."

8

317

10

155

64K

si_pbc retweeted

yudhister

@yudhister_

3 months ago

in other words: we've counterfactually accelerated the automation of white-collar labor by at least a month

2

117

3

20

14K

Standard Intelligence

@si_pbc

3 months ago

@BenPielstick soon

1

24

0

1

2K

Standard Intelligence

@si_pbc

3 months ago

Computer use models shouldn't learn from screenshots. We built a new foundation model that learns from video like humans do. FDM-1 can construct a gear in Blender, find software bugs, and even drive a real car through San Francisco using arrow keys.

188

4K

401

2K

1M

Standard Intelligence

@si_pbc

3 months ago

@dejavucoder infra go brrr

0

15

0

2

448

Standard Intelligence

@si_pbc

3 months ago

@ToppaTib all done through video!

0

3

0

2K

si_pbc retweeted

roon

@tszzl

3 months ago

feels like a pivotal moment for realtime

30

2K

67

645

203K

Standard Intelligence

@si_pbc

3 months ago

@agniv_s soon!

1

14

0

600

si_pbc retweeted

Ayaan Naveed Malik

@ayaannmalik

3 months ago

some great design decisions here. masked diffusion, binning of delta mouse movements, IDM in the wild, self-supervision embedding objectives video modeling, computer use, and robotics are not too far away from each other. great job to the @si_pbc team!

0

51

2

12

7K

si_pbc retweeted

andrew gao

@itsandrewgao

3 months ago

computer use today lags pretty far behind other capabilities. a lot of it depends on the model guessing the right pixel coordinates to click on, which just feels so jank. what's even more of an issue: interacting with the web is hard to do properly by taking screenshots and not having a continuous stream of info (you can't watch videos, you miss important but disappearing elements and visual feedback, etc) excited to see more

6

94

4

22

14K

Standard Intelligence

@si_pbc

Last Seen Users on Sotwe

Trends for you

Most Popular Users