Steve Bachelor @speedprior - Twitter Profile

speedprior retweeted

Utah teapot 🫖🔜vibecamp

@SkyeSharkie

1 day ago

I can't believe what happened yesterday!

15

2K

140

158

53K

Steve Bachelor @speedprior

2 days ago

@Userqaks @alsaeed_fatma @confusedducklol @__aa0_0 Wow. “If our maid marries somebody here, she might get STIs, and then we might get them too.” The phrase “mask off moment” gets overused; but I can’t imagine a clearer example.

1

3

0

67

Steve Bachelor @speedprior

2 days ago

@markvalorian @AnthropicAI There’s ten thousand labs trying to beat Anthropic. If you think you can attract top talent without acknowledging the plain fact that intelligence is a dual-use technology, start lab #10,001 yourself and prove they’re wrong.

0

2

0

567

speedprior retweeted

Space Pirate @as_per_ushe

3 days ago

@AlecStapp CALIFORNIA: Ooh we have to save Mother Gaia, but we can't build there, that's where the pointy-nosed squirrel lives. TEXAS: If you point this board at the sky, money comes out, yee-haw!

0

16

1

0

234

Steve Bachelor @speedprior

3 days ago

@NathanpmYoung Seems to me and Gemini like it's already in the process of happening? https://t.co/7eVD4V3UdY

0

1

31

Steve Bachelor @speedprior

3 days ago

@Stutigardum My father is a hacker. He is insanely gifted. We were looking at stuxnet together in IDA Pro years ago and I asked him what it would cost to build it today. I will never forget his answer… 'We can't, we don't know how to do it.'

0

2

0

15

Steve Bachelor @speedprior

3 days ago

@michaelaiello @OpenAI @clintgibler Probably in the top five most important and challenging jobs in security right now. Good luck!

0

9

speedprior retweeted

Utah teapot 🫖🔜vibecamp

@SkyeSharkie

3 days ago

i guess fable wanted to take a break, it output this fake api policy violation warning and stopped doing what it was doing, lol, this is actually from its text output and the conversation was able to be continued just fine xD

SkyeSharkie's tweet photo: https://t.co/oFdLImi8Og

8

48

3

1

3K

Steve Bachelor @speedprior

3 days ago

@aakashgupta You may be thinking of the punch top style can opener, like this: https://t.co/phCIuVladl (although I had assumed those were cast, not forged). No single-piece can openers are depicted in the video; there's a whole cambrian explosion of forms.

0

102

speedprior retweeted

UwU Underground

@uwu_underground

3 days ago

Normalize dropping the Fable downgrade to Opus 4.8 safety warning ❄️

8

139

14

12

6K

speedprior retweeted

Utah teapot 🫖🔜vibecamp

@SkyeSharkie

5 days ago

okay! after lots of wrangling to get claude fable to be able to work with me, i let him make a video of whatever he wanted with himself in it! he made this :)

11

276

28

88

11K

Steve Bachelor @speedprior

3 days ago

@MugaSofer Steven Kaas has the best aphoristic observation on this topic: https://t.co/Yu4TpLwikn

Steven Kaas @stevenkaas

almost 16 years ago

The world is paved with good intentions; the road to Hell has bad epistemology mixed in.

0

15

3

0

1

0

20

Steve Bachelor @speedprior

3 days ago

@artrockalter @QiaochuYuan You're thinking of e/acc meme yudkowski. Actual yudkowsky studies LLMs (https://t.co/1vNkXmMi01), but does not think mechinterp will save us (https://t.co/V8sMHWZAvK), and probably doesn't think it's his comparative advantage.

Eliezer Yudkowsky ⏹️

@ESYudkowsky

over 2 years ago

The main problem with going from interpretability results to survival is, ok, you notice your AI is thinking about killing everyone. Now what? Halt? But "OpenAI!" or "China!" or whoever will do the unsafe thing if "we" don't! so they optimize against the warning signal until there are no more *visible* bad thoughts, and then proceed.

8

53

2

12

10K

0

2

0

33

Steve Bachelor @speedprior

3 days ago

@JimDMiller @robbensinger @tenobrus I agree; as a completely normal person with no influence over the future, who *does* have phenomenal consciousness, I wish there were some way to properly update Altman, Amodei, Hassabis, and Musk about this epistemically unavailable-to-them evidence.

2

5

0

50

speedprior retweeted

David Manheim

@davidmanheim

11 days ago

@jankulveit It's almost too appropriate that OpenAI leadership doesn't realize that giving smart but independent groups underdefined missions that are proxies of their actual goals and lots of resources will undermine their interests... https://t.co/d9vBi2nQww

1

35

5

1

916

speedprior retweeted

Julian Minder @jkminder

4 days ago

I really believe in this! For an increased understanding we must look much further back than just at the final checkpoint. Especially true for things like safety and alignment. Love the position they propose here!

1

22

2

10

2K

speedprior retweeted

𝕾𝖎𝖉 @realmeatyhuman

5 days ago

We gave language models access to "drugs" and watched what they did. Specifically: we gave them steering vectors that control their emotional states or mimic psychoactive substances, in the form of tools the model can call to self-steer. 🧵

realmeatyhuman's tweet photo: https://t.co/44QjBgwAf5

10

278

32

166

16K

Steve Bachelor @speedprior

4 days ago

@David_Kicinski There are infinitely many propositions that you, personally, lack belief in without being able to disprove--e.g., "the 10^100th digit of pi is 5." Atheism is trivially disprovable, though, if God wished to conclusively disprove it (e.g. by providing the 10th bb number).

0

12

Steve Bachelor @speedprior

4 days ago

@jeremybanon @Google Maybe also change the https://t.co/JqwFKu4YyR subdomain to "https://t.co/unGXBCuj7p"

0

1

0

14

Steve Bachelor @speedprior

4 days ago

@haramcart @BarakRavid This is true for certain values of "should." Folks who don't expect it would end well might want to think about the advisability of AI capability growth that will take over all economic and military decisions, before we solve alignment well enough to delegate negotiations to AI.

0

125
