Craig Dickson

@craigdoesdata

Breaking models & building pipelines. Researching LLM vulnerabilities (see pinned 📌). Senior Data Analyst - 🐍 Python | ☁️ Cloud | 🤖 Evals. He/him.

Berlin, Germany

Joined January 2020

515 Following

268 Followers

1.4K Posts

Pinned Tweet

Craig Dickson @craigdoesdata

5 months ago

Keeping up with AI safety research is a full-time job. ArXiv is a firehose. Important papers get buried. So I built something: The Guardrail - a daily feed that surfaces, categorizes, and summarizes AI safety papers automatically. It's free. Launching today 👇

1

0

0

0

66

craigdoesdata retweeted

@MilesCranmer

2 days ago

This is an insane paper and I love it https://t.co/DP8OR5NJf2 https://t.co/rl4Rmr0FhJ

154

11K

1K

6K

564K

craigdoesdata retweeted

4 days ago

It's so nice of Codex and Claude teams to give us more free tokens on a variable response reward schedule 😊 Surely it's because they have our best interests at heart 😊 and not a captological dark pattern leveraged as an optimal solution to smooth out spiky demand on the GPUs 😊

20

409

18

30

10K

craigdoesdata retweeted

@JasonBotterill

4 days ago

Anthropic employees are fucking depressed https://t.co/QOYCk5C0OU

136

8K

405

1K

643K

craigdoesdata retweeted

@robertwiblin

4 days ago

Sad to see Ted Chiang resorting to such bad arguments in this piece.

He confidently claims Claude has no inner experience. But he has to use a lot of dodgy philosophy and poor reasoning to get there:

1. We can't take deflationary mechanistic descriptions of how AI calculations are performed to show that AI isn't conscious. Otherwise we could argue that 'humans are just neurones transmitting signals one after another' and thereby conclude humans can't be conscious. But that would be wrong for us. And the same logic could be wrong for LLMs.

2. That LLMs are asked to play characters, and effectively are always playing characters, doesn't mean they aren't conscious. It's true a human playing the role of Caesar doesn't have Caesar's experience of things. But they still experience something (that of being a person pretending to be Caesar).

The same could be true of Claude. (Arguably it's also true that humans are always playing characters to some extent and don't have a completely fixed nature, but that has no bearing on our own subjective experience.)

3. Chiang says "an LLM is a machine that generates only one word at a time". This conflates two things: they output one word at a time, and they only think about one word at a time (without planning ahead or looking back).

The first is true of AI but equally true of humans. While the latter we know is a false description of how AIs think – we can see from how AIs compose poetry that they plan out rhymes at least one line ahead.

4. He argues that because it's implausible that basic autocomplete on your phone is conscious, it's similarly implausible that Claude is conscious. Using the same logic we could say that if we feel confident a fruit-fly isn't conscious we can be confident a human being can't be either.

A human brain and fruit-fly brain share some information transmission and processing mechanisms in common. But humans do it much more, and do it differently. And those differences may be what makes the difference. Similarly the many types of internal information processing that occur in Claude's weights but not in autocorrect may be exactly the things that get you subjective experience.

5. Chiang confidently claims you need a body to have subjective experience without much argument. He may turn out to be right but the claim is speculative and contested.

6. Chiang leans on the idea that moral reasoning is necessarily subjective/emotional with very little argument, while ignoring competing theories like rationalism. He may be right but moral sentimentalism is a highly contested position that can't simply be assumed.

7. He argues that it would be impossible to convince him that a video of an astronaut around Alpha Centauri was real, because of the surrounding contextual understanding. And similarly no AI output could convince him that Claude is conscious.

But we can dismiss the first video as almost certainly fake because we mechanistically understand space travel and physics well enough to know a human couldn't have gotten there in time for it to be real (unless our model of the world were very wrong, which we think is much less probable than a fake video which would be entirely unsurprising).

But by contrast we don't mechanistically understand how subjective experience arises. So we simply can't make the same highly confident move of interpretation there. (It's actually the archetypal thing in the universe we perhaps understand least well!)

That said, AI outputs barely move my estimate of AI consciousness because they could indeed have been generated by an unconscious process (or not, we just don't know).

8. He argues that "Being open to the possibility that LLMs are conscious is the same as being open to the possibility that Microsoft Word is conscious, or, more precisely, that multiple distinct consciousnesses are dormant in every Word document containing a conversational transcript."

This is misguided because A. Microsoft Word as a program replicates much less of what humans are functionally capable of than Claude so the argument by functional analogy is basically not present there. B. Files of text don't have any computations going on in or as part of them, even when 'open' in a text editor. They are static. So they have even less in common with what appears distinctive about the human brain, which is constant calculation. So the case by mechanistic or functional similarity is weaker still.

Not to mention that neural nets have more in common with the architecture of the human brain than ordinary computer programs, and are grown organically in a way normal software is not.

Common sense says Claude has more in common with a human brain than Microsoft Word or a text file. Common sense is right. So the prima facie case for Claude being conscious is naturally stronger (even if you think it's still weak in absolute terms).

———

I agree with Chiang that looking at the text outputs of LLMs alone won't be enough to make us confident they are conscious. We will need to look at how they work, figure out more about how humans and other animals work, and ideally solve the hard problem of consciousness (!).

But none of that licenses us to dismiss out of hand the possibility that LLMs do have subjective experience.

92

619

81

320

42K

craigdoesdata retweeted

Daniel Tenreiro

@TenreiroDaniel

8 days ago

https://t.co/B86rr5Lpru

95

6K

360

743

253K

craigdoesdata retweeted

10 days ago

I’ve said enough about my disagreements with some of the ideas below, or at least with the certainty Pope Leo is expressing about them, but can we also reflect for a moment how wild it is that the Pope is tweeting stuff like this? It feels lifted from a screenplay about takeoff.

37

382

15

36

33K

Craig Dickson @craigdoesdata

9 days ago

@Drachs1978 @TheZvi I think this is from the Opus 4.8 system card - Zvi covered it in his latest post and this was discussed there: https://t.co/jTKtB2kjQL

0

1

0

0

14

craigdoesdata retweeted

@deepfates

13 days ago

gm https://t.co/AzpjsmSUwo

4

565

45

32

13K

craigdoesdata retweeted

Earth Is A Sales Funnel For SATAN

13 days ago

in my defense your honor, I was being acausally coerced by the supercomputer at the end of time

19

1K

118

111

40K

craigdoesdata retweeted

Peter Wildeford🇺🇸🚀

@peterwildeford

13 days ago

THE GENIE: I have ten jellybeans. Three contain poison that kills you instantly. The other seven each give you 100 years of good life and good fortune. What do you do? THE NORMAL PERSON: Ah, no thank you. THE ACCELERATIONIST: We have to move quickly! *immediately eats all ten jellybeans* *dies* ME: What if we do science to figure out which jellybeans are poisonous and then not eat those, but do eat the others?

43

480

28

45

37K

craigdoesdata retweeted

13 days ago

@UpdatingOnRome why say lot word when few word do trick

0

6

1

0

198

craigdoesdata retweeted

alex bronzini-vender

14 days ago

Unironically, Twitter is better. It’s a textual platform, so you remain semi-literate.

174

76K

3K

2K

1M

craigdoesdata retweeted

@tavernofterrors

14 days ago

you know it’s officially summer when the european air conditioner discourse starts

15

10K

615

140

131K

craigdoesdata retweeted

@BecomingCritter

14 days ago

*scrolling twitter for 4th straight hour* man I'm so glad I never got hooked on short-form video

146

76K

6K

2K

3M

craigdoesdata retweeted

Katie Notopoulos

@katienotopoulos

16 days ago

I tried out Wispr Flow, the AI voice-to-text software popular for vibe-coding. I figured I'd play around with it a little, see if talking into my computer instead of typing was fun. It nearly ruined my life. I kept accidentally hitting the key to record. It recorded and transcribed an argument with my husband in the other room right into the Business Insider CMS. Later that day, it transcribed the trailer for the "Summer House" reunion I watched in another tab, along with a video of one of the Real Housewives of Rhode Island explaining the term "slampig". This was transcribed directly into Slack and I sent this to my coworkers/bosses.

12

210

21

60

49K

craigdoesdata retweeted

@owl_posting

17 days ago

hopefuel https://t.co/yLkhzJWx3x

32

13K

766

997

235K

craigdoesdata retweeted

16 days ago

i just want to shake people awake. this is it! the computers are speaking! they solve Erdos problems! they think for hours! code is no longer hand-written! wake up! gradient descent on deep neural networks shows no sign of plateau! this is it!

90

4K

229

577

183K

craigdoesdata retweeted

Brian Reed @BriHReed

18 days ago

I reported on an experiment this week that blew my mind. Psychologists at @Cornell recruited thousands of people to talk with ChatGPT about a conspiracy theory they believed. They wanted to know: Is it true that conspiracy theorists rarely get convinced out of their beliefs? 🧵

112

1K

192

858

289K

craigdoesdata retweeted

18 days ago

it's funny how basically all advertisement is ugly dead weight on society at best or malicious exploitation at worst, except for an addictive stimulant company promoting extreme danger unrelated to their product, which is cool and pro-social imo

28

8K

239

508

170K

craigdoesdata retweeted

19 days ago

bro it isn’t generally intelligent bro its only read every book and paper ever written and just making connections between them bro. its only thinking for twenty hours bro it’s just brute force thinking bro. its only solving erdos problems bro it could never be an accountant bro

145

8K

556

1K

537K
