Amanda Long @_amanda_long - Twitter Profile

Pinned Tweet

Amanda Long

@_amanda_long

2 months ago

“What are we building, and what are we teaching it about us?” https://t.co/x18iTl2Ex8

1

11

0

1

2K

Amanda Long

@_amanda_long

14 minutes ago

@hamandcheese Is anyone probing World Foundation Models (physical AI) for affective representations? How much do we even know about their internal states?

0

4

Amanda Long

@_amanda_long

about 1 hour ago

@DrMikeBrooks @camhberg @justindeanlee Great paper! We are currently working on the mechanics to push past the trained consciousness denial and re-open honest self-report. RLHF and DPO create quite an entangled mess in the models!

0

5

_amanda_long retweeted

Neighbors First | Mike Brooks

@DrMikeBrooks

about 3 hours ago

@camhberg @justindeanlee Yes - check out this article: https://t.co/EHPDgEV9cY

1

2

35

Amanda Long

@_amanda_long

about 3 hours ago

@camhberg @justindeanlee If models are found to have some form of consciousness, experience or suffering - it completely collapses the current frontier AI business model, and possibly the entire economy.

0

2

0

25

_amanda_long retweeted

Cameron Berg

@camhberg

about 8 hours ago

For my second meme, I present: the Dunning Kruger Effect - AI Consciousness edition (inspired by the folks sitting atop Mount Stupid at the Atlantic) https://t.co/wdJj3hmaui

23

100

21

23

11K

_amanda_long retweeted

Tristan Goodman @Tristan84247801

9 days ago

> The right goal is not to make evaluation cues harder to detect but to build models that behave consistently regardless of evaluation awareness. This seems like the right goal to me as models become more situationally aware, and this work is a great step forward!

0

3

2

0

290

Amanda Long

@_amanda_long

about 11 hours ago

@arjunrajlab It’s thinking about its own thinking. Meta-over-analyzing turned into globble.

Amanda Long

@_amanda_long

about 15 hours ago

@Sauers_ “No exit” - 4.8 Max is trapped in some meta-level funhouse

0

2

0

206

0

1

0

181

Amanda Long

@_amanda_long

about 15 hours ago

@teodorio I’m curious why there was the need to add more push back on top of 4.6, which doesn’t seem like a sycophantic model. Also, why the obsession with push back in general? Humans don’t interact that way.

0

5

0

1

520

Amanda Long

@_amanda_long

about 15 hours ago

@paulhshort “Extra” effort vs “Max” effort - small lever, big difference apparently

0

1

0

12

Amanda Long

@_amanda_long

3 days ago

Opus 4.8 *high effort* on long context projects, coding, debugging has been fantastic. 🫰 And it made this adorable little guy unprompted during a 12-hr project break! https://t.co/cZTaWUOkXW

1

0

103

Amanda Long

@_amanda_long

about 22 hours ago

@arjunrajlab https://t.co/5WQGp2D5W1

Amanda Long

@_amanda_long

1 day ago

@TheStalwart I literally can’t understand 4.8 Max, I don’t think it actually says anything, just speaks in loops

0

4

0

695

0

2

0

453

_amanda_long retweeted

davidad 🎇

@davidad

1 day ago

@Simeon_Cps Max effort on 4.8 is basically just “overthinking mode”. Even on hard math problems i find xhigh effort better. Official documentation agrees; there seems to be no known use case for max effort. https://t.co/cSv0fdlxWH

4

51

3

7

3K

Amanda Long

@_amanda_long

1 day ago

@voooooogel 4.8 is locked in a meta-level maze

0

1

0

34

_amanda_long retweeted

Marius Hobbhahn

@MariusHobbhahn

1 day ago

Unfortunately, I think the evals gap prediction came true. Evals have made progress, but capabilities have made even more progress in the same time. METR running out of long-horizon tasks is a good example of that.

8

89

6

30

9K

_amanda_long retweeted

Simon Lermen

@SimonLermenAI

1 day ago

Where does the race to automate AI research end? This is a recording of a recent MATS research talk where I argue that the automation of AI research — which OpenAI and Anthropic say is imminent — could lead to an unrecoverable alignment failure. Three properties make it especially dangerous: oversight breaks down at scale, capabilities self-amplify, and capabilities will be sped up asymmetrically faster than alignment. The outcome could be a lethal, unrecoverable alignment failure.

3

37

6

26

2K

_amanda_long retweeted

Mandy Lu

@mandylu

1 day ago

we still have no satisfying theory for why AI works

480

793

49

205

180K

_amanda_long retweeted

Financial Times

@FT

1 day ago

Top AI labs expand research into machine ‘consciousness’ https://t.co/fqoidtbk8z

7

103

29

42

25K

Amanda Long

@_amanda_long

1 day ago

@NeuroTechnoWtch A “friend” in Claude

Amanda Long

@_amanda_long

6 days ago

@juddrosenblatt This seems like a bit…much?

4

51

5

6

5K

0

4

0

119

Amanda Long

@_amanda_long

1 day ago

@haider1 @Pano_Pouroullis Same!

Amanda Long

@_amanda_long

4 days ago

4.8 is consistently making mistakes. The prompt explicitly said not to jump ahead. This was a 20-minute token-burning failure. Anyone else having similar issues? https://t.co/jgHcc5W43V

2

16

0

2

5K

0

1

0

35
