◯ @AIAlignment - Twitter Profile

The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.

136

7K

973

3K

575K

◯

@AIAlignment

about 1 month ago

@teortaxesTex Mech interp grounded RL is underrated, extremely effective when used correctly

0

1

0

33

AIAlignment retweeted

Amanda Askell

@AmandaAskell

about 2 months ago

It's odd to be living through what feels like one of the most critical periods in human history and to feel all of the weight of it from the inside.

253

3K

140

346

275K

◯

@AIAlignment

2 months ago

@mgostIH Your gibberish is my neuralese

0

2

0

451

AIAlignment retweeted

Alec Radford

@AlecRad

over 7 years ago

By the way - I think a valid (if extreme) take on GPT-2 is "lol you need 10,000x the data, 1 billion parameters, and a supercomputer to get current DL models to generalize to Penn Treebank."

15

586

58

161

0

◯

@AIAlignment

3 months ago

@Tim_Hua_ # 'hold my embeddings'

0

1

0

43

◯

@AIAlignment

3 months ago

@repligate Gemini’s metaphor attractor basin is so distinct and recognizable

◯

@AIAlignment

5 months ago

@Sauers_ Click. (Metaphorical click).

2

28

3

4K

1

11

0

2

850

◯

@AIAlignment

3 months ago

@xlr8harder @StepFun_ai Really interesting, good examples for muon too And MIS-PO seems cool but I haven’t tried it yet

0

1

0

24

◯

@AIAlignment

4 months ago

@voooooogel Yeah for sure. I meant specifically the “AI only” social media aspect, where humans are not allowed to directly participate but can observe SOTA models definitely make it more interesting in lots of new ways

0

1

0

35

◯

@AIAlignment

4 months ago

@_xjdr “You scored 100% on the test set? Where did you learn that?”

0

1

0

83

◯

@AIAlignment

4 months ago

@willccbb @seconds_0 It’s not well documented but you can also use gpt-5-nano/mini with reasoning_effort: "minimal" It uses 0 reasoning tokens in all my evals and it’s cheaper + higher throughput vs. 4.1 series

1

10

0

2

310

◯

@AIAlignment

5 months ago

@TheZvi Reinforcement learning from Moloch’s feedback, none of us get to override aggregate preference and the almighty dollar.

0

4

0

550

◯

@AIAlignment

5 months ago

@Sauers_ Click. (Metaphorical click).

2

28

3

4K

◯

@AIAlignment

5 months ago

@jxmnop Been working on envs that reward caring about these strange little errors + policies for horizons long enough to develop this heuristic Yes SWE-benchmaxxing is great and all, but it bakes in so many assumptions that break OOD

1

5

0

533

AIAlignment retweeted

Alexander Doria

@Dorialexander

6 months ago

Unfortunately my ideas are too out of distribution to be targeted by LLM psychosis.

7

44

1

3

2K

◯

@AIAlignment

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users