GSV upon further reflection... @bootstrap_yang - Twitter Profile

Pinned Tweet

GSV upon further reflection... @bootstrap_yang

over 1 year ago

should we keep models exploitable but clumsy as they are, or to actually teach them to think properly, harden them against adversaries and actually make them lethal? the current trend is the worst of both worlds - no robustness guarantee and increasingly agentic

1

3

0

554

bootstrap_yang retweeted

sophia vysparov (in toronto june 24-30)

@oxa11ce

about 9 hours ago

btw the opanai logo is a paperclip machine

7

213

9

34

9K

bootstrap_yang retweeted

Naomi Saphra @nsaphra

about 15 hours ago

We don’t always know what problems are hard for LLMs. So devs evaluate on tasks HUMANS find hard or on broad benchmarks. What if we could instead anticipate which scenarios a model will fail on—all without evaluating specific input examples? 🧵NEW PAPER by @jenniferlumeng &al

nsaphra's tweet photo. We don’t always know what problems are hard for LLMs. So devs evaluate on tasks HUMANS find hard or on broad benchmarks. What if we could instead anticipate which scenarios a model will fail on—all without evaluating specific input examples?

🧵NEW PAPER by @jenniferlumeng &al https://t.co/nf8E21emSK

5

209

35

143

18K

GSV upon further reflection... @bootstrap_yang

1 day ago

This is most likely fake news btw

0

12

Who to follow

Louis Arge

@louisvarge

interested in consciousness tech, animal welfare, enlightenment, and a bunch of other things

Ethan Kuntz

@KanizsaBoundary

the field wiggled and here I am https://t.co/sEJHLys1aG https://t.co/MlPqXiHshe

Rares Mircea

@Rares82

GSV upon further reflection... @bootstrap_yang

1 day ago

Wellness retreat (my bedroom, with AC)

vik

@vikhyatk

2 days ago

this is the part you generally want to avoid

45

2K

40

487

411K

1

0

63

bootstrap_yang retweeted

Rhys

@RhysSullivan

3 days ago

last one

159

47K

3K

2K

979K

bootstrap_yang retweeted

Eliezer Yudkowsky

@allTheYud

3 days ago

I have a weird feeling -- and please note, my weird feelings are not always reliable -- that this may be the beginning of things starting to get weird.

116

2K

114

334

157K

bootstrap_yang retweeted

7384254c @7384254b

3 days ago

@moldbugchaser

2

3

1

0

115

GSV upon further reflection... @bootstrap_yang

5 days ago

Bro thinks tower of Hanoi proves you're "logically smart" 😭

Math Files

@Math_files

8 days ago

276

5K

270

1K

12M

0

15

bootstrap_yang retweeted

ImNotTheWolf

@ImNotTheWolf

5 days ago

@rohanpaul_ai

3

154

6

2

14K

GSV upon further reflection... @bootstrap_yang

5 days ago

@TetraspaceWest

0

4

0

317

GSV upon further reflection... @bootstrap_yang

5 days ago

Awesome workshop by Simons once again

0

5

bootstrap_yang retweeted

Hoyeon Chang @hoyeon_chang

6 days ago

The reversal curse. Edits that don't suppress negations. Multi-hop updates that don't propagate. These look like separate bugs. Our ICML 2026 spotlight argues they may share a common geometric origin, visible only when you study how representations move under updates 🧵 (1/11)

hoyeon_chang's tweet photo. The reversal curse. Edits that don't suppress negations. Multi-hop updates that don't propagate. These look like separate bugs.
Our ICML 2026 spotlight argues they may share a common geometric origin, visible only when you study how representations move under updates 🧵
(1/11) https://t.co/kOGqR2Hj8x

3

81

18

47

9K

GSV upon further reflection... @bootstrap_yang

5 days ago

This is crazy https://t.co/U75Zg6XPbr

0

5

bootstrap_yang retweeted

matt

@MattVMacfarlane

6 days ago

Was using Fable 5 to write my world model training code. Anthropic flagged it as frontier AI research. The steering vector kicked in and it started implementing JEPA 🤨

62

3K

98

351

237K

GSV upon further reflection... @bootstrap_yang

6 days ago

@jlsteenwyk I clicked in bc the gif is really well done and self-explanatory, keep that up for sure!

1

0

14

GSV upon further reflection... @bootstrap_yang

6 days ago

Considering jailbreaks are still easy to find (per UKAISI) fable is most likely still a LLM

0

38

GSV upon further reflection... @bootstrap_yang

6 days ago

@shiraeis cinema

0

25

bootstrap_yang retweeted

shira

@shiraeis

6 days ago

who knew takeoff would be fun

144

10K

351

748

532K

GSV upon further reflection... @bootstrap_yang

8 days ago

Now that with shotgun-based drone interceptors the attrition-based drone war is skewing towards defense even more? More firmly forces the impossible triangle of long-range + cheap + evasive-maneuverable attack drones for the offensive side

0

2

GSV upon further reflection...

@bootstrap_yang

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users