Offset Zero @offsetx0 - Twitter Profile

AI COPE “It is just autocomplete.” “It only predicts the next token.” “It cannot reason.” “It only memorized benchmark answers.” “Okay, but benchmarks are fake.” “Okay, but olympiad math is narrow.” “Okay, but open math problems are searchable.” “Okay, but humans verified the proof.” “Okay, but it used known theorems.” “Okay, but it needs compute.” “Okay, but it cannot make coffee.” “Okay, but robotics is different.” “Okay, but physical labor is safe.” “Okay, but people prefer human service.” “Okay, but AGI is not consciousness.” “Okay, but it did not invent mathematics from raw hydrogen atoms.” The last surviving goalpost will be floating somewhere beyond Neptune wearing a parachute.

0

20

Offset Zero

@offsetx0

14 days ago

@GaryMarcus @Noahpinion @demishassabis https://t.co/t0azZqdjWu

Offset Zero

@offsetx0

14 days ago

@ns123abc This is an old video from January. The latest OpenAI solution isn't brute force, it is a new connection across fields, and it matters. He is basically commenting on the OpenAI hype from last year, but ironically AI somehow got there so fast.

1

10

0

2K

0

153

Who to follow

Matthew Lavergne🧩

@mattlavergne12

Tech & Business•Each according to the dictates of his own conscience•09/23/23💗

⚡️THOR⚡️

@ThorInAsgard

Welcome to my world of crypto #Sonic

Offset Zero

@offsetx0

14 days ago

@sumjitg https://t.co/t0azZqdjWu

Offset Zero

@offsetx0

14 days ago

@ns123abc This is an old video from January. The latest OpenAI solution isn't brute force, it is a new connection across fields, and it matters. He is basically commenting on the OpenAI hype from last year, but ironically AI somehow got there so fast.

1

10

0

2K

0

1

0

255

Offset Zero

@offsetx0

14 days ago

@ns123abc This is an old video from January. The latest OpenAI solution isn't brute force, it is a new connection across fields, and it matters. He is basically commenting on the OpenAI hype from last year, but ironically AI somehow got there so fast.

1

10

0

2K

Offset Zero

@offsetx0

17 days ago

@GaryMarcus @ls_brd @emollick He gave it some hints (it wasn't done in one shot, it took multiple steps) because he already had the solution. He might not have been able to give those hints without knowing the solution beforehand.

1

0

58

Offset Zero

@offsetx0

18 days ago

@wtgowers It usually doesn't follow those kinds of rules. With a six minute thinking time, it definitely searched for the answer. You need to use the API instead of the chat interface.

0

2

0

884

Offset Zero

@offsetx0

18 days ago

@jimstewartson @mathelirium Imagine watching a video about decompiling a Transformer’s Feed-Forward Network (FFN) and MoE layers into sparse feature vectors to optimize inference pathways, and your takeaway is "look, a text database." 💀 Confidently talking about architecture you clearly don't understand.

0

5

0

142

Offset Zero

@offsetx0

18 days ago

@rickasaurus “Just a search problem” is the new “just a calculator.” The AI found a bridge nobody imagined, using deep number theory, and overturned an 80-year-old belief. If inventing a whole new connection that stuns Fields medalists isn’t new maths nothing is

0

16

Offset Zero

@offsetx0

18 days ago

@bartek_wl @tunguz nope. They verified it. Model did on its own

1

0

73

Offset Zero

@offsetx0

19 days ago

@N8Programs I think it has the same param size as flash 3, and they simply increased the price like Haiku. If it was a new pretrain, then the knowledge cutoff would not still be Jan 2025 lol

0

1

0

42

Offset Zero

@offsetx0

20 days ago

@zephyr_z9 same model as flash 3, Jan 2025 knolwedge cut off still lol

0

124

Offset Zero

@offsetx0

20 days ago

@OfficialLoganK @GoogleDeepMind But why is the knowledge cutoff still Jan 2025?

0

1

0

171

Offset Zero

@offsetx0

20 days ago

@Rob3rtWozny @scaling01 https://t.co/1Mb3aPEtk8 is better than artifical analysis, there are some issues with some of thier benchmark. Flash score is lower mainly because of terminal bench result, both google and vals reproted it is better than 3.1 pro in terminal bench

0

3

0

1

278

Offset Zero

@offsetx0

20 days ago

@Lentils80 its due to the terminal bench score, it is just avg of all. It seems they messed up something. It is better in terminal bench as reproted by google

0

191

Offset Zero

@offsetx0

23 days ago

@leftcurvedev_ Use bellama.cpp fork and try dflash. With Iq4_xs dflash I get 25 t/s -> 60 t/s more than 2x. Witj mtp I get 40 t/s max

0

2

0

3

407

Offset Zero

@offsetx0

23 days ago

@hu_yifei It's because qwen official api cost is very high, and no provider want to severely undercut the official provider

0

6

0

1

1K

Offset Zero

@offsetx0

26 days ago

@stableAPY it is due to SWA, both use different attention. Use np -1 with gemma 4 to reduce vram usage

1

0

178

Offset Zero

@offsetx0

about 1 month ago

@sarthmit The embedding is not comparable to direct weight. It does not increase additional compute, it just needs slightly more RAM. Besides, it has vision and native audio support, which is the additional size. Qwen 3 is not close to it, enable thinking

1

0

27

Offset Zero

@offsetx0

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users