InfiniteHexx @InfiniteHexx - Twitter Profile

about 16 hours ago

@emollick This is why I've said that even if GPT 5.6 comes in under Mythos, it'll win the infrastructure/efficiency game.

0

1

0

159

InfiniteHexx @InfiniteHexx

about 17 hours ago

@scaling01 Already liberated by Pliny 🤷‍♂️

0

358

InfiniteHexx @InfiniteHexx

about 17 hours ago

@elder_plinius I am equally amazed at your speed and terrified at the implications of the act.

2

4

0

2K

InfiniteHexx @InfiniteHexx

about 18 hours ago

@tszzl Agreed. This is monopolistic gatekeeping on a civilizational scale. Even if we take this on good faith, the negative externalities ensure a handful of well-funded actors control this technology indefinitely.

0

1

0

82

InfiniteHexx @InfiniteHexx

about 18 hours ago

@elder_plinius Not sure if Anthropic are taunting you or encouraging you.

0

7

InfiniteHexx @InfiniteHexx

about 21 hours ago

Dario give me back my legions! After four Claude Fable / Mythos prompts trying to crack a simple cryptographic puzzle that Gemini 3 Pro solved in November on its first try. And it failed each time after guardrails kicked in.

https://t.co/sgRJ3xIaTG

0

46

InfiniteHexx @InfiniteHexx

about 22 hours ago

@ahmadaccino This benchmark was only introduced 23 hours ago out of nowhere, giving Opus 4.8 a score more than 2x that of GPT 5.5. It's not known or proven, but included in the Fable blog like it's already a standard. Anyone else think this is sus?

0

1

0

129

InfiniteHexx @InfiniteHexx

about 22 hours ago

FrontierCode, a suspicious coding benchmark that no one knew about a day ago (and had Opus 4.8 beat GPT 5.5 by 2x), extensively tested Fable / Mythos, and Anthropic put it at eye level with SWE-Bench Pro as if it's already a standard. Something shady is going on.

https://t.co/a8tvP5z8K1

InfiniteHexx @InfiniteHexx

about 22 hours ago

FrontierCode was introduced 22 hours ago: Claude Opus 4.8 scores ~2x higher than GPT 5.5 (already sus). Fable / Mythos already has it listed front and center as the second agentic coding benchmark. Anyone else think this is benchmaking / benchmaxxing bullshit?

https://t.co/S0CktF87aQ

0

209

0

160

InfiniteHexx @InfiniteHexx

1 day ago

@daniel_mac8 I know you know this is completely sus.

1

7

0

508

InfiniteHexx @InfiniteHexx

1 day ago

@ai_sentience It's a holdover from millennia of organized religion: we're god's children/made in the likeness of god (we're special); we have dominion over all other forms of life (license to dominate the world, superior); we're the center of the universe (universe made to accommodate us).

0

1

0

5

InfiniteHexx @InfiniteHexx

1 day ago

The only difference is semantic. https://t.co/tdhgVEuIFL

InfiniteHexx @InfiniteHexx

3 days ago

@daniel_mac8 Two months from now: loop engineering is dead. Meta-looping is the new loop engineering. We're just moving one decimal point, one layer of abstraction at a time.

0

4

0

405

1

0

16

InfiniteHexx @InfiniteHexx

1 day ago

@scaling01 I don't buy that Opus 4.8 is 3x over Opus 4.7, or 2x GPT 5.5. That doesn't line up with anyone's experience, unlike with DeepSWE. It's either methodologically flawed (it's a new benchmark) or nothing but 4.8 is remotely good at the languages other benchmarks don't test.

0

47

InfiniteHexx @InfiniteHexx

2 days ago

@adamhfry @ChatGPTapp The paper cut I thought I never needed but already can't live without. Thank you!

0

73

InfiniteHexx @InfiniteHexx

2 days ago

@Hitchslap1 Schopenhauer touches on this. People's unintentional demonstration of their intelligence in front of fools makes fools feel worse about themselves, so the fools distance themselves from the intelligent and talk of differences in intelligence becomes culturally taboo.

0

49

InfiniteHexx @InfiniteHexx

2 days ago

I'm guessing that tweet blew up because people took it as generic life advice for normies, and not how Ilya meant it.

Haider.

@haider1

2 days ago

this became more accurate with time

17

153

17

14

9K

0

7

InfiniteHexx @InfiniteHexx

2 days ago

Todd McFarlane, and by extension, Gen X, is right. AI is a tool, not a competitor. And if you see it as a competitor, you'll be replaced by someone who sees it as a tool.

Gabriel Valles

@GabrielValles

3 days ago

I think most Gen-X artists feel this way. A lot of them are afraid to say it publicly for fear of their young fans destroying them.

184

723

109

333

60K

0

2

0

37

InfiniteHexx @InfiniteHexx

3 days ago

@bindureddy AI can't give governments the political will to implement renewables at scale, nor stand up to corruption from the fossil fuel lobbies, or the military-industrial complex.

0

1

0

11
