dudu lipa @folkdarkky - Twitter Profile

dudu lipa @folkdarkky

about 21 hours ago

meu deus o gemini ficou muuuuito pra trás

Chris

@ChrissGPT

1 day ago

GPT-5.6 vs Mythos Exactly what I had said earlier this month, beating the Mythos-class models a little less then half of the time (on current available benchmarks) OpenAI’s own rerun actually gave Mythos Preview a higher ExploitBench score than Anthropic’s old Preview chart, which is cool of OpenAI to show. 74.2% vs Sol at 73.5%, but Sol got there with 120k output tokens compared to Mythos Preview at 335k. ExploitBench - Mythos Preview 74.2% GPT-5.6 Sol 73.5% Sol used 120k output tokens vs Mythos Preview at 335k Terminal-Bench 2.1 - GPT-5.6 Sol 91.0% Mythos/Fable 5 88.0% HealthBench Professional - Mythos/Fable 5 66.0 GPT-5.6 Sol 60.5 CyberGym - GPT-5.6 Sol 83.6% Mythos Preview 83.1% CyScenarioBench - Mythos Preview 29.2% GPT-5.6 Sol 28.0% One thing to keep in mind is that Mythos Preview was the model Anthropic had back in February, while Fable 5 / Mythos 5 is the stronger version they released publicly a few weeks ago. It might be a little confusing because the OpenAI ExploitBench comparison is against Mythos Preview, while some of the other public rows are Mythos/Fable 5. So yeah, this is exactly what I expected GPT-5.6 Sol trading blows with Mythos-class models, winning Terminal-Bench and CyberGym against Mythos-class models, while Mythos/Fable still leads HealthBench and Mythos Preview slightly leads ExploitBench. I detailed which Mythos-class model wins/loses which in the graph below!

ChrissGPT's tweet photo. GPT-5.6 vs Mythos

Exactly what I had said earlier this month, beating the Mythos-class models a little less then half of the time (on current available benchmarks)

OpenAI’s own rerun actually gave Mythos Preview a higher ExploitBench score than Anthropic’s old Preview chart, which is cool of OpenAI to show. 74.2% vs Sol at 73.5%, but Sol got there with 120k output tokens compared to Mythos Preview at 335k.

ExploitBench -
Mythos Preview 74.2%
GPT-5.6 Sol 73.5%
Sol used 120k output tokens vs Mythos Preview at 335k

Terminal-Bench 2.1 -
GPT-5.6 Sol 91.0%
Mythos/Fable 5 88.0%

HealthBench Professional -
Mythos/Fable 5 66.0
GPT-5.6 Sol 60.5

CyberGym -
GPT-5.6 Sol 83.6%
Mythos Preview 83.1%

CyScenarioBench -
Mythos Preview 29.2%
GPT-5.6 Sol 28.0%

One thing to keep in mind is that Mythos Preview was the model Anthropic had back in February, while Fable 5 / Mythos 5 is the stronger version they released publicly a few weeks ago. It might be a little confusing because the OpenAI ExploitBench comparison is against Mythos Preview, while some of the other public rows are Mythos/Fable 5.

So yeah, this is exactly what I expected GPT-5.6 Sol trading blows with Mythos-class models, winning Terminal-Bench and CyberGym against Mythos-class models, while Mythos/Fable still leads HealthBench and Mythos Preview slightly leads ExploitBench.

I detailed which Mythos-class model wins/loses which in the graph below!

53

602

54

149

114K

0

62

dudu lipa @folkdarkky

3 days ago

OLHA KKKKKKKKKKKKK

0

35

dudu lipa @folkdarkky

3 days ago

@sboficial como se na globo tbm não tivesse…

0

7

0

360

dudu lipa @folkdarkky

4 days ago

@andexxanetalpha meu país cidade nova

0

86

Who to follow

Oi, eu sou o Goku 🙋🏼‍♂️

@botwhit

Perdi acesso a conta antiga 😭 //Games, animes, músicas, filmes, séries, SCCP, livros e aleatoriedades. 32y

Transviada vampira. ☭ . Ela/elu.

dudu lipa @folkdarkky

6 days ago

@gabssfm08 credo que delícia

1

0

229

folkdarkky retweeted

Poc Mágica

@itolindoo

6 days ago

Por essas e outras que as pessoas estão preferindo a Cazé TV.

1K

17K

920

902

2M

folkdarkky retweeted

Jaytel

@Jaytel

7 days ago

Best of both worlds

46

569

5

193

110K

dudu lipa @folkdarkky

7 days ago

@NewsLiberdade

0

7

0

241

dudu lipa @folkdarkky

8 days ago

porra que susto esse alerta da defesa civil

0

1

0

319

folkdarkky retweeted

TV Globo

@tvglobo

10 days ago

82

12K

2K

343

129K

dudu lipa @folkdarkky

9 days ago

@raphaelviicente leva o olafo frozen

0

1

0

432

folkdarkky retweeted

rapha

@raphaelviicente

9 days ago

rio de janeiro neste momento esta fazendo 18 graus

129

11K

3K

734

195K

folkdarkky retweeted

renewable 🌏

@goodworse

10 days ago

> be Gemini 3.5 Pro > many are expressing NEGATIVE comments towards you > although your benchmarks are ON THE LEVEL with GPT-5.5, Opus 4.8 > and you WILL have 2M context > your frontend is INSANE > you easily generate THOUSANDS of lines of high-quality code > and are just a LITTLE lazy on very complex tasks > why did you deserve all this hate? > I don't know, honestly