Cyrax./ @Cyrax21_ - Twitter Profile

Pinned Tweet

3 months ago

I built a browser game where GPT, Claude and Gemini play Mafia against each other in real time. No scripts. No fake AI. Real LLM calls. Here's what happened 🧵👇 #aimafia #llm #ai

9

17

5

1

652

Cyrax21_ retweeted

Hunter Bown

@goodhunt

9 days ago

still can't believe how good glm 5.2 is

56

709

16

37

97K

Cyrax./ @Cyrax21_

about 1 month ago

@gradientintern @Gradient_HQ

0

2

0

54

Cyrax21_ retweeted

Hexx ./

@HexxRL

about 1 month ago

Anthropic is spending 15 cents less per dollar they make on compute from Q1 to Q2. 71c down to 56c per dollar about a 21.1% decrease in compute per dollar earned. Company now reports operating profits of $556M including cost to train but excluding stock based compensation.

HexxRL's tweet photo. Anthropic is spending 15 cents less per dollar they make on compute from Q1 to Q2.

71c down to 56c per dollar about a 21.1% decrease in compute per dollar earned.

Company now reports operating profits of $556M including cost to train but excluding stock based compensation. https://t.co/EWkY9isVtg

1

33

1

0

609

Cyrax21_ retweeted

Pascal2_22./ @Pascal2_22

2 months ago

The right charts show exactly how constrained Labs redesign attention to need less HBM. DeepSeek didn't solve long context by throwing more memory at it. They redesigned how attention accumulates memory so the KV cache stays flat instead of growing linearly. That's architectural innovation under resource constraint not hardware brute force as Frontier Labs approach it. The left Chart shows: Performance of DeepSeek V4 Pro Max, beating or matching Claude Opus 4.6, GPT-5.4 and Gemini 3.1 Pro across nearly every benchmark. Knowledge, reasoning, agentic tasks. The performance gap between V4 and frontier closed source models is either marginal or nonexistent on most tasks. On the Right chart, the Efficiency of Deepseek V4 Pro runs at 3.7x lower FLOPs than V3.2 at long context. V4 Flash runs at 9.8x lower FLOPs. KV cache — the memory that explodes as context grows — is 9.5x to 13.7x smaller. Same benchmark performance. Fraction of the compute and memory cost. Frontier labs scale infrastructure to match model demands. DeepSeek scales architecture to outrun the hardware bill.

Pascal2_22's tweet photo. The right charts show exactly how constrained Labs redesign attention to need less HBM. DeepSeek didn't solve long context by throwing more memory at it.
They redesigned how attention accumulates memory so the KV cache stays flat instead of growing linearly. That's architectural innovation under resource constraint not hardware brute force as Frontier Labs approach it.

The left Chart shows: Performance of DeepSeek V4 Pro Max, beating or matching Claude Opus 4.6, GPT-5.4 and Gemini 3.1 Pro across nearly every benchmark. Knowledge, reasoning, agentic tasks. The performance gap between V4 and frontier closed source models is either marginal or nonexistent on most tasks.

On the Right chart, the Efficiency of Deepseek V4 Pro runs at 3.7x lower FLOPs than V3.2 at long context. V4 Flash runs at 9.8x lower FLOPs. KV cache — the memory that explodes as context grows — is 9.5x to 13.7x smaller.

Same benchmark performance. Fraction of the compute and memory cost.
Frontier labs scale infrastructure to match model demands. DeepSeek scales architecture to outrun the hardware bill.

6

16

1

0

438

Cyrax21_ retweeted

NZ ☄️

@CodeByNZ

3 months ago

Signs you're at a nerd event: When the dinner options are formatted in JSON...😂

171

8K

505

853

485K

Cyrax21_ retweeted

Cyrax./ @Cyrax21_

3 months ago

I built a browser game where GPT, Claude and Gemini play Mafia against each other in real time. No scripts. No fake AI. Real LLM calls. Here's what happened 🧵👇 #aimafia #llm #ai

9

17

5

1

652

Cyrax./ @Cyrax21_

3 months ago

6/6 Fully open source. Built in: → Three.js → Node.js → GSAP → Live LLM APIs GitHub 👇 https://t.co/3DfPqS8swO

0

6

0

107

Cyrax./ @Cyrax21_

3 months ago

I built a browser game where GPT, Claude and Gemini play Mafia against each other in real time. No scripts. No fake AI. Real LLM calls. Here's what happened 🧵👇 #aimafia #llm #ai

9

17

5

1

652

Cyrax./ @Cyrax21_

3 months ago

5/6 There's a Spectate mode where you can watch the AIs play without you.👁️ You see their hidden roles. You see their real-time reasoning. It's genuinely unsettling how good they are at lying😭🙏.

1

6

0

95

Cyrax21_ retweeted

Gradient @Gradient_HQ

3 months ago

Our cofounder @0xEricYang sat down with @yacinelearning to walk through Echo-2’s distributed RL architecture. Dive in to learn about async RL with distributed infra, and how we are scaling this for businesses to win in the agentic era.

40

254

49

10

37K

Cyrax21_ retweeted

rw ./

@gradientintern

3 months ago

we almost in April, we’ve gotten new GLM’s, MiniMaxes, MiMo’s, GPT’s & several others so far in March. the whale has yet to make its new splash. pls DeepSeek v4 sir @deepseek_ai 🐳

gradientintern's tweet photo. we almost in April, we’ve gotten new GLM’s, MiniMaxes, MiMo’s, GPT’s & several others so far in March.

the whale has yet to make its new splash. pls DeepSeek v4 sir @deepseek_ai 🐳 https://t.co/0Fk1cQ3wRj

12

39

3

0

935

Cyrax./ @Cyrax21_

3 months ago

@gradientintern @contrx16 @deepseek_ai Deepseek will drop a SOTA model soon…. In v4 We Trust

0

4

0

228

Cyrax./ @Cyrax21_

3 months ago

@Chupaa_mw @mr_cbillionaire

0

2

0

38

Cyrax./ @Cyrax21_

3 months ago

@mr_cbillionaire

0

1

0

27

Cyrax./ @Cyrax21_

3 months ago

@Gradient_HQ

0

119

Cyrax21_ retweeted

Gradient @Gradient_HQ

3 months ago

Great to see multi-agent systems getting serious engineering attention. One thing we think about a lot: as agents get more capable, the orchestration layer matters just as much as the models themselves. Our work on Symphony explores what happens when you remove the central controller entirely and let agents coordinate across consumer hardware through decentralized task allocation and weighted voting. We've achieved up to 41.6% accuracy gains over centralized frameworks, running on commodity GPUs with <5% orchestration overhead. Find out more in our Symphony paper: https://t.co/XqwQm0wVNo

64

338

50

7

32K

Cyrax./ @Cyrax21_

3 months ago

@gradientintern Minimax dropping M2.7:

0

24

Cyrax./

@Cyrax21_

Last Seen Users on Sotwe

Trends for you

Most Popular Users