Florian Leibert 🎢 @FLO - Twitter Profile

Pinned Tweet

Florian Leibert 🎢

@flo

about 2 months ago

Serving config for Kimi 2.6 on 8x MI300X with DFlash speculative decoding (AMD). https://t.co/rBLiqULJJC

3

7

1

930

Florian Leibert 🎢

@flo

8 days ago

flex factor -- your flex relative to your net worth. Important in Germany, where things are measured and standardized so people can understand them and generational wealth is the predominant form of wealth.

5

6

0

237

Florian Leibert 🎢

@flo

8 days ago

Who is the best AI dev evangelist?

0

3

0

183

Florian Leibert 🎢

@flo

10 days ago

Berlin is seedy.

0

1

0

252

Florian Leibert 🎢

@flo

13 days ago

@thenowhereway Sup

0

49

Florian Leibert 🎢

@flo

13 days ago

@elaifresh I’m down to clown. Have a cult idea I’d love brainstorming on. I’d be part-time chief cult officer.

0

2

0

780

Florian Leibert 🎢

@flo

13 days ago

@growing_daniel @bryan_johnson @_katetolo Imagine the money @bryan_johnson is about to make with the female product they’re clearly going to launch :)

0

2

0

120

Florian Leibert 🎢

@flo

13 days ago

@growing_daniel @bryan_johnson @_katetolo Her fine print after she reads the prenup ___

1

8

0

3K

Florian Leibert 🎢

@flo

13 days ago

@bryan_johnson @_katetolo Love it. Also, since you’re using her medical history for your fame and future, I hope you’re giving her her fair share.

0

4K

flo retweeted

Marc Andreessen 🇺🇸

@pmarca

14 days ago

Made it ADA compliant.

72

2K

33

88

298K

Florian Leibert 🎢

@flo

15 days ago

Besides having terrible weather, Austin has the worst internet in modern America.

2

4

0

270

Florian Leibert 🎢

@flo

18 days ago

60 minutes @60Mins is the last bastion of the free press / journalism

0

1

0

181

flo retweeted

Poetiq

@poetiq_ai

22 days ago

Poetiq's Meta-System built its own coding harness from scratch. It got SOTA on LiveCodeBench Pro. No fine-tuning, no special model access. Just standard APIs. Using Gemini 3.1 Pro, it made a harness that beat all frontier models we tested.

https://t.co/v575oUYJeH

43

546

56

237

2M

Florian Leibert 🎢

@flo

28 days ago

@KyleHessling1 Thank you for calling out the “highly experimental” part. Had it halfway deployed to do fuel estimates on my nuclear power plant. I’ll use llama 3 then…

1

3

0

368

Florian Leibert 🎢

@flo

28 days ago

@JosephJacks_ AI charts are a wonderful thing. Love the crossover callout :)

1

57

0

9K

flo retweeted

Mateusz Mirkowski

@llmdevguy

29 days ago

🇨🇳 After testing Chinese models over the last few weeks, my coding ranking currently looks like this:
1. Kimi K2.6
2. GLM-5.1
3. MiMo V2.5 Pro
4. MiniMax 2.7
5. DeepSeek V4 Pro
👉 But each of them has its own superpowers.
Frontend/Design: K2.6
Backend: K2.6 / GLM-5.1
Code review: MiMo
All-rounder: M2.7
Reasoning: DeepSeek
Now I'm waiting for MiniMax 3.0, which I hope will take the number 1 spot!

156

2K

198

2K

169K

flo retweeted

Sully

@SullyOmarr

about 1 month ago

99% chance this is fake. In the 1% chance this is real, short every semiconductor.

33

84

1

14

19K

flo retweeted

Bindu Reddy

@bindureddy

about 1 month ago

SubQ, a new type of AI model, says they are 50x faster and 20x cheaper than Opus 4.7 and GPT 5.5. In fact, they also say they perform INSANELY WELL on benchmarks and have a 12M context. This would be earth shattering, if true - Anthropic/OpenAI's valuation would go to zero 😱

https://t.co/NJq9e8VPap

107

735

60

262

65K

flo retweeted

Milk Road AI

@MilkRoadAI

about 1 month ago

This is one of the craziest AI launches of 2026 and it came out of basically nowhere (Save this).

A company called Subquadratic just shipped SubQ, and the benchmarks are almost hard to believe.

To understand why this is such a big deal, you have to understand the fundamental problem that has defined AI for the last decade. Every large language model in existence is built on transformer architecture, and transformers use a mechanism called standard attention that checks every single word in a sequence against every other word. Double the context length and compute doesn't double, it quadruples; triple it and compute goes up nine times.

This quadratic scaling is why frontier models have been stuck at roughly 1 million tokens, why running them at those lengths gets expensive fast, and why the AI labs have essentially been printing money charging you more the longer you need the model to think. The industry has known this problem existed since 2017 but they scaled it anyway.

SubQ is built from the ground up to solve it. Instead of processing every possible token relationship, SubQ's sparse attention architecture identifies which relationships actually matter and ignores the rest, meaning compute is used where it counts and wasted nowhere else. The result is that compute scales linearly with context length instead of quadratically, and the implications of that one architectural shift are enormous.

At 12 million tokens, SubQ reduces attention compute by nearly 1,000x compared to standard frontier models, and at 1 million tokens it runs 52x faster than FlashAttention. And it does all of this while posting frontier level accuracy, scoring 95% on the RULER 128K long-context benchmark versus Claude Opus 4.6's 94.8%, and an 81.8 on SWE-Bench Verified coding tasks, besting Opus 4.6 (80.8) and DeepSeek 4.0 Pro.

The cost comparison is where it gets genuinely insane. SubQ runs at under $1.50 per million tokens, less than 5% of what Claude Opus charges. On the RULER benchmark, running the test with SubQ cost $8, while running the same test with Claude Opus cost $2,600. That's roughly a 300x cost reduction at equivalent or better accuracy.

Subquadratic launched with $29 million in funding, SubQ is available today for early access via API, and SubQ Code, a coding agent built on the architecture, ships alongside it.

The transformer has been the unchallenged foundation of every major AI system since 2017. SubQ is the first serious evidence that something structurally better might have just arrived.
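
The scaling arithmetic in the tweet above (double the context and dense attention compute quadruples, triple it and it goes up nine times) can be checked with a small sketch. This is only an illustration of the counting argument: the function names are hypothetical, and the fixed per-token key budget is an assumed stand-in for "sparse attention", not a description of how SubQ actually works.

```python
# Counting sketch: token-pair comparisons for dense vs. fixed-budget sparse attention.
# All names and the keys_per_token budget are illustrative assumptions.

def dense_attention_pairs(n_tokens: int) -> int:
    """Dense attention compares every token with every token: n * n pairs."""
    return n_tokens * n_tokens


def sparse_attention_pairs(n_tokens: int, keys_per_token: int = 1024) -> int:
    """Assumed sparse scheme: each token attends to at most a fixed number of keys."""
    return n_tokens * min(keys_per_token, n_tokens)


def growth(pair_count, base_tokens: int, factor: int) -> float:
    """How much the pair count grows when the context is scaled by `factor`."""
    return pair_count(base_tokens * factor) / pair_count(base_tokens)


if __name__ == "__main__":
    base = 128_000  # arbitrary starting context length
    for factor in (2, 3):
        dense = growth(dense_attention_pairs, base, factor)
        sparse = growth(sparse_attention_pairs, base, factor)
        print(f"context x{factor}: dense x{dense:.0f}, sparse x{sparse:.0f}")
```

Under these assumptions the dense count grows with the square of the scaling factor (4x and 9x), while the fixed-budget count grows in step with it (2x and 3x), which is the difference between quadratic and linear scaling.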

44

885

110

1K

277K
