The Token Company (YC W26) @thetokenco - Twitter Profile

HedgieMarkets

🦔Microsoft canceled its internal Claude Code licenses this week after token-based billing made the cost untenable, even for a company with effectively infinite cloud resources. Uber's CTO sent an internal memo warning the company burned through its entire 2026 AI budget in just four months. American AI software prices have jumped 20% to 37%, and GitHub (owned by Microsoft) is dropping flat-rate plans for usage-based billing across its products.

My Take
The AI subsidy era is ending in real time. The same company that put $13 billion into OpenAI and built the Azure infrastructure powering most of Anthropic's compute just looked at the bill from a competitor's coding tool and decided it was not worth paying. That is not a productivity failure on Anthropic's end. Token-based pricing is forcing every enterprise customer to confront the actual cost of running these models at scale, and the number turns out to be far higher than the flat-rate experiments suggested.

This ties directly to my Gemini Flash post yesterday. Anthropic, OpenAI, and Google all raised effective prices in the last six months. Enterprises that built workflows assuming AI costs would keep falling are now watching annual budgets evaporate in months. Two outcomes look likely from here. Either enterprises scale back AI usage to fit budgets, which slows the revenue ramp the labs need to justify their valuations ahead of IPOs, or the labs cut prices and absorb the losses, which makes the unit economics worse at exactly the wrong moment. Both paths land in the same place, the numbers stop working, and somebody has to take the writedown.

Hedgie🤗

The Token Company (YC W26) @thetokenco

23 days ago

Save on your LLM bill with @opencode and @thetokenco

ricky

@drf0k

23 days ago

pushed an update to the @opencode plugin using the @thetokenco compression model for reducing input tokens for every prompt. i've been meaning to ship an update ever since plugins got updated!
> added compression status to sidebar widget
> added opencode command for easier adjustments
i set it, forget it, and use it every single day

thetokenco retweeted

The Token Company (YC W26) @thetokenco

23 days ago

The Token Company is now HIPAA compliant!

Customers using The Token Company's models to compress LLM inputs can now securely process protected health information.

Our compression models strip bloated context from LLM inputs, making LLMs perform better and cost less on long inputs.

thetokenco retweeted

Rasmus Uusipaikka

@rasmus_up

29 days ago

Claude Code making really pretty animations

The Token Company (YC W26) @thetokenco

3 months ago

X right now

The Token Company (YC W26) @thetokenco

3 months ago

Try out the before and after at https://t.co/sYto9yMbRy

The Token Company (YC W26) @thetokenco

3 months ago

Evaluating compressed prompts just got easier. You can now compare LLM outputs before and after compression in our Compression sandbox to directly evaluate your use case.

The Token Company (YC W26) @thetokenco

3 months ago

"Grok make the lobster wear a cape with our logo on it" Grok:

Harj Taggar

@harjtaggar

3 months ago

Does anyone know what’s going on with the lobster on Wall Street lol?

thetokenco retweeted

ricky

@drf0k

3 months ago

i built a plugin that saves you hundreds of dollars in @opencode by using the @thetokenco compression model. it shrinks the number of tokens in your input query before it hits the model. the queries are faster and cheaper, all while maintaining output quality

The Token Company (YC W26) @thetokenco

3 months ago

Link to the article: https://t.co/nHidF57DjC

The Token Company (YC W26) @thetokenco

3 months ago

Compression absolutely crushed SEC filings

Does anyone actually read these? I don't know, but we let our bear-1.2 LLM compression model do that on FinanceBench (150 real SEC filing questions)

Bro got up to 84.7% accuracy (vs 82% baseline) with 20% token reduction. Saves costs, boosts speed, crushes financial analysis

Removing bloat makes the model perform better
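
For a rough sense of what a 20% token reduction means for an input bill, here is a minimal Python sketch. The per-million-token price and the monthly token volume are hypothetical placeholders, not figures from The Token Company or any model provider, and the function names are made up for illustration. It only covers input tokens and ignores any fee for the compression step itself.

```python
def input_cost_usd(tokens: int, usd_per_million: float) -> float:
    """Cost of sending `tokens` input tokens at a flat per-million-token price."""
    return tokens / 1_000_000 * usd_per_million


def compression_savings(tokens: int, reduction: float, usd_per_million: float):
    """Return (cost_before, cost_after, saved) for a fractional token reduction.

    `reduction` is the share of input tokens removed, e.g. 0.20 for 20%.
    """
    if not 0.0 <= reduction < 1.0:
        raise ValueError("reduction must be in [0, 1)")
    before = input_cost_usd(tokens, usd_per_million)
    compressed_tokens = round(tokens * (1.0 - reduction))
    after = input_cost_usd(compressed_tokens, usd_per_million)
    return before, after, before - after


if __name__ == "__main__":
    # Hypothetical workload: 500M input tokens a month at $3 per million tokens.
    before, after, saved = compression_savings(500_000_000, 0.20, 3.0)
    print(f"before: ${before:,.2f}  after: ${after:,.2f}  saved: ${saved:,.2f}")
```

Under a flat input price the saving scales linearly with the reduction, so the 20% figure quoted above maps to roughly 20% off the input portion of the bill. Output tokens are billed separately and are not affected.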

The Token Company (YC W26) @thetokenco

3 months ago

@ycombinator @OtsoVeistera Check out our API and benchmarks at https://t.co/wi1e7hQPzT

thetokenco retweeted

Y Combinator

@ycombinator

3 months ago

The Token Company (@thetokenco) builds LLM input optimization to lower costs, reduce latency, and improve accuracy. Congrats on the launch, @OtsoVeistera! https://t.co/aDsoAvIjT1

thetokenco retweeted

Dan

@aidaniil

3 months ago

One of my batchmates is starting his fundraise earlier because he needs to take his high school exams right before demo day. I've never been aura farmed on so much in my life

thetokenco retweeted