Quant Cat @0xquantcat - Twitter Profile

@JonhernandezIA What if people learned to govern their token usage instead? Proper model selection for task delegation + planning = at least 40% cost saved Read more: https://t.co/Bc6xTtHtKJ

Quant Cat

@0xQuantCat

about 23 hours ago

https://t.co/tFQNh0FJN4

0

1

406

0

Quant Cat

@0xQuantCat

39 minutes ago

@matthewmillerai OpenAI just needs a useable mid-frontier that allows more than one weekly task

0

Quant Cat

@0xQuantCat

40 minutes ago

@tunguz I'm pretty sure this has been a thing since the opus family started, and not just on claude models

0

Quant Cat

@0xQuantCat

41 minutes ago

@TimJayas Your method + This below = GOLD https://t.co/Bc6xTtHtKJ

Quant Cat

@0xQuantCat

about 23 hours ago

https://t.co/tFQNh0FJN4

0

1

406

0

Quant Cat

@0xQuantCat

about 1 hour ago

@iruletheworldmo Big if true, but then again, this won't be possible without increasing cost/token. It's probably a 15/60 per mToken

0

25

Quant Cat

@0xQuantCat

about 1 hour ago

@rishi_raj_jain_ Vibe coders discovering do-while

0

1

Quant Cat

@0xQuantCat

about 2 hours ago

@stevyhacker Is it really that easy?

0

18

Quant Cat

@0xQuantCat

about 2 hours ago

@neural_avb AWS realising they need proper branch management and task delegation

0

1

0

15

Quant Cat

@0xQuantCat

about 2 hours ago

As per their docs, it's parsers record token usage while ingesting messages and usage events, so the database already knows the input, output, cache-creation, and cache-read tokens those agents have logged. I know this reporting is in early phase, but is this reliable or just guesswork based on the above listed logs? Wouldn't it be better to parse it from ~/.codex/sessions/* and log actual cost?

0

23

Quant Cat

@0xQuantCat

about 2 hours ago

@heswithme_eth Imagine calling Opus 4.8 slopus Valid crashout though, they should not bill you for Fable when Opus tokens are basically half the price

0

122

Quant Cat

@0xQuantCat

about 2 hours ago

@CastAsHuman Solid rules

0

1

0

18

Quant Cat

@0xQuantCat

about 4 hours ago

@banteg I wouldn't be mad, but then don't bill me fable prices on my opus

0

1

0

181

Quant Cat

@0xQuantCat

about 4 hours ago

@Dimillian A good, streamlined and useable tool >>> a demigod model

0

47

Quant Cat

@0xQuantCat

about 4 hours ago

@reach_vb It’s that simple. Don’t gave to provide a demigod model, just provide a good model people can actually use

0

1

0

178

Quant Cat

@0xQuantCat

about 4 hours ago

@GergelyOrosz I don’t think they aim for more premium users going forward. We’ve been constantly throttled on top premium plans these past months already. It costs less and pays more to provide the government and top companies with premium models than the masses.

0

54

Quant Cat

@0xQuantCat

about 4 hours ago

Thank you for this! Memory and persistence is the single biggest source of chaos in the space right now. After harnesses and loops, the next focus should be on a governance layer that sits on top of existing runtimes and tools, managing memory efficiently. We won’t be able to “move fast and break things” for any longer because of these absurd costs and usage limits. We don’t need another runtime or UI. Just a deterministic working-set selection so that persistent memory actually stays reliable, compact and cheap!

0

7

Quant Cat

@0xQuantCat

Last Seen Users on Sotwe

Trends for you

Most Popular Users