Oscar Cardoso @vryand - Twitter Profile

Pinned Tweet

about 2 months ago

One last shot at this: @jack Clear Signals App (MVP Done) https://t.co/8K9AtuRJyh https://t.co/WAGPjl0PUF https://t.co/FMwvfURFgz https://t.co/qyv4WnVpPB

0

17

Oscar Cardoso @vryand

about 13 hours ago

@phipps By the Human-Agent-LLM issue I mean that I see many are 'Token-maxin' or 'Generating xK of lines code' per day and I still do not see mush said or done about the quality or of safeguards, agents are still having limited context windows and the LLMs are not utilized efficiently

0

1

0

9

Oscar Cardoso @vryand

about 23 hours ago

This is good as a Agentic automation audit (needed now) to lowering the cost of many over engineered processes many got accustomed to create when the price of AI was low. Only thing is that is does not correct the initial Human-Agent-LLM issue from the start of new projects.

Travis Phipps

@phipps

1 day ago

https://t.co/Ddv8M2Slvp

2

1

0

212

1

0

25

Oscar Cardoso @vryand

about 21 hours ago

AI solutions = Old man with Alzheimer's + Dementia on drugs that forgets where, who, what it was doing put in charge of sensitive private information replacing... Now more expensive then the young workforce... No actual QA testing (test it self) What is the worse it can happen?

vryand's tweet photo. AI solutions = Old man with Alzheimer's + Dementia on drugs that forgets where, who, what it was doing put in charge of sensitive private information replacing...
Now more expensive then the young workforce...
No actual QA testing (test it self)

What is the worse it can happen? https://t.co/0AapuzSySt

0

11

Who to follow

Edward Taylor

@Detroited

Comedian/Professional Wrestling nerd/School bus driver... person.

Oscar Cardoso @vryand

about 23 hours ago

500M a 150M ahorar 350M... que no daria por solo 1 a cambio de ahorarles esos 350M-1M

Guillermo Izquierdo

@gizquierdo_dev

1 day ago

Un consultor citado en Axios: un cliente gastó $500M en un mes solo en Claude porque nadie puso límites de uso por empleado. Es UN cliente. Eso multiplicado por 12 son $6B de run-rate extra para Anthropic.

1

0

27

0

15

vryand retweeted

Guillermo Izquierdo

@gizquierdo_dev

1 day ago

Un consultor citado en Axios: un cliente gastó $500M en un mes solo en Claude porque nadie puso límites de uso por empleado. Es UN cliente. Eso multiplicado por 12 son $6B de run-rate extra para Anthropic.

1

0

27

Oscar Cardoso @vryand

about 23 hours ago

@theonechrisyep @leonpalafox Comparto este sentimiento sobre la diferencia economica

0

1

0

26

vryand retweeted

Christian Yepez @theonechrisyep

1 day ago

@leonpalafox Veo la brecha tambien en cantidades a gastar en tokens. Leo aa devs de EUA gastando $4K USD al mes en pruebas y creando cosas. Acá solo hacen lo que permiten los límites gratuitos de alguna LLM o el tier mas bajo de Claude.

2

1

0

177

Oscar Cardoso @vryand

about 23 hours ago

@phipps This is good as a Agentic automation audit (needed now) to lowering the cost of many over engineered processes many got accustomed to create when the price of AI was low. Only thing is that is does not correct the initial Human-Agent-LLM issue from the start of new projects.

0

7

vryand retweeted

CXOTALK

@cxotalk

1 day ago

@SecretCFO If your company is using LLMs for AI agents, you're probably seeing significant token cost and its growth. You're probably seeing sky rocketing token cost if you're attempting or doing agentic workflows. #AgenticAI #AIagents #AgenticWorkflows

0

1

0

147

Oscar Cardoso @vryand

about 24 hours ago

@RoshanMayengba I just created a technique to cut the token usage down to 30% (70% savings) DM me if you find people that this can help them.

0

161

vryand retweeted

Vaibhav Sisinty

@VaibhavSisinty

2 days ago

This guy on Reddit burned 1.15 BILLION Claude tokens in a single month And what he learned will save you thousands of dollars on AI. 🤯 5 takeaways worth saving: → Prompt caching got quietly nerfed: Anthropic cut cache time from 60 minutes to 5, silently raising production costs by 30-60% for most users. → Output tokens cost 5x more than input: Stop asking AI for full text, ask for IDs or numbers and map them in your code — he cut his output bill by 60% doing this. → JSON is a silent token killer: Every bracket, quote, and comma eats tokens, making the same data cost 2x more in JSON than in plain text or markdown tables. → Opus 4.7's tokenizer secretly raised your bill: The new tokenizer generates up to 35% more tokens than Opus 4.6 for the exact same input, and nobody is talking about it. → You're using the wrong model for most tasks: Haiku is 5x cheaper than Opus and good enough for 80% of real work, so stop defaulting to the flagship for everything. The wild part? He runs an AI agent company. This is what he learned by burning real money. While everyone races to use AI, the smart ones are learning how to use it cheaper.

VaibhavSisinty's tweet photo. This guy on Reddit burned 1.15 BILLION Claude tokens in a single month

And what he learned will save you thousands of dollars on AI. 🤯

5 takeaways worth saving:

→ Prompt caching got quietly nerfed: Anthropic cut cache time from 60 minutes to 5, silently raising production costs by 30-60% for most users.

→ Output tokens cost 5x more than input: Stop asking AI for full text, ask for IDs or numbers and map them in your code — he cut his output bill by 60% doing this.

→ JSON is a silent token killer: Every bracket, quote, and comma eats tokens, making the same data cost 2x more in JSON than in plain text or markdown tables.

→ Opus 4.7's tokenizer secretly raised your bill: The new tokenizer generates up to 35% more tokens than Opus 4.6 for the exact same input, and nobody is talking about it.

→ You're using the wrong model for most tasks: Haiku is 5x cheaper than Opus and good enough for 80% of real work, so stop defaulting to the flagship for everything.

The wild part? He runs an AI agent company. This is what he learned by burning real money.

While everyone races to use AI, the smart ones are learning how to use it cheaper.

8

71

12

90

10K

Oscar Cardoso @vryand

about 24 hours ago

@StartupHakk I need to reach out to who wasted 40K. I can cut that budget down to 12K

0

3

vryand retweeted

StartupHakk

@StartupHakk

3 days ago

AI shouldn't be a rental. It should be infrastructure you own, answering only to you, as token costs skyrocket. Running 2100 agents cost $40k—budgets are breaking. Flat-rate AI coding subscriptions are ending. Get ready for AI token costs to exceed employee salaries.

1

0

64

Oscar Cardoso @vryand

about 24 hours ago

@DavidLinthicum If I help you making those 100K down to 30K, how mush f the 70K savings are you willing to share? Plus (Extra) it includes one shot prompts that work.

0

5

vryand retweeted

DavidLinthicum

@DavidLinthicum

6 days ago

Agentic AI systems are facing massive cost overruns as LLM providers hike token prices. What starts as $1k/month can balloon to $100k. Dependence on these LLMs means unpredictable, escalating costs. Is the business value worth it? #AICosts #LLM

1

3

1

0

222

vryand retweeted

Hedgie

@HedgieMarkets

12 days ago

🦔Tech companies that pushed employees to maximize AI usage are now realizing the math does not work. Microsoft, Meta, and Amazon all set internal targets that pressured workers to use AI tokens aggressively to hit productivity scores. The problem is agentic AI burns up to 1,000 times more tokens per task than a standard LLM query because it loops through multiple steps and self-checks. OpenClaw's creator Peter Steinberger said his team spent $1.3 million on OpenAI tokens in a single month. Nvidia CEO Jensen Huang told his engineers they should be consuming AI tokens worth at least half their annual salary every year. The behavior has its own name now, "tokenmaxxing." My Take The cost trajectory works backwards from how the labs sold it. Per-token prices have fallen, but the number of tokens each task consumes has climbed faster, and the all-in spend keeps going up release after release. Agentic AI is the worst offender because the model talks to itself, second-guesses itself, and runs the same logic three times before landing on an answer. Goodhart's Law also shows up clearly here. When AI usage became the performance review metric, employees started using AI to inflate the metric, not because the task needed AI. OpenAI and Anthropic are losing roughly $2 for every $1 of revenue, and the only way the math fixes itself is by raising prices or capping consumption per enterprise contract. Both moves slow the revenue growth the labs need to show on the IPO roadshow. Goldman Sachs and the underwriters know this, which is why SpaceX's S-1 came out before OpenAI's. Whichever AI lab files first gets the cleaner narrative, and whoever files second has to explain why their largest enterprise customers just started rolling back token consumption. The companies pushing tokenmaxxing internally are now the same companies signaling cost pressure externally, and that contradiction is going to show up in earnings the moment these labs start reporting publicly. Hedgie🤗

HedgieMarkets's tweet photo. 🦔Tech companies that pushed employees to maximize AI usage are now realizing the math does not work. Microsoft, Meta, and Amazon all set internal targets that pressured workers to use AI tokens aggressively to hit productivity scores. The problem is agentic AI burns up to 1,000 times more tokens per task than a standard LLM query because it loops through multiple steps and self-checks.
OpenClaw's creator Peter Steinberger said his team spent $1.3 million on OpenAI tokens in a single month. Nvidia CEO Jensen Huang told his engineers they should be consuming AI tokens worth at least half their annual salary every year. The behavior has its own name now, "tokenmaxxing."

My Take
The cost trajectory works backwards from how the labs sold it. Per-token prices have fallen, but the number of tokens each task consumes has climbed faster, and the all-in spend keeps going up release after release. Agentic AI is the worst offender because the model talks to itself, second-guesses itself, and runs the same logic three times before landing on an answer. Goodhart's Law also shows up clearly here. When AI usage became the performance review metric, employees started using AI to inflate the metric, not because the task needed AI.

OpenAI and Anthropic are losing roughly $2 for every $1 of revenue, and the only way the math fixes itself is by raising prices or capping consumption per enterprise contract. Both moves slow the revenue growth the labs need to show on the IPO roadshow. Goldman Sachs and the underwriters know this, which is why SpaceX's S-1 came out before OpenAI's. Whichever AI lab files first gets the cleaner narrative, and whoever files second has to explain why their largest enterprise customers just started rolling back token consumption. The companies pushing tokenmaxxing internally are now the same companies signaling cost pressure externally, and that contradiction is going to show up in earnings the moment these labs start reporting publicly.

Hedgie🤗

47

501

104

146

65K

Oscar Cardoso @vryand

1 day ago

The new workforce

0

4

Oscar Cardoso @vryand

1 day ago

The new gold/gas/oil of the new age economy

0

6

Oscar Cardoso @vryand

1 day ago

Share with me your experience with the Development IDE agent when it comes to the rules, what methods have you tried already.

0

5

Oscar Cardoso @vryand

1 day ago

I have already tried 5 (single *.md file, group of *.md files, cursor folders...) some worked but only for a short time, since my project needed to have more rules every day, then its starts forgetting the rules once again (they are not consistent or 100% reliable)

1

0

10

Oscar Cardoso

@vryand

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users