3xHarry @TrippelHarry - Twitter Profile

about 1 month ago

The AI cost crisis is real. The diagnosis isn't.

Microsoft just told ~100,000 engineers to drop Claude Code by June 30. Officially: "standardizing on Copilot CLI." Unofficially (per @tomwarren's leaked memo + Fortune): the bills got brutal. Uber burned $3.4B, its 2026 AI budget, in four months. Per-engineer Claude Code spend: $500–$2,000/month.

The frame everyone's using: AI is too expensive. The frame that's true: most teams don't know how to use it.

Where most of your tokens are actually burning.

You've watched this. The first 15 minutes of a session are sharp. By minute 40 the agent starts looping: it tries the same broken approach three times and forgets the spec you handed it at minute 1. By hour 3 you're paying it to confidently invent the wrong fix and ship it as a regression you'll spend tomorrow undoing.

That long-session drift is the cost line item. It's also the regression line item. Hallucination compounds with context decay, which compounds with steering loss, and you keep paying for tokens that introduce more work than they finish.

The benchmark math is brutal. APEX-Agents tests legal, consulting, and analyst tasks that take humans 1–2 hours. Frontier models that score 90%+ on standard coding benchmarks complete those real-work tasks 24% of the time. After 8 retries: 40%. The diagnosis was consistent across every model: agents got lost after too many steps, looped back to approaches that had already failed, and lost track of what they were supposed to be doing. The steering instructions from step 1 got buried under hundreds of intermediate tool results. That's not a model problem.

Three receipts that prove a tighter harness wins.

Vercel stripped 80% of their text-to-SQL agent's tools. Accuracy went 80% → 100%. Tokens dropped 40%. 3.5× faster.

Cursor drove tool-call errors down 10× by tuning the harness to the model's actual training format (patch for OpenAI, string-replace for Anthropic). Each retry that doesn't happen is a token you don't pay for.

Manus rebuilt their agent framework five times in six months. The wins came from removing features. Average task: 50 tool calls, long enough that the steering prompt gets evicted before the agent reaches it again.

And the same Opus 4.5 scores 45.9% / 50.2% / 55.4% on the same SWE-Bench Pro tasks depending on whether it's running on a minimal scaffold, Cursor's harness, or Claude Code's. Same model, different wrapper. You're paying Opus prices and getting whichever number your scaffolding earned.

@AnthropicAI's own engineering blog confirms the shape: same model + complicated harness was 20× more expensive, but the output quality jump was immediately apparent. Same model. Different scaffolding.

The right question in 2026.

Stop asking which model is best. Ask which harness around a specific model is the best one. Opus and Sonnet aren't always the answer. Vibe-coding a 50-tool MCP loop around a frontier model and watching it drift for four hours isn't a model problem. It's discipline.

This is the lane I'm building in. I'm shipping an IDE that authors agent harnesses: Planner / Generator / Evaluator loop, bounded context, a handful of skills instead of 50 MCP tools, verifier loops that catch "made up" before it ships to prod.

If your bills are exploding, the model isn't the issue. It's what's wrapped around it.

Early access: https://t.co/CZj3G4oS5U
Source: https://t.co/7N7VIJr7zH
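
To make "bounded context" concrete, here is a minimal sketch of one way to keep the step-1 steering instructions from getting buried under tool results. It is an illustration only, not the IDE described above or any vendor's harness: the class name `BoundedContext`, the `max_recent` budget, and counting messages instead of tokens are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class BoundedContext:
    """Hypothetical context window: pinned steering plus the newest results."""

    steering: str
    max_recent: int = 20  # assumed budget, counted in messages, not tokens
    recent: list[str] = field(default_factory=list, init=False)

    def add(self, message: str) -> None:
        """Record one intermediate tool result, evicting the oldest if needed."""
        self.recent.append(message)
        overflow = len(self.recent) - self.max_recent
        if overflow > 0:
            # Only intermediate results are ever dropped, never the steering.
            del self.recent[:overflow]

    def render(self) -> list[str]:
        """Build the message list sent to the model on the next step."""
        # The steering prompt is placed first and repeated last, so it stays
        # in the window no matter how many tool calls have happened.
        return [self.steering, *self.recent, self.steering]


ctx = BoundedContext("Follow the spec.", max_recent=3)
for i in range(50):  # roughly the 50-tool-call task length cited above
    ctx.add(f"tool result {i}")
print(ctx.render())
```

After 50 tool calls the rendered window still starts and ends with the steering prompt and holds only the last three results. A real harness would budget by tokens and summarize evicted results instead of dropping them outright.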

0

43

3xHarry

@TrippelHarry

about 1 month ago

Started building https://t.co/9prFC6I5UG as "agency for agentic workflows." Green field, all that.

Then I tried deploying vanilla agents and hit the wall every Claude Code user knows. They hallucinate. They build skills that drift from your workflow. Give them real tools and they take actions you didn't ask for. The bigger the toolbox, the worse the damage.

The Agent isn't the problem: OpenClaw / Hermes both nail the loop. But the Agent is not a Harness Engineer; it doesn't understand how to develop a reliable harness.

So I'm building an IDE for that. Spec → skills → tools → deploy, with the platform showing you what should be a tool vs. a skill, where approval gates go, which context lives where. Right components, right places, right agent. Built on top of @steipete's OpenClaw.

Early. But already hosting client agents on it.

Also: unfollow anyone telling you their OpenClaw / Hermes agent is making money on its own, running their entire business, or "is their CEO" 😂. That's literally a psyop to burn as many tokens as possible by big tech.

#stoptheslop

0

69

3xHarry

@TrippelHarry

about 1 month ago

Recently unemployed. Decided to build https://t.co/MII258KqUm instead of looking for a job. Many nights I get anxious wondering how to provide for my family. But I know in my heart that God, who's carried me this far, isn't going to stop now. https://t.co/vJDejjcW30

1

0

30

3xHarry

@TrippelHarry

about 1 month ago

@Rothmus made a map of where they actually landed. a few counties are absolutely cooked https://t.co/sYyEc2klEx

0

1

0

366

3xHarry

@TrippelHarry

4 months ago

https://t.co/L2hikqNnzL

0

53

3xHarry

@TrippelHarry

7 months ago

@BitcoinArchive @camelfinance

0

12

3xHarry

@TrippelHarry

9 months ago

@infraa_ @camelfinance

0

2

0

41

3xHarry

@TrippelHarry

10 months ago

Christ is King 👑 https://t.co/XJZypOtHu7

0

70

3xHarry

@TrippelHarry

12 months ago

@CryptoHayes @camelfinance The Crack Pipe Monday topic seems to be becoming reality!

2

1

0

879

TrippelHarry retweeted

rasmr

@rasmr_eth

about 1 year ago

I asked to use CRYPTOCURRENCY for a mortgage…

83

400

36

28

30K

3xHarry

@TrippelHarry

about 1 year ago

@SmokeyTheBera @DefiIgnas Rev should be the only relevant revenue stream for L1 token holders. In fact they are buybacks as staking usually compounds all rewards into the L1 token! DeFi demand for an L1 token also creates a lot of buying pressure but only relevant if the ecosystem’s thriving.

0

1

0

161

TrippelHarry retweeted

DEGEN NEWS

@DegenerateNews

over 1 year ago

NEW: @Gemini LISTS SOLANA MEMECOINS $GOAT (@gospelofgoatse), $PNUT (@pnutsolana), $MEW (@MewsWorld), AND $BOME (@Darkfarms1)

84

1K

114

35

106K

3xHarry

@TrippelHarry

almost 2 years ago

@RadarHits @camelfinance

0

6

0

68

TrippelHarry retweeted

13yr old with a credit card

@13yroldwithcc

almost 2 years ago

MANY MEN

15

2K

169

79

225K

TrippelHarry retweeted

Mario Nawfal

@MarioNawfal

almost 2 years ago

Again, Main Stream Media is the worst... Over 20 minutes after Trump was shot at, CNN's headline: "Trump Speech interrupted by Secret Service"

345

4K

831

278

655K

TrippelHarry retweeted

Jakey

@SolJakey

about 2 years ago

2024.

297

4K

430

104

545K

TrippelHarry retweeted

Kaizen

@KAIZ3NS

about 2 years ago

If the ETH ETF doesn’t get approved this is the biggest betrayal since:

56

927

93

56

260K

3xHarry

@TrippelHarry

about 2 years ago

LOVE when @solana is completely down when I try to repay my loan and only works again after I get liquidated! @toly, start implementing an auction system for MEV or a priority fee already! Bazillion TPS, but my TX takes more than 30 min to go through! Worst UX ever!

0

1

0

137

3xHarry

@TrippelHarry

about 2 years ago

@ethena @MakerDAO @MorphoLabs Ptsd

0

40

3xHarry

@TrippelHarry

about 2 years ago

@MaxBecauseBTC @base Oh boss you are right in time for the party! MR $FINK already set his eyes on the base chain and is ready to pump the RWA narrative $ElonRWA

0

46
