Clay

Verified account

@clay_phi

@_GameFrame_ | ex @AzraGames | @WolvesDAO | Front End Quant | Architect

Houston, TX

Joined April 2020

156 Following

272 Followers

484 Posts

Pinned Tweet

12 days ago

Everyone's talking about harnesses. None of them solve the core problem. I built one 9 months ago that does. Artifacts > Vibes. Progress only on Proof of Work LLM inference is unreliable, inconsistent, and degrades the longer it runs. You cannot fix that with with a loop that doesn't address the underlying problem. You fix it by orchestrating from the kernel of work, the task. Atomic tasks. Intra-task QA w correction cycles. Audit by default. Trust nothing. Verify everything. I call it the PABLOV method. Programmatic Artifact-Based LLM Output Verification. Claude code, Codex, Cursor, Temporal, n8n, Aider, Devin, Lovable, Replit, OpenHands, Kiro, etc. all fall over on long-haul lifts. I built a programmatic orchestrator last year on the PABLOV method that runs at high accuracy for 100+ hours straight.

clay_phi's tweet photo. Everyone's talking about harnesses. None of them solve the core problem. I built one 9 months ago that does.

Artifacts > Vibes. Progress only on Proof of Work

LLM inference is unreliable, inconsistent, and degrades the longer it runs. You cannot fix that with with a loop that doesn't address the underlying problem. You fix it by orchestrating from the kernel of work, the task. Atomic tasks. Intra-task QA w correction cycles. Audit by default. Trust nothing. Verify everything.

I call it the PABLOV method. Programmatic Artifact-Based LLM Output Verification.

Claude code, Codex, Cursor, Temporal, n8n, Aider, Devin, Lovable, Replit, OpenHands, Kiro, etc. all fall over on long-haul lifts. I built a programmatic orchestrator last year on the PABLOV method that runs at high accuracy for 100+ hours straight.

1

2

0

0

294

13 minutes ago

@GJarrosson 100%. Age of experts is over

0

0

0

0

4

14 minutes ago

@neerajjj6785 5. PM

0

0

0

0

4

16 minutes ago

@CoachDanGo Is this one of those tell me you’ve never been a founder without telling me posts?

0

0

0

0

53

Who to follow

Verified account

Build Ai You Own

Verified account

Building my empire online — Instagram/YouTube: eeelistar — Podcast @ThisIsNewWave_

Business Analyst Proud member of @WolvesDAO and @GameTheoryWeb3

18 minutes ago

@benhylak 1. Correct 2. Wrong Tech has only ever gotten better and cheaper

0

0

0

0

7

19 minutes ago

@Taniyatweets_ Workarounds for Claude -p

0

0

0

0

8

22 minutes ago

So many talking farms posting about LLM providers profitability, bubbles, and yada yada. Its all b8 Uber founded 2009 Uber >$1B rev 2014 Uber profitable 2023

0

0

0

0

6

28 minutes ago

@esrtweet That’s not how any of this works.

0

0

0

0

24

34 minutes ago

@theo Like pricing Netflix subscription against movie tickets. You got $1,000,000 worth the movies for $19.99. Nonsensical

0

0

0

0

67

36 minutes ago

@federicodonaton Engine can’t move you without wheels or steering

0

0

0

0

3

about 24 hours ago

Your harness is your Agent OS. and, it matters more than the OS your machine runs on. The lock-in on your Agent OS will be far harder to break than any OS lock-in before it. The cause is the ability and freedom to customize. Every prompt you tune, tool you wire, workflow you encode raises the cost or leaving. Every tweak welds you in tighter. Unlike the OS era where the whole world ran on three options, there will be thousands of harnesses tuned to teams, roles, individuals. New market. Wide open. Most are arguing about which model is more capable. The opportunity is one layer up.

1

0

0

0

22

38 minutes ago

@YashHustle_22 Are you checking every line of code written by human

0

0

0

0

2

about 2 hours ago

@DanKulkov Lemme tell ya about subscription rates

0

0

0

0

55

about 2 hours ago

@pmarca Def changes what makes a good software engineer and what they spend their time doing. Most trad software devs are not good in areas required to orchestrate unlimited agent instances

0

0

0

0

734

about 11 hours ago

@ThePrimeagen Compute the multiple for doing nothing vs something

0

0

0

0

4

about 12 hours ago

@PeterDiamandis @BillAckman This pod needs to ban the words Dyson and swarm

0

1

0

0

513

about 13 hours ago

@barrettjoneill I see you’re enjoying having Claude recreate the 999,999,999,999th re financial model

0

1

0

0

143

about 13 hours ago

@growing_daniel

0

0

0

0

145

1 day ago

@nicbstme Even with evap, where does the water go? Not into the….water cycle…..perhaps

0

0

0

0

380

1 day ago

@LeakerApple I remember 9 months ago trying to tell some PC bois that their GPUs cannot compete with Apple’s unified memory for local inference. Engrained

0

1

0

0

948

Last Seen Users on Sotwe

Trends for you

Most Popular Users