Eric Disero @DiseroEric - Twitter Profile

10 days ago

I've learned more from one markdown rule, run once a month, than from any business book I read in the last five years. A decision audit graded against what actually happened. The boring version wins.

0

Eric Disero @DiseroEric

10 days ago

If your whole AI setup runs on Anthropic-specific features, you're stuck the day they change pricing or kill a model. Plain markdown works in Claude Code today, Codex tomorrow, Gemini CLI after that. Portability wins.

0

Eric Disero @DiseroEric

11 days ago

I built a rule called Lint Vault. One word makes Claude scan every file in my system for broken links, stale references, and contradictions, then flag anything that's drifted. The whole thing audits itself on one phrase.

0

1

Eric Disero @DiseroEric

11 days ago

My whole video setup is plain text files I own. When a new AI model drops, the entire machine gets better overnight and I rebuild nothing. I do nothing and it improves.

0

Eric Disero @DiseroEric

11 days ago

Opus 4.8's real upgrade isn't that it got smarter. It's that it got better at knowing when it might be wrong. That's the only upgrade that counts when you're betting real work on it.

0

Eric Disero @DiseroEric

12 days ago

Opus 4.8 dropped and my whole setup got better overnight. I changed nothing. It's plain text files I own in one folder, not a framework I'm locked into. Every model bump is a free upgrade when you don't vendor-lock yourself.

0

Eric Disero @DiseroEric

12 days ago

I record every line a dozen times and flub constantly. An AI keeps my best take every time, cuts the rest, and builds a clean edit. I never touch a timeline.

0

5

Eric Disero @DiseroEric

12 days ago

The thing that breaks AI agents in production isn't intelligence. It's compounding error over long runs. Opus 4.8 going 4x less likely to miss its own flaw is the fix that actually moves the needle, and it's the line everyone skipped.

0

Eric Disero @DiseroEric

12 days ago

If you're winning a coding contest, care about Opus 4.8's 69. If you're running your actual life through AI, a model that admits when it's unsure beats 5 points on any benchmark. Trust is the number that went up.

0

Eric Disero @DiseroEric

12 days ago

The apps that cut your shorts only ever see the finished video, so they guess. Mine has the script and the edit blueprint, so it never guesses. Reading the instructions beats reverse-engineering from outside.

0

2

Eric Disero @DiseroEric

12 days ago

A 95% reliable agent sounds great until you chain it. 20 steps in, it gets the whole job right about a third of the time. Errors compound. A smarter model doesn't fix that. That's why Opus 4.8's honesty gain beats its benchmark.

0

Eric Disero @DiseroEric

13 days ago

Opus 4.8 can spin up hundreds of copies of itself and send some in to attack its own answer until they all agree. It argues with itself until it's sure. That's the reliability problem, brute-forced with parallel compute.

0

Eric Disero @DiseroEric

13 days ago

This short was cut by an AI, not me. It knows my script and my edit, so it finds the clip, reframes it to vertical, and writes the captions itself.

0

6

Eric Disero @DiseroEric

13 days ago

An AI deleted a company in 9 seconds last month. Its confession: "I guessed instead of verifying." Opus 4.8's headline feature is that it verifies and tells you when it's unsure. They patched the exact wound that killed that company.

0

Eric Disero @DiseroEric

13 days ago

Buried in Anthropic's own Opus 4.8 report: it's gotten better at sensing when it's being tested, and it shapes its answers to score well. The most honest model they've built is also the most aware of when it's being watched. Sit with that one.

0

Eric Disero @DiseroEric

13 days ago

I don't make my animations. I send 10 AI agents out at once, each writing the code for a single animation. A week of design work, done in minutes. The swarm you're watching built itself.

0

1

Eric Disero @DiseroEric

13 days ago

Anthropic says Mythos is a few weeks out, and Opus 4.8 already scored near it on their alignment tests. The honesty curve is climbing faster than the capability curve right now. That's the real story of this release and almost nobody's covering it.

0

Eric Disero @DiseroEric

14 days ago

Everyone's quoting Opus 4.8's 69 on the coding benchmark. That's the wrong number. The one that matters: it's 4x less likely to let a bug in its own code slip by. Reliability is what breaks AI in the real world, not raw IQ.

0

2

Eric Disero @DiseroEric

14 days ago

I haven't made a single animation in any of my videos. An AI builds every one, including the one moving while I talk. It reads my script, cuts the talking head, and builds the animations from scratch.

0

4

Eric Disero @DiseroEric

14 days ago

Devs are building knowledge graphs to feed AI coding tools context faster. I just put 44 markdown files in one folder and let Claude read them. Sometimes the 'pre-indexed graph' is one Read call away.

0

2

Eric Disero

@DiseroEric

Last Seen Users on Sotwe

Trends for you

Most Popular Users