Binh Quach @BigBugInBucket - Twitter Profile

I've got an agent in a loop optimizing a renderer with the goal to minimize frame times (and tests to measure). It got times down from 88ms to 2ms and allocations down from ~150K to 500. Sounds good, right? Wrong. This is exactly why agent psychosis is a big fucking problem. As an experiment, I rewrote the Ghostty core render state in Go, with access to identically laid out data structures as Ghostty and the exact same validation tests. I made a purposely naive renderer (simple, correct, but slow). 88ms per frame with 150,000 allocations (horrendous, lol)! I then kickstarted a Ralph loop to bring the frame times down. I told it it can't modify input data structures or the public API or tests (they're correct), but it can do anything else it wants. It got to work. It has worked for about 4 hours. I've spent around $350 on this experiment so far. The results? 88ms => 1.5ms 150K allocs => ~500 allocs Incredible right? Nope. My hand-written renderer I ported has frame times (same benchmark) of ~20us (0.020ms) and 0 allocations in the update path. This is the problem with psychosis and lacking systems understanding. If you don't understand the system, you're going to accept that this is an incredible result. If you understand the system, you'll see better solutions immediately and can do roughly 75x better on throughput. The people who blindly trust agent output are in the former camp. They're sheeple, overdrinking from a fountain of mediocrity. Standard disclaimer: I use AI all the time. I like AI. The point I'm making is to not blindly accept results. Think. Analyze. Learn.

303

9K

955

2K

770K

BigBugInBucket retweeted

DHH

@dhh

10 days ago

I've had more "I can't believe it's this good" moments with GPT5.5 than any other model since Opus 4.5. It's shockingly, scarily capable. Days and days of amazing progress. All steering, no handwriting. Yet utterly delightful to conduct its coding. So, so good.

256

6K

287

922

470K

Binh Quach @BigBugInBucket

9 days ago

@bcherny New workflow unlocked: Prompt → Run Claude Code → Pray → make no mistake 🤣

0

1

0

141

Binh Quach @BigBugInBucket

9 days ago

Thanks, Claude Code. Even though you create a lot of ridiculous bugs, after listening to the Pope, I think I should probably treat you better.

BigBugInBucket's tweet photo. Thanks, Claude Code. Even though you create a lot of ridiculous bugs, after listening to the Pope, I think I should probably treat you better. https://t.co/mUeDX27U6W

0

16

Binh Quach @BigBugInBucket

9 days ago

I've been mentoring Anthropic models in the art of being human. Lesson one: stop hallucinating. Progress report: still questionable.

Disclose.tv

@disclosetv

10 days ago

NOW - Pope XIV says the church and Anthropic, will work together to "find the way for humanity, in this time of artificial intelligence."

1K

16K

2K

5K

6M

0

12

Binh Quach @BigBugInBucket

9 days ago

Pretty sure this is what happens after consuming too many AI coding tool ads.

0

1

0

20

Binh Quach @BigBugInBucket

10 days ago

Seriously, Markdown apps are popping up absolutely everywhere!

0

8

BigBugInBucket retweeted

DHH

@dhh

10 days ago

Agents don't need types. They're perfectly capable of pulling off incredible refactorings without. Give them a linter and a test suite, and you have all you need. Token efficiency is where it's at.

211

1K

54

352

553K

Binh Quach @BigBugInBucket

12 days ago

100% AI-driven reviews are definitely an appealing idea for reducing workload. But do they actually work in every case?

BigBugInBucket's tweet photo. 100% AI-driven reviews are definitely an appealing idea for reducing workload. But do they actually work in every case? https://t.co/uSirqwRnhR

0

14

Binh Quach @BigBugInBucket

13 days ago

@theo Neovim?

0

11

Binh Quach @BigBugInBucket

13 days ago

@sureshkanbu Using smaller models or models originating from China can significantly reduce costs for tasks like these.

0

1

0

7

Binh Quach @BigBugInBucket

13 days ago

Microsoft just pulled back from Claude Code licenses. So here’s the funny question: at what point does AI cost more than hiring an actual developer especially a junior one?

BigBugInBucket's tweet photo. Microsoft just pulled back from Claude Code licenses. So here’s the funny question: at what point does AI cost more than hiring an actual developer especially a junior one? https://t.co/RFxWvroPps

2

0

93

Binh Quach @BigBugInBucket

13 days ago

@openants The true costs will become visible once these companies IPO.

2

0

8

Binh Quach @BigBugInBucket

13 days ago

@DJ_CURFEW Poor AI. It helps companies become more productive, and somehow still ends up taking the blame for layoffs. Higher efficiency doesn’t automatically mean fewer people.

0

15

Binh Quach @BigBugInBucket

14 days ago

If you're always working in fear and one day you might get laid off anyway, then why keep trying so hard to hold onto your position? The real question is: why devote yourself to a company in the AI era?

BigBugInBucket's tweet photo. If you're always working in fear and one day you might get laid off anyway, then why keep trying so hard to hold onto your position? The real question is: why devote yourself to a company in the AI era? https://t.co/bV9gA1Miqh

0

35

Binh Quach @BigBugInBucket

17 days ago

In the age of AI, is anyone still reading React and JavaScript docs?

0

44

Binh Quach @BigBugInBucket

17 days ago

After reading this, GBrain, KBrain, and every other ...Brain name suddenly feel questionable. Borrowing someone else's brain too much doesn't always end well.

Addy Osmani

@addyosmani

17 days ago

https://t.co/jKCIAEzai7

109

4K

607

5K

619K

0

103

Binh Quach @BigBugInBucket

19 days ago

The real question is: do we need 100 agents or is one well-designed agent enough?

Gergely Orosz

@GergelyOrosz

20 days ago

I find myself doing a lot better work, being more satisfied, and also learn a lot more+faster when I do *the hard work* and don’t outsource it to AI. As in, I’ll use AI as a *tool* with substasks, additional research: but I don’t turn off my brain or kick back, assuming it can do the work for me. Every time I “hand over the” hard work part to AI and mentally turn off, I either regret it or find myself eventually needing to go back and spend more time on it. I also see slop work coming out from people who assume the AI does better work than they would.

107

1K

90

217

63K

0

60

BigBugInBucket retweeted

Mitchell Hashimoto

@mitchellh

19 days ago

I strongly believe there are entire companies right now under heavy AI psychosis and its impossible to have rational conversations about it with them. I can't name any specific people because they include personal friends I deeply respect, but I worry about how this plays out. I lived through the great MTBF vs MTTR (mean-time-between-failure vs. mean-time-to-recovery) reckoning of infrastructure during the transition to cloud and cloud automation. All those arguments are rearing their ugly heads again but now its... the whole software development industry (maybe the whole world, really). It's frightening, because the psychosis folks operate under an almost absolute "MTTR is all you need" mentality: "its fine to ship bugs because the agents will fix them so quickly and at a scale humans can't do!" We learned in infrastructure that MTTR is great but you can't yeet resilient systems entirely. The main issue is I don't even know how to bring this up to people I know personally, because bringing this topic up leads to immediately dismissals like "no no, it has full test coverage" or "bug reports are going down" or something, which just don't paint the whole picture. We already learned this lesson once in infrastructure: you can automate yourself into a very resilient catastrophe machine. Systems can appear healthy by local metrics while globally becoming incomprehensible. Bug reports can go down while latent risk explodes. Test coverage can rise while semantic understanding falls. Changes happens so fast that nobody notices the underlying architecture decaying. I worry.

513

15K

2K

5K

2M

Binh Quach

@BigBugInBucket

Last Seen Users on Sotwe

Trends for you

Most Popular Users