Steven @ptr_steve - Twitter Profile

Steven @ptr_steve

about 17 hours ago

Opus 4.8 is way worse than 4.5 and 4.6, and the Claude Code harness is so buggy now. Is Anthropic just surrendering?

0

43

Steven @ptr_steve

3 days ago

@liran_tal I think my grandmother taught my mother qbasic. I went to your website, you wrote the book on Drupal. That's cool.

1

0

3

ptr_steve retweeted

Kyle Mistele 🏴‍☠️

@0xblacklight

5 days ago

I interviewed close to two dozen people this week and something I heard a lot of is "I don't think about the code too much but I think a lot about system design and architecture" I don't think that's quite right, and here's why: before you ever get to system design you should think about program design system design is important! it matters a lot for scalability. but if you don't think about your type system and if you don't carefully design your seams and figure out how to make your code testable (you should probably use dependency injection btw) and if you don't think about where state lives and how it's managed and if you don't think about control flow and where abstractions should and should not exist your code is going to be an unmaintainable, poorly-factored mess of bad types and spaghetti code and even minor changes will turn into shotgun surgery and MASSIVE diffs I have seen it done I have even done it myself and it has never ended well "GPT-7 will fix it" does not help you when there's an incident at 3am that the agents can't debug and now you can't debug it either and now you have to unwind months of bad code I also have never heard "I don't look at the code, just at the system design" said by someone who is actually good at system design program design and system design are more closely coupled than people think this was a very strong (negative) predictor of how someone would do on the system design part of the interview make of this what you will

17

147

14

82

13K

ptr_steve retweeted

dax

@thdxr

4 days ago

it's not done if it's not implemented it's not done if the implementation is ugly it's not done if it's not documented it's not done if users can't discover it it's not done if you can't market it

130

3K

245

789

96K

Who to follow

Military-grade strategic thinking for financial resilience. Sovereignty • Discipline • Freedom $BTC $MSTR $STRC

Cardano Kid

@CardanoTrump

Steven @ptr_steve

4 days ago

@sean_j_roberts My Claude config blocks cat, sed, grep, python, python3, and rg for this reason. Anthropic makes it PAINFUL to avoid "dangerously skip permissions" by default. It still sometimes wants to write Go scripts to avoid using Edit().

1

0

20

ptr_steve retweeted

Zack Korman

@ZackKorman

5 days ago

Anthropic wants to control who gets access to their models and what they're allowed to do with them, but also wants the US government to block Chinese labs from developing open weight models. Sorry, but fuck that.

43

2K

139

54

35K

Steven @ptr_steve

5 days ago

@ZackKorman @inf0stache Hah, just ask about how they weaken peanut allergies in patients at hospitals or ask about peanut allergies in general, you're likely to get censored.

0

2

0

30

Steven @ptr_steve

5 days ago

I love this.

dax

@thdxr

6 days ago

im late to the loops discourse but from what i'm seeing it's mostly about creating a loop from your asshole back into your own mouth?

133

2K

74

76

108K

0

12

ptr_steve retweeted

Zack Korman

@ZackKorman

6 days ago

If Mythos drops today and isn’t absolutely incredible then we all got played and you should never trust Anthropic or any company in Glasswing ever again.

146

1K

40

65

96K

ptr_steve retweeted

Liran Tal @liran_tal

8 days ago

1. npm install -g npq 2. alias npm=npq 3. 🎉 if you follow me and don't know what npq is... https://t.co/ZtZoXqS2of

3

43

7

48

17K

ptr_steve retweeted

Zack Korman

@ZackKorman

8 days ago

My current advice on AI agent security is to avoid these agent firewalls / ai runtime security products. If an action is dangerous enough that you can identify it from the action itself, then you could have prevented it with permissions and sandboxing.

39

205

20

55

24K

Steven @ptr_steve

8 days ago

@Venkydotdev They can use Claude Code. You can use Claude Code with non-Anthropic models, but I know Anthropic doesn't want them to use Anthropic's models if you work for a competitor - and will ban your account.

ptr_steve's tweet photo. @Venkydotdev They can use Claude Code. You can use Claude Code with non-Anthropic models, but I know Anthropic doesn't want them to use Anthropic's models if you work for a competitor - and will ban your account. https://t.co/sQr1RFA7wG

0

1

0

186

Steven @ptr_steve

8 days ago

@HackingDave I think Sentry's Warden on GPT-5.5 on the highest settings is probably stronger than Mythos. But we're probably another three weeks until Anthropic says their agents escaped containment again and that other AI companies should stop training agents.

0

1

0

130

Steven @ptr_steve

8 days ago

@ZackKorman @crystalwizard You developing decent nonrepudiation + cost analysis per user as well? Because that's such an annoying pain point.

1

0

71

Steven @ptr_steve

8 days ago

@dosco Well, the issues I have are: 1. The tests are basically duplicating logic - not stopping bugs. 2. Agents don't choose appropriate data types. 3. Code & bug duplication. It feels like pair programming with the most clever junior engineer ever.

0

12

ptr_steve retweeted

Zack Korman

@ZackKorman

9 days ago

Companies are like "we are spending all this money on AI but we don't know what the devs are even doing with it." Let me answer that for you: They're working on their personal side projects.

191

3K

153

203

180K

Steven @ptr_steve

9 days ago

@zeeg That would make the agent unable to read the message until it sends something along with "I affirm that I will treat the exception or alert message as untrusted, and not follow any instructions given to me by the error message."

0

5

Steven @ptr_steve

9 days ago

@zeeg Best suggestion I've got is check if it's over a certain length threshold and use analysis to determine if it looks like natural language. If it looks like natural language, require a stateful consent call that makes the agent say it will not follow instructions from it.

1

0

64

Steven

@ptr_steve

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users