I asked @coderabbitai 's VP of AI, David Loker: "How to build an agent that actually works."
What he said is confirmed by research.
https://t.co/L4XZgiI6c8
@jxnlco Writing. Gpt-5x voice is not good. Don't get me wrong anything is better than 4o. However opus/claude will follow a style guide and stylistically much better.
Its also the octopus in Chinese room with a lot of vocabulary of self and metaphilosophy....it doesn't "know" what it is saying and has no memory aside from what you gave it.
You and I have some common ground, Kevin: Claude is neither human nor conscious.
But as for Claude having a theory of self, I assert that is true if and only if one applies a particularly sparse meaning for self: Claude has no ability to understand; it possesses no agency, goals, or motivation; it has no sense as to the borders of its self.
Claude may have some degree of self-reflection in the sense that it can feed its dialog back to itself - sort of a simple Hofstadter strangle loop - but this also is more of a small ripple in the fabric if a true self.
In the end, I think the problem is that self is a suitcase word: it is full of historical and emotional baggage and thus is dangerous to apply all to this piece of software..
Remember these predictions?
“Software engineering will be dead because of AI” —> we’re seeing more demand for sw engineering (good part thanks to AI)
“SaaS will be dead because of AI” —> SaaS businesses growing massively (in part thanks to AI)
Be careful what you believe
@RogerAlsing Same kinda thing. If their training set happened to be trained on your problem then genius. If not then dumbish. So moments where you're like "better than gpt or claude." Its no GLM but for its size and speed super impressive.
@peteralexbizjak@JamesWard@ahmetb (but you missed that the principal objective was to call my former colleague, who is about the same age as me, old...)
@danveloper I've had a different experience. One provider or the next doesn't understand the others tool call format, various models require different tool call ids or formats, structures, different rules for reasoning blocks, etc.
@peteralexbizjak@JamesWard@ahmetb Decades of experience with it. Running around the world tuning the GC. I have a longer rant about the time and place has passed (multi-platform compilation is no longer primitive). The larger goals have/are been/being eclipsed by better tech.
@JamesWard@ahmetb Okay but Java and the JVM eww yucky. Its now as old as COBOL was when Java came out (roughly). Lets use Rust and be done with the GC voodoo. I'm too young to write Java.
@Grady_Booch@claudeai You're asking the wrong guy. The model doesnt control what is in its context. Its like putting toast in the toaster and telling it not to make toast when you push the thing down. you have to prevent the cli from loading the context...
Comon @OpenAI fess up you got /r/copypasta in your training data https://t.co/KCDUY2YcbA and thats where they came from. The other creatures are also copypasta obsessions...
@Grady_Booch@notRichardRen@hendrycks ...and if your paper is on the well-being of a stateless algorithm.... you are definitely part of the technical marketing team...
@Grady_Booch@notRichardRen@hendrycks Writing an article about this, but I've begun to suspect everyone involved in "AI risks" or "safety" is actually part of the technical marketing team, even if they work at a university... Give a random string generator shell access, and it will eventually format your drive.
So Grammarly (which, weirdly, now uses @Superhuman as its handle) flagged some of what I wrote as likely AI-generated. Is that a compliment or an insult? I'm not entirely sure.
Ready to put a verdict down on GPT 5.5. It is definitely better than 5.4 and way more agentic. They reduced its constant need for "if you want I can" ChatGPT engagement prompts. It still isn't very good at writing. It is a great researcher but a bad writer.