Author of HEC-Commander Tools and RAS-Commander library. Engineer. Modeler. AI enthusiast. See my open source repos for HEC-RAS modeling and automation work
@bcherny@theo@benhylak Bro y’all have to do better, remote control has been broken for nearly 2 weeks.
I don’t give a f about new features, fix the ones you’ve already added
@turbo_xo_@MatthewBerman Codex is slow and a bit regarded if you aren’t specific enough, but it does pick up on stickiest problems and figures them out. Anthropic’s models are much better at task management and actually doing things.
@thsottiaux@daniel_mac8 Awesome, I’ve been waiting for nearly 6 months for most of the recent features. Recently reactivated the more expensive subscription but still lots of work to do….
@johnnydamacha@runaway_vol@chadeyecom I don’t know why people glaze 5.4, it’s terribly slow and just as brittle as previous iterations. I want to like it, but I wonder how people make these statements as fact. It’s like they don’t actually use the model for anything important.
@steipete@Voxyz_ai My first instruction was to never download skills from the internet. I think people rightly assume it’s a huge malware vector, and don’t report. Honestly you should just move to a codex review gate to improve trust.
@unclebobmartin It sounds like you really need to look at Hooks! They help immensely with exactly the type of behavior you are describing. You can force the model to loop/self-verify until complete. It’s not perfect, but it helps a lot with context drift.
@AaronHenray@thsottiaux@0thernet IME Claude still reliably completes tasks with minimal steering. Especially long running tasks that rely on features within the CC harness. Codex is more powerful but requires more steering. Codes is a more powerful model with a more chaotic jagged edge and a weaker harness.
@AaronHenray@thsottiaux@0thernet IME Claude still reliably completes tasks with minimal steering. Especially long running tasks that rely on features within the CC harness. Codex is more powerful but requires more steering. Codes is a more powerful model with a more chaotic jagged edge and a weaker harness.
@thsottiaux@markchen90@0thernet I don’t know what you guys are working on, but I use both Codex and CC regularly and I have never had this experience. Just feels like Coke vs Pepsi debates. Everyone convinced they are superior because of their subjective preference.
@Austen@joelgrus You’ve got other settings you’ve already changed, like whitelisted tool calls. Your tweet doesn’t make sense, try again on a fresh machine.