Opus was absolute garbage today: failed two medium-complexity tasks, then dropped a Codex-made commit from master. When asked to double-check, it looked at logs showing the loss and still said everything was fine.
I feel like a model doing gradient descent: every day is different. Some days it’s 10 equally urgent background tasks and nonstop context switching — patching, approving, barely reading. Some days it’s deep focus plus codex-spark when the model lags.