anybody else notice claude (1m context) is VERY eager to 'pick this up in the next session' when ctx > 180k? I can only assume an agent hardcoded a number to sidestep a broken test at some point.
pair programming with claude is garage beers with your buddy. except you're the only one drinking and he never gets tired. tonight we're reviving a dead 48v golf cart battery.
ok real talk, claude code opus 4.6 is garbage now. routinely answers without actually reading files. can't interpret images. reasoning is max. @bcherny@AnthropicAI what is going on? defaulting to codex xhigh with much better results
ok real talk, claude code opus 4.6 is garbage now. routinely answers without actually reading files. can't interpret images. reasoning is max. @bcherny@AnthropicAI what is going on? defaulting to codex xhigh with much better results
the current fear is is that AI homogenizes culture and turns humans into passive consumers
one counterpoint: in Go, human play showed very little improvement from 1950 to 2016 until alphago beat lee sedol - then human decision quality jumped. players started developing moves that were distinct both from previous human moves and from the novel moves introduced by machine intelligence
this seems more likely to me - fun times ahead
trying to avoid confirmation bias, but came here to complain about opus being very dumb lately (yes i always use max reasoning) and turns out everyone else feels the same way. was having claude do codex consults on almost every turn. then crazy idea -- what if I just used codex? turns out, codex is pretty awesome.
friday fun: `diamond-replay`
turn gamechanger baseball streams into analysis. tui for viewing. can also output json for things like generating youth specific game-health dashboards (e.g. https://t.co/USk3VQpglt)
https://t.co/kW7TpN6VV3
missing from gstack: /autobuild
an alias for: /loop 30m if we have nothing to do, build the next chunk. if we're stuck, run a /codex consult. when chunks complete run /simplify and /codex review. commit. repeat.
mkdn 0.1.7 (+0.1.6).
- git-aware sidebar shows your current branch and dirty files now.
- footnotes click-to-scroll between reference and citation.
- new table layout engine.
- breadcrumb bar (cmd-j to jump to a section).
also fixed a link cursor bug that's been there since launch. turns out TextKit 2 can't do coordinate math.
`brew upgrade mkdn`
@TheAhmadOsman ive been optimizing 2B on transcription correction. there is so much knowledge embedded in the weights. pal-4 < 1gb, 65tok/s on m2, it is really incredible.