Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.
Last September I argued that Claude Code was becoming less of a coding assistant and more of an OS for agentic workflows.
I also wrote: “Cursor could do the same if they shipped an SDK.”
Cursor shipped theirs on Apr 29.
Great to see this direction becoming real.
read Mario Zechner's "slowing the fuck down" post. mostly skill issue dressed as wisdom, partly harness issue, partly model capability. not buying the doom narrative.
Bought a @system76 Lemur Pro for the Linux experience and the "right to repair" promise. Tab key broke. Their solution: $175 keyboard assembly + $143 shipping to Europe. For one key. No individual parts, no EU option. The repair-friendly laptop I can’t repair.
LiteLLM HAS BEEN COMPROMISED, DO NOT UPDATE. We just discovered that LiteLLM pypi release 1.82.8. It has been compromised, it contains litellm_init.pth with base64 encoded instructions to send all the credentials it can find to remote server + self-replicate. link below
@gkossakowski lol, maybe, but read this: "The agent could reason over the diff, call tools, and decide where..." - I dont think any human at @cursor_ai would try to make a point that agents with tools are better than agents without tools... not in 2026 :)
Unsupervised eval for coding agents: https://t.co/Vmb8Ty7dmA Agents playing treasure hunt! Challenger agents hide bugs, reviewer agents try to find them, and an LLM matcher scores assignments. No human labeling.
Unsupervised eval for coding agents: https://t.co/Vmb8Ty7dmA Agents playing treasure hunt! Challenger agents hide bugs, reviewer agents try to find them, and an LLM matcher scores assignments. No human labeling.