I've spent quite some time researching and experimenting with #ChatGPT to write a thorough post. It's done at last!
https://t.co/U1g1LvCrOO
I would love your feedback! Tell me what's missing or needs to be improved 🙏
Meet Gemma 4!
Purpose-built for advanced reasoning and agentic workflows on the hardware you own, and released under an Apache 2.0 license.
We listened to invaluable community feedback in developing these models. Here is what makes Gemma 4 our most capable open models yet: 👇
On DeepWiki and increasing malleability of software.
This starts as partially a post on appreciation to DeepWiki, which I routinely find very useful and I think more people would find useful to know about. I went through a few iterations of use:
Their first feature was that it auto-builds wiki pages for github repos (e.g. nanochat here) with quick Q&A:
https://t.co/DQHXagUwK0
Just swap "github" to "deepwiki" in the URL for any repo and you can instantly Q&A against it. For example, yesterday I was curious about "how does torchao implement fp8 training?". I find that in *many* cases, library docs can be spotty and outdated and bad, but directly asking questions to the code via DeepWiki works very well. The code is the source of truth and LLMs are increasingly able to understand it.
But then I realized that in many cases it's even a lot more powerful not being the direct (human) consumer of this information/functionality, but giving your agent access to DeepWiki via MCP. So e.g. yesterday I faced some annoyances with using torchao library for fp8 training and I had the suspicion that the whole thing really shouldn't be that complicated (wait shouldn't this be a Function like Linear except with a few extra casts and 3 calls to torch._scaled_mm?) so I tried:
"Use DeepWiki MCP and Github CLI to look at how torchao implements fp8 training. Is it possible to 'rip out' the functionality? Implement nanochat/fp8.py that has identical API but is fully self-contained"
Claude went off for 5 minutes and came back with 150 lines of clean code that worked out of the box, with tests proving equivalent results, which allowed me to delete torchao as repo dependency, and for some reason I still don't fully understand (I think it has to do with internals of torch compile) - this simple version runs 3% faster. The agent also found a lot of tiny implementation details that actually do matter, that I may have naively missed otherwise and that would have been very hard for maintainers to keep docs about. Tricks around numerics, dtypes, autocast, meta device, torch compile interactions so I learned a lot from the process too. So this is now the default fp8 training implementation for nanochat
https://t.co/3i5cv6grWm
Anyway TLDR I find this combo of DeepWiki MCP + GitHub CLI is quite powerful to "rip out" any specific functionality from any github repo and target it for the very specific use case that you have in mind, and it actually kind of works now in some cases. Maybe you don't download, configure and take dependency on a giant monolithic library, maybe you point your agent at it and rip out the exact part you need. Maybe this informs how we write software more generally to actively encourage this workflow - e.g. building more "bacterial code", code that is less tangled, more self-contained, more dependency-free, more stateless, much easier to rip out from the repo (https://t.co/iKJUoHiIpl)
There's obvious downsides and risks to this, but it is fundamentally a new option that was not possible or economical before (it would have cost too much time) but now with agents, it is. Software might become a lot more fluid and malleable. "Libraries are over, LLMs are the new compiler" :). And does your project really need its 100MB of dependencies?
Introducing GLM-OCR: SOTA performance, optimized for complex document understanding.
With only 0.9B parameters, GLM-OCR delivers state-of-the-art results across major document understanding benchmarks, including formula recognition, table recognition, and information extraction.
Weights: https://t.co/vqIBgBCXYi
Try it: https://t.co/Ld7H8Pasls
API: https://t.co/xVLNG0XSfP
Weekend thoughts on Gas Town, Beads, slop AI browsers, and AI-generated PRs flooding overwhelmed maintainers. I don't think we're ready for our new powers we're wielding. https://t.co/J9UeF8Zfyr
We brought the Ralph Wiggum loop to goose with a multi-model approach where one model does the work and a second model reviews it.
Tutorial 👇
https://t.co/dtL5U8QD7O
in OpenCode v1.1.18 we implemented the detailed planning flow you find in cursor or claude code
but not 100% sure about it yet so we put it behind a OPENCODE_EXPERIMENTAL_PLAN_MODE=1 feature flag
quick video on how it works but if you have feedback please put it on this thread
One useful way to think about agents: they’re control systems. Generating output is easy. Feedback is everything.
At Ramp we built a background coding agent, Inspect, that can actually translate requests in English into code, and then observe reality: tests, telemetry, and feature flags — plus visual checks for UI work (screenshots/live previews). It doesn’t just propose diffs; it iterates until the evidence says the change is correct.
Two consequences surprised me:
1. Cheap, parallel sessions change behavior. When an agent runs in a real sandboxed dev environment (not your laptop), you stop babysitting and start running more iterations.
2. Multi-client + multiplayer matters more than people think. If it shows up in the places work already happens (PRs, Slack, web, VS Code) and you can hand a session to a teammate, it becomes shared infrastructure, not a novelty.
We’re now at ~30% of merged PRs in our core repos authored by Inspect, without mandating it. People from essentially every job function, not just engineering, submitted code last week. Wild times.
Aaaaand ralph-tui is live - thanks for your patience
https://t.co/Q90AQlAYk6
It's been a fun day using ralph-tui to build ralph-tui.
All the details in the repo but:
- Install w/ your fave package mgr eg 'bun install -g ralph-tui'
- First time setup 'ralph-tui init'
- Create a PRD and tasks 'ralph-tui prime'
After that you'll be dropped into the TUI to start the ralph loop.
Tons of tweakability for those that care.
On that note, I'm out for the night 🤘
h/t @GeoffreyHuntley 🤠
@ryancarson@danshipper@kieranklaassen@clairevo@mattpocockuk@gregisenberg