New TIL: I figured out how to use my LLM CLI tool in a shebang line, which means you can write executable scripts in English, or hook up more complex scripts with a snippet of YAML template
Everyone can use the same phone - broke or billionaire. Buffett's "democratization of luxury"
Is this ending for AI with Mythos class models?
Or is this just a bump before intelligence becomes a utility (while we figure out how to do it safely?)
I understand and agree with Anthropic's choice to not release Mythos publicly (yet) but it makes me feel very vulnerable and I don't like it.
I really don't like one company deciding who gets intelligence and who doesn't.
Again I get it but it doesn't feel good as an entrepreneur.
if you’re freaking out about Mythos, remember:
Never make any major life decisions within 30 days of a meditation retreat, psychedelic trip, or first encounter with a frontier AI model.
Announcing OpenCode School!
A free, self-paced course that teaches you how to use OpenCode, the open-source AI coding agent.
No account required. No personal data collected. Free forever.
https://t.co/I6O2k1bXCn
Chat, my nanochat (left) with its onboard wasm-interpreter is now clearly exceeding @karpathy’s nanochat (right) on a range of computation tasks. The wasm interpreter plus cross attention only adds about 300 million params, a marginal increase in params for a big boost!
You could call it tool use but it’s a single transformer that can both predict the next token and is a functioning wasm machine, there is no external tool.
I haven't dove into the details yet .. .but @brendanh0gan is the real deal. He has been pumping out deep work for ages. I first found him when he shared a perfect for learning GRPO/RL repository after the "deepseek" moment.
Follow him / check out his work
introducing: Loophole - an agentic system that translates your natural language moral beliefs into codified laws, and then runs adversarial agents that try to come up with legal scenarios that break your laws - either a scenario that is immoral and legal, or vice versa - a judge agent fixes the law if it can do so consistently, but if there is an inconsistency you as the user must decide what is best.
you can work with the system until your legal framework can't be broken by the agents - and you get as output a legal system that is aligned with your moral code
more details and code below
Great stat in here: Claude Code went from 17% to 92% on our eval set once it had access to LangSmith traces and Skills. A coding agent without trace data is just guessing at fixes
Announcing Codex.
A new product from OpenAI that moves beyond coding, into cooking. We were already cooking before, but now *you* can cook too ... with Codex. It is powered by the same technology as our other Codex products. You can just cook things.
Software dev has already changed a lot since the beginning of the year
And seems like both Anthropic and OpenAI will have much better models by the end of the year.
Software has always been very inefficient to make. And now it will be not perfectly efficient but orders of magnitude more so.
Is there a new kind of Efficient Market Hypothesis for the software industry?
Ie if you only have public information that everyone else also has, you probably shouldn’t use it to trade stocks or build a startup on. There’s little alpha there, and roaming apex predators with more GPUs than you.
I find enterprise interesting because it’s a non-public slog. Long procurement processes, no public docs, bespoke fractals of internal processes and jargon and messy human context that isn’t public or legible to labs or others yet.
Previously, context (eg a well written internal Google doc) was cheap relative to the cost of building software. Now it’s flipped, weirdly
If building software becomes more efficient, where are the “private context slogs” worth making?
Ie curating non-public context that (when combined with public agents) unlocks new value to businesses
Has anyone been thinking about the Efficient Software Hypothesis, and its implications?
If you are not yet following @Prince_Canuma do it now!
He is the man behind many of the engines powering local AI on your Apple Silicon, leveraging Apple MLX framework. 🚀