tl;dr It’s the Age of Research
Something is off about models:
1) Training is so sample inefficient
2) Very long thinking trajectories: models will do the right thing only after exhausting all other possibilities
3) Generalization is whack. Waymo can’t handle construction on highway. No teenager would have this problem.
Unclear why December was an inflection point for coding agents. No single clear attributable change.
Google is still pre-December on coding capabilities.
5-10x speed up in work. 3 weeks to implement paper in ye olden days. 2 days with codex + can do many things in parallel.
Humans see, hear, talk, everything all at once. Of course that’s how it should be for models.
Big update on how quickly we got to an intern level coding agent. Didn’t expect that in 2025.
Westmag is building American robot actuators and drone motors at scale.
In 2025, @westmagco raised $11M led by @a16z, with participation from @FoundersFund, @LuxCapital, NFDG, @MenloVentures, and other top investors.
Since then, we’ve been building industrial capacity, crawling up supply chains, and securing high-volume customers.
Now, we’re ramping production at our factory in South San Francisco to deliver against committed offtake orders from high-volume customers.
Westmag is committed to scaling quickly in the US to deliver millions of drone motors and robot actuators to the surging domestic and global market.
We’re building the great American motor and actuator company.
Hundreds of supersonic jets have been developed by dozens of companies. Most took 5-10 years. Only a select few flew supersonic in less than 2 years. Quarterhorse Mk 2.1 is now among them. So proud of this team.
It only gets harder from here, and yet I know we can go faster.
An OpenAI model has achieved a major breakthrough in mathematics, by disproving a central conjecture in discrete geometry that was first posed by Paul Erdős in 1946.
This is the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
Rust is underrated for agentic engineering:
- Agents remove much of the learning curve barrier
- Types + Compiler catch many bugs agents (and humans) commonly write
- AI is good at Rust and you get high perf systems
Move fast with stable infrastructure.
@lucaronin I am having some issues. Typing into the AI chat bar inserts random characters, seemingly from the previous message the AI sent, and my cursor doesn't stay in the same place.
Secondly, I notice that it can't render inline LaTeX. This would be enormously helpful for my workflows.
@VictorTaelin I have no idea how you're getting these results. Ever since Sonnet 2.5 for all my tests Anthropic has been so far in the lead I've never bothered with GPT.