Claude Code creator:
"Loops are as big a step as move from source code to agents. Loops - step from agents to the next thing.
30% of my code is fully written by loops right now."
in a 40-minute fireside chat, Boris Cherny reveals his actual working setup.
Loops + dynamic workflows + routines, and more.
Watch the full episode, then read the article on loop engineering below.
Creator of Claude Code:
"At Anthropic, almost 100% of our engineers are running 100+ agents with self-improving loops
self-improving loops help agents become better with each run."
in a 1-hour podcast, Boris explains how they build agents loops from sratch.
Claude + loops + routines + dynamic workflows - that’s the secret.
Watch the talk, then read how to apply the same playbook to quant trading below.
Australians have older looking skin.
In one week, the Australian sun aged my skin by 5%.
> 15% stronger UV; fair skin can burn in under 15 mins
> 2 in 3 aussies skin cancer before age 70
> 2-3x increase in melanoma risk vs USA
A study involving 1472 Caucasian and Asian women found that signs of skin aging appeared 10–20 years earlier in Australian women than in US women from the same study. Australians self-reported higher rates of change and significantly more severe facial lines and volume-related features like tear troughs and naso-labial folds than women from the other countries.
The sun in Australia is very intense.
The Australian sun increased my skin aging by 5% (via UV damage and spots).
Even though I:
> I used my umbrella when in sun
> protected my skin during peak UV
> and still my skin UV damage increased by 5%
Australians get more intense UV radiation. For example, during summer the earth's orbit puts the Southern Hemisphere closer to the sun, making the sun stronger than in most of the U.S.
Even outside of Southern Hemisphere summer, Australia is almost a "perfect storm" for UV exposure:
> thinner ozone overhead makes it stronger
> cleaner air allows UV to penetrate deeper
> high solar elevation angles due to its relatively low latitude, meaning sunlight reaches the surface more directly and passes through less atmosphere
This is particularly true in Queensland, where proximity to the equator further amplifies UV intensity. Most people in Queensland live roughly at 17–28° from the equator, compared with ~34° for Los Angeles, allowing the Sun to reach significantly higher elevations in the sky and resulting in more intense UV radiation at ground level.
More than the increased cancer risk, up to 90% of visible facial aging is from UV.
The sun is great. You want the right amount. Not too much and not too little.
GLM-5.2 is the most impressive LLM I've ever seen *aside from Fable 5*
The fact that this level of intelligence is now open-sourced is pure insanity.
Just 3 months ago, this would've been considered the most powerful coding model in the world, and if Fable 5 didn't launch, it would be #1 right now.
For 99% of people, this model can handle ALL of your daily tasks, and you never need to worry about getting the plug pulled on your workflows (like Fable 5).
Bullish on the future of actually owning your intelligence, and this is a massive leap forward.
Anthropic research lead:
"99% of our engineers are running swarms of 300+ self-improving agents.
close the agent loop. Give the model a way to verify its own output"
in a 20-minute session, Anthropic team member explains how to build a model that improves itself.
Claude + loops + plan mode + dynamic workflows -that’s the secret.
Watch the talk, then save the playbook below.
We're introducing GLM-5.2, our latest flagship model for long-horizon tasks. It marks a substantial leap in long-horizon task capability over its predecessor GLM-5.1 and, for the first time, delivers that capability on a solid 1M-token context. GLM-5.2's new capabilities include:
Solid 1M Context: A solid 1M-token context that stably sustains long-horizon work
Advanced Coding with Flexible Effort: Stronger coding capabilities with multiple thinking effort levels to balance performance and latency
Improved Architecture: We propose IndexShare, which reuses the same indexer across every four sparse attention layers, reducing per-token FLOPs by 2.9× at a 1M context length. We also improve GLM-5.2’s MTP layer for speculative decoding, increasing the acceptance length by up to 20%
Pure Open: An MIT open-source license — no regional limits, technical access without borders
Supporting long-horizon tasks starts with making long context engineering-usable: the model must maintain quality across long, messy coding-agent trajectories, not just accept more tokens. A 1M context is easy to claim, but much harder to keep reliable under real engineering pressure. To this end, we substantially expanded 1M-context training for coding-agent scenarios, covering large-scale implementation, automated research, performance optimization, and complex debugging. The result is a long-context system that is not only wide in scope, but solid in execution: a practical substrate for sustained engineering work.
This capability is reflected in GLM-5.2's performance on three long-horizon coding benchmarks. FrontierSWE measures whether an agent can complete open-ended technical projects at the scale of hours to tens of hours, spanning systems optimization, large-scale code construction, and applied ML research. On this benchmark, GLM-5.2 trails Opus 4.8 by only 1%, while edging out GPT-5.5 by 1% and Opus 4.7 by 11%. On PostTrainBench, where each agent is given an H100 GPU and evaluated by how much it can improve small models through post-training, GLM-5.2 outperforms both Opus 4.7 and GPT-5.5, ranking second only to Opus 4.8. On SWE-Marathon, an ultra-long-horizon software engineering benchmark covering tasks such as building compilers, optimizing kernels, and developing production-grade services, GLM-5.2 still has room to grow, trailing Opus 4.8 by 13% while remaining second only to the Opus series. Across all three benchmarks, GLM-5.2 is the highest-ranked open-source model, showing that its 1M context has translated into practical long-horizon delivery capability.
I was wrong
I've been saying for months that open source AI models are 6 months behind frontier
They caught up. GLM 5.2 is as good as Opus 4.8
This changes everything. If you run GLM 5.2 locally no government can take it away. You become sovereign
And even if you run through APIs, its a fraction of the cost
The battlefield is different now. If open source is as good as frontier, and people have cheaper alternatives, governments can't be as quick to regulate. It will destroy the frontier AI labs
All of this is such a massive win for the people
If you are not paying attention to local models yet, you are making a tremendous mistake
Exciting news: GLM-5.2 (Max) ranks #2 in Code Arena: Frontend, with +29pt over Claude Opus 4.7 (Thinking) and only behind Fable 5! GLM-5.2 is the best open model vs Kimi-K2.6 and Minimax-M3 by a large margin.
- #2 React and #4 HTML sub-leaderboards
- Ranks as the top model in nearly all sub categories: Brand & Marketing, Reference-Based Design, Data & Analytics, Consumer Product, Gaming, and Simulations.
Congrats @Zai_org for the incredible milestone!
NEW: Inside the 24-hrs before WH slapped export controls on Anthropic
- Last Thursday, Amazon CEO Andy Jassy raised concerns about Fable jailbreak to Trump admin
- Friday AM, Sean Cairncross, Bessent, Susie etc. held WH call to discuss
- Then White House started reaching out to Anthropic to speak with Dario Amodei, who was at a wellness retreat.
- When Amodei was finally available past 1pm, he had three tense phone calls with a combo of ppl including Cairncross, Bessent, Lutnick, Kessler, Will Scharf, Richard Walters, and Walker Barrett.
-Amodei tried to clear up what he assumed was a misunderstanding. He defended the guardrails and distinguished between universal and non-universal jailbreak
- Cairncross and Bessent were unmoved and asked Amodei to take down Fable and work with the admin to fix the vulnerabilities. (A WH official said Amazon’s findings were run past the NSA and they felt they had “proof.”)
- Amodei asked for more time and info, but he made no commitments to pull the model
- Bessent told Amodei directly at one point that he was making a “bad decision”
- By Friday evening, the Trump admin imposed its export controls.
- “Export controls were a last resort after begging them for hours to work with us,” senior WH official said.
W/ @cheyennehaslett
https://t.co/0Rwny9md3p
WE ARE SOOOOO BACK!
Someone leaked the Claude Fable 5 system prompt and ran it on an Opus 4.8
Output is like 90% of the real thing
Turns out half the magic was never the weights. It was the prompt the whole time
Repo down below:
All that we see is constant criticism and demotion of Anthropic at the expense of Open AI. Truly shows the level of involvement the US Gov has in Open AI, they are trying to provide them with an opportunity to catch up. If there was a jailbreak then provide the details of this jailbreak, if the frontier lab can’t set its own guard rails without interference then it’s no good. Does the US Gov think they are better than Anthropic at building AI models? If the answer to that question is no, then as Elon put it best ‘GFY’.
I’ve had a number of conversations with folks inside and outside government about the current situation with Anthropic, and here is what I believe to be true:
— As we know, Anthropic publicly released its Mythos class models earlier this week under the commercial name Fable.
— Fable is Mythos with guardrails. But if those guardrails fail, then you’ve exposed Mythos and its advanced cyber capabilities to people who shouldn’t have them. (Keep in mind that Anthropic itself widely promoted the idea that Mythos was a cyberweapon and needed to be regulated as such. They asked for government regulation of Mythos and championed the guardrails on Fable. If there is a vulnerability — big or small — it is Anthropic’s responsibility to patch.)
— A highly credible trusted partner of both Anthropic and the USG who was testing Fable came forward with a jailbreak of those guardrails. The Admin asked Dario to fix the jailbreak or de-deploy the model. Dario refused.
— In their blog post, Anthropic defended its decision by saying the jailbreak isn’t serious. That is not what the trusted partner and the USG believe; nor is that kind of minimizing language consistent with Anthropic’s brand as the AI safety company. It’s difficult to fathom how they could claim a jailbreak allowing operability of a cyber weapon could be defined as not “serious.”
— In the past, Anthropic has always said that safety must be top priority and taken super seriously. In this case, Anthropic prioritized the continued offering of the consumer model over safety.
— In reaction, the Admin issued the export control. The Admin did this reluctantly. It’s been very surprised that Anthropic hasn’t wanted to cooperate with a reasonable safety request (ie fixing the jailbreak issue). Anthropic’s reaction is very much at odds with their branding and ethos as a safe AI research community.
— The Admin’s hope now is that Anthropic remediates the safety issue, the export control is lifted, and Fable goes back into general release. The Admin wants all of this to happen as soon as possible. It is frankly bewildered that Anthropic hasn’t wanted to comply with safety requests that it previously said were its highest priority.
— Those trying to misdirect and tie this action to the prior DoW/Anthropic issues are wrong. The Admin values Anthropic’s technical capabilities and feels that this issue, while serious, should be easily resolved. The ball is in Anthropic’s court.
Claude Fable 5 just changed the AI game.
People are one-shotting games, 3D worlds, app builders and insane code optimizations. There's a major shift.
10 examples:
Anthropic and OpenAI are both telling engineers to write loops.
Not prompts.
Not agents.
Loops.
That is not a coincidence.
When the two most important AI labs on the planet independently converge on the same pattern — that is a signal worth paying attention to.
Most engineers are still thinking in terms of single calls.
Input → model → output.
The engineers winning in 2026 think in cycles.
Output becomes input. The model evaluates its own work. The loop runs until the result is right.
This is the complete breakdown of what loops are, why they matter, and how to build them ↓
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use.
Its capabilities exceed those of any model we’ve ever made generally available.
Claude Code's creator said something that stopped me cold:
"I don't prompt Claude anymore. I write loops — and the loops do the work. My job is to write loops."
Most developers are still crafting the perfect prompt.
The person who built the tool moved past prompting entirely.
In 30 minutes Boris reveals his actual daily Claude Code setup.
Claude Code + loops + dynamic workflows.
Worth more than any $500 vibe-coding course.
Watch it.
Then read this - everything you need to know about loops to actually apply what he says ↓
Bookmark both. This is your weekend.