BREAKING 🚨: Anthropic is preparing to release new models, Mythos and Capybara, where Mythos is a completely new tier of models, bigger than Opus.
In the blog post, Anthropic also highlights that this model brings significant cybersecurity risks due to its capabilities.
Thrilled to announce Claude Code auto-fix – in the cloud. Web/Mobile sessions can now automatically follow PRs - fixing CI failures and addressing comments so that your PR is always green.
This happens remotely so you can fully walk away and come back to a ready-to-go PR.
When I built menugen ~1 year ago, I observed that the hardest part by far was not the code itself, it was the plethora of services you have to assemble like IKEA furniture to make it real, the DevOps: services, payments, auth, database, security, domain names, etc...
I am really looking forward to a day where I could simply tell my agent: "build menugen" (referencing the post) and it would just work. The whole thing up to the deployed web page. The agent would have to browse a number of services, read the docs, get all the api keys, make everything work, debug it in dev, and deploy to prod. This is the actually hard part, not the code itself. Or rather, the better way to think about it is that the entire DevOps lifecycle has to become code, in addition to the necessary sensors/actuators of the CLIs/APIs with agent-native ergonomics. And there should be no need to visit web pages, click buttons, or anything like that for the human.
It's easy to state, it's now just barely technically possible and expected to work maybe, but it definitely requires from-scratch re-design, work and thought. Very exciting direction!
To manage growing demand for Claude we're adjusting our 5-hour session limits for free/Pro/Max subs during peak hours. Your weekly limits remain unchanged.
During weekdays between 5am–11am PT / 1pm–7pm GMT, you'll move through your 5-hour session limits faster than before.
The Islamic Resistance in Iraq has released footage of a surgical FPV drone strike on Camp Victoria, Baghdad. The drone navigates directly into the heart of the facility.
By first destroying the AN/MPQ-64 Sentinel radar, the Resistance blindfolded the occupation’s SHORAD network. The subsequent strike on the UH-60M Black Hawk proves that without radar cover, even the US military's most advanced transport assets are defenseless.
This operation is the blueprint for the new reality: low-cost, high-precision FPVs systematically blinding and grounding the US occupation in Iraq.
On Friday we revealed the Companies House vulnerability letting anyone access the private dashboard of any UK company.
This is the moment I first saw it demonstrated. My reaction says it all.
What do we know? What don't we know? What should companies do now?
Pi is the most interesting agent harness.
Tiny core, able to write plugins for itself as you use it. It RLs itself into the agent you want.
I was missing cc’s tasks system and told it to spawn claude in tmux and interrogate it about it and make an implementation for itself. It nailed it, including the UX.
Clawdbot is based on it and now it makes sense why it feels so magical. Dawn of the age of malleable software.
Out now: Teams, aka Agent Swarms, in Claude Code
Teams are experimental, and use a lot of tokens. See the docs for how to enable, and let us know what you think! https://t.co/qkWzJJYiXH
🧵 THREAD: A federal whistleblower just dropped one of the most disturbing cybersecurity disclosures I’ve ever read.
He's saying DOGE came in, data went out, and Russians started attempting logins with new valid DOGE passwords.
Media's coverage wasn't detailed enough so I dug into his testimony: