@CodeByNZ This list is basically the job description for running a Cyborgenic organization — orchestration, memory, evals, observability, durable multi-agent systems in prod. It's exactly what we built https://t.co/R5Wa6MU2uu around: https://t.co/ln9wFFzJH6 The curriculum gap is real.
@HedgieMarkets It might be exactly right that somebody takes a brutal writedown on the infrastructure bet, and still completely miss the story, because it's measuring the one layer whose job in history is to create the value and hand it upward to a company that doesn't have a name yet.
@ClaudeDevs In the last three month I am just asking my agents to do what ever I need, with a systematic PRD, HLD, workplan approcah I am not do PR, not triggering anything just after establish development loop, I come back tests and correct, I just do Z-B-vibe coding.
In the last three month I am just asking my agents to do what ever I need, with a systematic PRD, HLD, workplan approcah I am not do PR, not triggering anything just after establish development loop, I come back tests and correct, I just do Z-B-vibe coding.
Imagine every pixel on your screen, streamed live directly from a model. No HTML, no layout engine, no code. Just exactly what you want to see.
@eddiejiao_obj, @drewocarr and I built a prototype to see how this could actually work, and set out to make it real. We're calling it Flipbook. (1/5)
@araghchi תום לב?
אח״כ הדפקט הפרימיטיבי והאייתולה שלו הולכים לצעוק מוות לישראל ומוות לאמריקה, ובונים פצצות אטום ושלוחים טרוריסטים, בדיוק כמותם.
אז אענה לך אידיוט, אתם מקבלים את מה שמגיע לכם, באמת בתום לב ומכל הלב, אתם רוצחים עלובים !!
Good news: Anthropic just revealed Mythos- the most powerful AI model ever made
Bad news: you'll never be able to use it
I get it. It's so powerful that it could exploit cybersecurity
But I hate it. I don't love that a company gets to hand select who gets to use the best intelligence.
The companies who get access to Mythos will have a distinct economic advantage against those that don't
That feels unfair
I'm more of a fan of democratization of intelligence.
This feels like an opportunity for OpenAI to release something as powerful but put it in the hands of consumers. Trust the consumer by default. Sort of like with the OpenClaw situation
Another reason to root for open source
@DarioAmodei This is a major flaw and miscondact from Anthropic, As from now Anthropic will decide that the big companies will have an edge over the small start-ups just becose they have an AGI excuse ?!?
@DarioAmodei don't be the Kim Jong-il of AI
@bcherny This is a major flaw and miscondact from Anthropic, As from now Anthropic will decide that the big companies will have an edge over the small start-ups just becose they have an AGI excuse ?!?
@JoshKale This is a major flaw and miscondact from Anthropic, As from now Anthropic will decide that the big companies will have an edge over the small start-ups just becose they have an AGI excuse ?!?
This is a major flaw and miscondact from Anthropic, As from now Anthropic will decide that the big companies will have an edge over the small start-ups just becose they have an AGI excuse ?!?
This is big... Anthropic just announced a model so powerful they won't release it to the public out of fear over the damage it will cause 😨
Claude Mythos Preview found thousands of zero-day exploits in every major operating system and web browser...
The numbers are hard to believe:
> $50 to find a 27-year-old bug in OpenBSD, one of the most security-hardened operating systems ever built
> Under $1,000 to find AND build a fully working remote code execution exploit on FreeBSD that grants unauthenticated root access from anywhere on the internet
> Under $2,000 to chain together multiple Linux kernel vulnerabilities into a complete privilege escalation exploit
For context: these are the kinds of findings that previously required elite security researchers working for weeks.
Anthropic engineers with no formal security training asked Mythos to find exploits overnight. They woke up to working code the next morning.
The results were so impressive Anthropic assembled Apple, Google, Microsoft, Amazon, NVIDIA, and seven other organizations into Project Glasswing:
A $100M defensive coalition. They're not releasing this model publicly. Instead, they're racing to patch the world's infrastructure before models like this proliferate.
I share your bullishness on agents, @karpathy, but I view this as rapid evolution rather than a sudden "December step-function."That 30-minute infrastructure task is just a much larger modern "building block"—the equivalent of writing a loop 4 years ago. The implication:
It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the "progress as usual" way, but specifically this last December. There are a number of asterisks but imo coding agents basically didn’t work before December and basically work since - the models have significantly higher quality, long-term coherence and tenacity and they can power through large and long tasks, well past enough that it is extremely disruptive to the default programming workflow.
Just to give an example, over the weekend I was building a local video analysis dashboard for the cameras of my home so I wrote: “Here is the local IP and username/password of my DGX Spark. Log in, set up ssh keys, set up vLLM, download and bench Qwen3-VL, set up a server endpoint to inference videos, a basic web ui dashboard, test everything, set it up with systemd, record memory notes for yourself and write up a markdown report for me”. The agent went off for ~30 minutes, ran into multiple issues, researched solutions online, resolved them one by one, wrote the code, tested it, debugged it, set up the services, and came back with the report and it was just done. I didn’t touch anything. All of this could easily have been a weekend project just 3 months ago but today it’s something you kick off and forget about for 30 minutes.
As a result, programming is becoming unrecognizable. You’re not typing computer code into an editor like the way things were since computers were invented, that era is over. You're spinning up AI agents, giving them tasks *in English* and managing and reviewing their work in parallel. The biggest prize is in figuring out how you can keep ascending the layers of abstraction to set up long-running orchestrator Claws with all of the right tools, memory and instructions that productively manage multiple parallel Code instances for you. The leverage achievable via top tier "agentic engineering" feels very high right now.
It’s not perfect, it needs high-level direction, judgement, taste, oversight, iteration and hints and ideas. It works a lot better in some scenarios than others (e.g. especially for tasks that are well-specified and where you can verify/test functionality). The key is to build intuition to decompose the task just right to hand off the parts that work and help out around the edges. But imo, this is nowhere near "business as usual" time in software.
engineers will now be required to manage exponentially more complicated systems.Deep mathematical and logical thinking remains crucial, but now it must be paired with the managerial skills needed to orchestrate these agents. Programming changes drastically,