In short, Anthropic is asking for IOCs-like distribution mechanism controlled by the US govt. What tech exists and who is allowed to share what with whom needs to be essentially controlled.
Anthropic says Recursive Self Improvement is approaching faster than they expected.
Quoting from the blog:
'What should we do?
If it were possible to effectively slow the development of this technology to give ourselves more time to deal with its immense implications, we think that would likely be a good thing. But if a slowdown simply lets the least cautious actors catch up technologically, it could leave everyone less safe. Without a global coordination mechanism, companies and governments will have to make difficult decisions about safety while under competitive and geopolitical pressures.
We believe it would be good for the world to have the option to slow or temporarily pause frontier AI development to enable societal structures and alignment research to keep up with the advance of the technology. The Anthropic Institute will conduct research—in collaboration with many others—and take actions to help build the systems that a credible slowdown or pause would require. These systems would enable frontier AI developers to verify that others globally have actually stopped or slowed, and that a bad actor could not use the auspices of a coordinated slowdown to jump ahead in secret. If such systems existed, we expect that we would slow down or temporarily pause, if other developers at or near the frontier also did so in a verifiable manner.
A meaningful slowdown or pause would require multiple well-resourced labs at or near the frontier, in multiple countries, agreeing to stop under the same conditions. It would also require that each can verify that the others have actually stopped. Due to the unique characteristics of AI systems, the detectability (a lower standard than verifiability) element of this arms control problem is much more challenging than with other technologies. Training runs are far easier to conceal than missile silos, their inputs are general-purpose, and the incentive to defect quietly is enormous, because whoever continues while others pause could inherit the lead. A credible pause also has to specify what triggers it, what lifts it, and who adjudicates.
None of this is necessarily impossible in principle—the world has built verification regimes for other complex technologies (e.g., the Intermediate-Range Nuclear Forces Treaty)—but those regimes took decades to build both the infrastructure and the trust. We don’t have that long. A unilateral pause by one lab, by contrast, is achievable immediately, but accomplishes much less: it would change who the front-runner is, but it would not create the wider deliberative process that is currently missing.
In the coming months, we will organize conversations where policymakers, researchers, civil society, and other AI companies can help answer some of the questions this piece raises, especially around full recursive self-improvement and how to create better options for coordination and deliberation. We’ll publish what comes out of it. The window to investigate the questions together is here, and people outside AI companies should be involved in this deliberation.'
My hunch was correct. Anthropic had been testing Mythos since February 24 -- and this model is literally a beast. I won't be surprised if you can point it an existing medium scale system, and it iteratively builds a faster version of it over few days.
https://t.co/YENvEpRfAA
Devs burning $10K of compute on $200/mo plan are the biggest risk to model providers. Unlike sticky chat users, devs have zero loyalty and will easily migrate the second someone else drops a better coding model.
@RihardJarc Google has become a bureaucratic hell -- but the unique thing about them is the number of knobs they can turn to incrementally squeeze higher revenues each quarter. I expect the stock to double from here but eventually the company will be sold for the parts.
This is simply unethical. While the token limit was reset to 100%, you also pushed back the weekly timer. This means I just effectively lost 3 days because my clock was restarted midweek.
@sama
Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946.
For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids.
An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better.
This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
Gemini TL wants you to derive chincilla laws from first principles before he is gonna talk to you. Imagine the guts when gemini is a distant third behind chatgpt and claude.
Promo maxers lol.
The qos on gemini is so horrendously bad.
You may have your entire quota available but you can't use any because @GeminiApp can't figure out how to serve traffic.
I guess they are diverting compute to another chat app.