Introducing FrontierCode: a coding eval that raises the bar for difficulty & quality. Each task took 40+ hrs of work by leading open-source maintainers.
Models write sloppy code that works but isn’t maintainable. Our eval is first to measure: would you actually merge this code?
@edzitron This thing is blurring stuff together in a way that's pretty inaccurate. I just tell people who start at the company to get hobbies that don't involve computers because I think people who work in tech shouldn't spend all their time doing tech stuff. Touch grass, basically.
Our highest and most urgent national priority should be AI safeguards. The risks of AI weapons, pathogens, mass unemployment, surveillance, and even extinction must not continue to be largely ignored.
I think it is really worth reading this piece on RSI at Anthropic.
There is a bit of navel-gazing, some marketing, and a lot of very sincere beliefs about what Anthropic thinks is likely in the near future of AI that you probably want to be aware of. https://t.co/A5yxryBjHv
We are in the foothills of a takeoff.
This is the most important thing you need to know for setting your cyber defense strategy for the next few years.
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor.
It’s happening faster than we thought, and the implications deserve greater attention. https://t.co/OVVPJO7VQx
This Executive Order is an important step in strengthening America’s leadership in AI.
We look forward to collaborating with the White House to support its implementation.
https://t.co/ZwDimPrp3t
We’re expanding Project Glasswing. We’ve extended access to Claude Mythos Preview to approximately 150 additional organizations, based in more than fifteen countries.
Read more about this expansion and our future plans for Project Glasswing: https://t.co/QrtHSBdRbh
Google Chrome is rolling out device-bound session credentials to all users. Session cookies get cryptographically tied to your device, so stolen cookies can't be replayed from a different machine. Attackers who exfiltrate your cookie database get nothing usable.
Earlier this month, our run-rate revenue crossed $47 billion.
This growth has been driven by organizations across many industries deploying Claude in their core operations, and by a growing number of people using it for their everyday work.
Read more: https://t.co/V1fdqOxQdY
Once upon a time there was an Lead AI Developer who's AI was not getting impressive benchmark results. That evening, all of his neighbors came around to commiserate. They said, "We are so sorry to hear that deep learning is hitting a wall. This is most unfortunate." The Lead Developer said, "Maybe."
The next day the LLM came back bringing seven massive benchmark scores and even got 90% on the LSAT. I the evening everybody came back and said, "Oh, isn’t that lucky. What a great turn of events. You now are really close to AGI!" The Lead AI Developer again said, "Maybe."
The following day his son tried to train the next successor model, and while training it, he found that 10x'ing pre-training compute wasn't giving results anymore. The neighbors then said, "Oh dear, that’s too bad. Deep learning is hitting a wall." and the Lead AI Developer responded, “Maybe.”
The day after, the Lead AI Developer announced they'd achieved breakthrough results by adding inference-time compute, RL scaling, and tool use. The neighbors came around and said, "Oh wow, AGI is soon!" The Lead AI Developer said, "Maybe."
Humanity, created by God in all its grandeur, is today facing a pivotal choice: either to construct a new Tower of Babel or to build the city in which God and humanity dwell together. In Jesus Christ, this humanity in its grandeur becomes the Way, the Truth and the Life, opening the path for each of us to grow toward fullness. #MagnificaHumanitas
https://t.co/6i9MWs6LJl
Last month we launched Project Glasswing, our collaborative AI cybersecurity initiative. Since then, we and our partners have found more than ten thousand high- or critical-severity vulnerabilities in essential software.
VP VANCE: "We also want to make sure that we're protecting people. We're protecting people's data. We're protecting people's privacy. I think with this Mythos release, one of the things that we're very focused on [is] whether some other bad actor could use Mythos to target various cybersecurity vulnerabilities. [...W]e're just trying to make sure that the American people are as safe as possible.
We want to be pro-innovation. We recognize, I mean, artificial intelligence could be great. It could help us find cures to diseases that currently, you know, people are dying from or suffering from. It also does have some downsides. And we're trying to balance that safety against innovation. And we think that we've got the right balance here in the Trump administration, but it's something we're gonna have to keep on working on because that's just the nature of these technologies is, is they certainly change."
One of the most important and under appreciated trends in the world right now.
1. 100s of billions of dollars will soon be available to solve big problems (making the world resilient to ASI, ending factory farming, etc).
2. The projects and organizations which will turn billions of 2027/28 dollars into impact need to be started NOW.
3. We need really talented people to start and run and work for these new projects. What @nanransohoff calls general managers, who feel personally resposible for solving one of the world’s important problems.
What is especially scarce are detailed visions about what making AI go well looks like. These will help inform what problems these new projects ought to work on.
Over the past few months, we've been holding dialogues with scholars, philosophers, clergy, and ethicists on the questions AI raises—starting with how good character forms.
Read more about how we’re widening the conversation on frontier AI: https://t.co/vKGiODEq6q
First week on the @AnthropicAI Frontier Red Team
The speed of AI progress is astounding. We have a real opportunity in front of us to dramatically improve cyber security with AI. I can't think of a better company or team to join at this critical moment in time.
Personal update: I've joined Anthropic. I think the next few years at the frontier of LLMs will be especially formative. I am very excited to join the team here and get back to R&D. I remain deeply passionate about education and plan to resume my work on it in time.