Today, we share a breakthrough on the planar unit distance problem, a famous open question first posed by Paul Erdős in 1946.
For nearly 80 years, mathematicians believed the best possible solutions looked roughly like square grids.
An OpenAI model has now disproved that belief, discovering an entirely new family of constructions that performs better.
This marks the first time AI has autonomously solved a prominent open problem central to a field of mathematics.
someone at ANTHROPIC just showed CLAUDE finding ZERO DAY vulnerabilities in a live conference demo
claude has found zero day in Ghost, 50,000 stars on github, never had a critical security vulnerability in its entire, history...
it found the blind SQL injection in 90 minutes, stole the admin api key, then did the exact, same thing to the linux kernel
AI has solved one of the problems in FrontierMath: Open Problems, our benchmark of real research problems that mathematicians have tried and failed to solve.
See thread for more.
I don't want a printing press.
I don't want clocks in every town.
I don't want ships crossing the Atlantic.
I want a good harvest.
I want my teeth to stay.
I want the plague to stop.
This is legitimately how these people think.
In their imaginary world you can easily transform a data center into social housing, transform rockets into medicine for poor countries.
And more importantly if you do one you can’t do the other. Can’t make anything, only reallocate.
We worked with @Ginkgo to connect GPT-5 to an autonomous lab, so it could propose experiments, run them at scale, learn from the results, and decide what to try next. That closed loop brought protein production cost down by 40%.
for normies, “discoveries” are a finite resource where you put in Science Points and get guaranteed results, instead of a decades-long processes of unconnected research that probably had its origins in something completely unrelated like trying to grow human kidney cells in rats
An advanced version of Gemini with Deep Think has officially achieved gold medal-level performance at the International Mathematical Olympiad. 🥇
It solved 5️⃣ out of 6️⃣ exceptionally difficult problems, involving algebra, combinatorics, geometry and number theory. Here’s how 🧵
This is the craziest photo ever taken. It blows my mind every time I see it
> two brothers, from the middle of nowhere
> testing their flying machine off the coast of nowhere
> achieving a dream man has had for millennia
> despite having no college degree
> one of the five people there happened to have a camera
Today, we’re announcing Veo 2: our state-of-the-art video generation model which produces realistic, high-quality clips from text or image prompts. 🎥
We’re also releasing an improved version of our text-to-image model, Imagen 3 - available to use in ImageFX through @LabsDotGoogle. → https://t.co/zMJQwON4Gx
Introducing Oasis: the first playable AI-generated game.
We partnered with @DecartAI to build a real-time, interactive world model that runs >10x faster on Sohu. We're open-sourcing the model architecture, weights, and research.
Here's how it works (and a demo you can play!):
THAT'S IT? This is what 20 years worth of spent nuclear fuel looks like safely stored at the former Maine Yankee nuclear plant.
`The energy produced from this fuel helped avoid 70 million metric tons of CO2 emissions.
Are we in a new, 3rd paradigm for AI language models?
First, models predicted the most likely next word. Think 2018-2021, for transformer-based language models.
Second, they were rewarded for words that were helpful, harmless, and honest. Think RLHF or RLAIF, 2022-2023.
Now, with the o1 family, they are being rewarded for being objectively correct. Think 2024-???
Breakdown in video below:
Today, I’m excited to share with you all the fruit of our effort at @OpenAI to create AI models capable of truly general reasoning: OpenAI's new o1 model series! (aka 🍓) Let me explain 🧵 1/
SSI is building a straight shot to safe superintelligence.
We’ve raised $1B from NFDG, a16z, Sequoia, DST Global, and SV Angel.
We’re hiring: https://t.co/DmFWnrc1Kr
I often wonder, what exactly is it doing in that time? A three second stall is enough time for ten billion CPU operations. What are they being used for? What ten billion calculations are happening between my right click and the context window displaying?