GPT-5.3, codenamed "Garlic" 🧄 is released on Thursday, Feb. 26th.
It surpasses human baseline on SimpleBench of 83.7%.
In fact, it blows every previous model out of the water on all non-coding benchmarks.
Word has it is a *HUGE* leap.
A GPT-3 to GPT-4 moment again.
OpenAI has long had the best RL/post-training pipeline, which makes sense since they were the first lab to train LLMs for inference time reasoning using RL (o1).
Now they've got their mojo back when it comes to pretraining too (Mark Chen, Chief Researcher, alluded to this on Ashlee Vance's podcast last year).
Public comments from sama also point in the direction of major progress.
This could be the big one.
It may be deserving of a major version bump.
That's my prediction.
Why is everything so slow in antigravity? I waited an hour for 3.1 to process the request, even though I sent the same request on the site, attached the project as a zip, and it took just a couple minutes
Google team, please add an option to use a custom API key for Gemini 3.1 Pro in Antigravity. It is frustrating to wait for hours when you are in the middle of a flow.