I created a silly 3 minute animated comedy short about an Alien trying Lemons for the first time, breakdown here: https://t.co/LsXx4uStWo Made mostly with SeaDance 2.0, Pollo AI, Codex, GPT Image 2, ChatGPT, ElevenLabs, and way too much timeline stitching.
Today, we’re launching Reve 2.0, the best 4K image model in the world.
We invented a new way to generate and edit any image using precise layouts. For the first time, it’s possible to create images you can touch.
BREAKING: Ideogram 4.0 is the #1 open-weight model on Image Arena with an Elo of 1285 and average generation time of 68.7 seconds.
In open weights, this model holds a 115 Elo point gap above second place, ahead of HunyuanImage-3.0 by @TencentHunyuan and FLUX.2 [dev] by @bfl_ai. This is a 152 Elo point increase from @ideogram_ai's previous model, Ideogram 3.0, placing it in the same performance band as Gemini 3.0 Pro Image Gen 2k and Gemini 3.1 Flash Image Gen by @GoogleDeepmind.
Ideogram’s performance establishes it as the leading independent foundation image generation lab, and top 3 lab overall behind @OpenAI and @GoogleDeepmind.
Huge congratulations to the @ideogram_ai team on the launch!
Miso One is live: an open-weights voice model built to sound like a real person reading, with actual warmth and pacing where most TTS still goes flat.
8B params, free on GitHub, with one-shot voice cloning from a short sample at 110ms latency.
Self-host it and your audio data never leaves your machine. No API needed, no lock-in.
Type any line into the demo and hear it before you clone the repo.
Meet Gemma 4 12B!
A unified, encoder-free multimodal model designed to bring high-performance intelligence directly to your laptop, and released under an Apache 2.0 license.
Bridging the gap between edge efficiency and advanced reasoning. Here is what’s new with Gemma 4 12B: 👇
Introducing Ideogram 4.0: the best open image model in the world.
Think it. Make it. Own it.
Download the weights, fine-tune on your own data, and run it on your hardware. Live on every Ideogram plan and the API today.
Building apps has never been easier.
With Sites, Codex can turn your work, ideas, and plans into an interactive website or app your team can explore, use, and share with a URL.
Rolling out to Business and Enterprise plans, before expanding more broadly.
🚨 GPT-5.5-Codex Spark Spotted
They hinted at the fact that tomorrow is a HUGE release date 👀
one more thing they are hyping news stuff , tweet was deleted by openai employee check tweet in the comment
Introducing Cosmos 3: Our latest frontier model for Physical AI
Cosmos 3 is the world’s first fully open omnimodel with native vision reasoning, world and action generation.
Today we’re releasing Super (32B) and Nano (8B) variants.
Windows users, this one’s for you.
Computer use now works on Windows, so Codex can take action on your Windows computer.
And with Windows support for Codex in the ChatGPT mobile app, you can start, review, and steer tasks on the go while work continues on your Windows machine.
An early experience, but we’re working on more ways to keep your work moving, wherever you are.
Damn. I haven't tried 4.8 yet - but this was my conclusion after using Gemini 3.5, and now 4.8 is giving the same for some in the community.
Spud is just built different?? Or are we waiting on something else?
@BenjaminDEKR nah ur valid bro. AI probably does not “understand” in the human, lived, embodied, morally accountable sense.
But AI does possess a functional, predictive, world model type understanding that is increasingly hard to dismiss as “mere parroting.” It does plenty of real work.
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors.
Available today at the same price.
Claude Opus 4.8 Fast just created the best Remotion video I have ever seen.
One shot. No revisions.
A complete BridgeSpace marketing video generated from a single prompt.
Grok 4.3 took 25 minutes just to plan a Remotion video and the output was garbage.
Claude Opus 4.8 one shotted a better version instantly.
This model is on a completely different level.
today is (potentially) a great day for the GPU poors
if DiffusionBlocks works on fine-tuning existing models, then literally any reasonable consumer GPU can do LLM fine-tuning
will make a video on this