Attention Toronto ‼️
I’m hosting a 3-on-3 basketball tournament where startups will compete to prove who has the best squad in the city
Comment and I will send you the link to apply
The best way to bring the composition from your head into an image → Ideogram V4 + drawing bounding boxes in Comfy.
The control here is quite unique. The model uses structured JSON so drawing bounding boxes to get the exact placement works very well.
The model only needs 12 steps (turbo), so iterating with different seeds is fast. Add the very impressive text rendering and I'd say this is a state-of-the-art open-source image model right now.
Using the 'Ideogram 4 Prompt Builder' node by Kijai.
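To make the bounding-box idea concrete, here is a minimal sketch of turning drawn boxes into a structured JSON prompt. The schema is hypothetical: the key names (`caption`, `regions`, `description`, `bbox`) and the normalized `[x0, y0, x1, y1]` convention are assumptions for illustration, not the actual format used by Ideogram or by the Prompt Builder node.

```python
import json

def normalize_box(x, y, w, h, canvas_w, canvas_h):
    """Convert a pixel box (top-left x, y, width, height) to
    [x0, y0, x1, y1] expressed as fractions of the canvas size."""
    return [x / canvas_w, y / canvas_h, (x + w) / canvas_w, (y + h) / canvas_h]

def build_layout_prompt(caption, regions, canvas_w=1024, canvas_h=1024):
    """Serialize a caption plus per-region descriptions and boxes.

    HYPOTHETICAL schema: the key names are illustrative assumptions,
    not the real model or node format.
    """
    payload = {
        "caption": caption,
        "regions": [
            {"description": desc, "bbox": normalize_box(*box, canvas_w, canvas_h)}
            for desc, box in regions
        ],
    }
    return json.dumps(payload, indent=2)

# Two boxes drawn on a 1024x1024 canvas, given as (x, y, width, height) in pixels.
prompt = build_layout_prompt(
    "a storefront at dusk",
    [
        ("neon sign reading 'OPEN'", (256, 128, 512, 256)),
        ("red bicycle leaning on the wall", (64, 640, 320, 320)),
    ],
)
print(prompt)
```

Normalizing to fractions keeps the layout independent of output resolution, so the same boxes can be reused while changing seeds or image size.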
In the Image Arena: open-weight Text-to-Image has a clear leader, with a tight race directly behind it:
- #1 Ideogram-4.0 Quality has set the pace this week with a score of 1204. @ideogram_ai
- #2 Hunyuan Image 3.0 by @TencentHunyuan with a score of 1151, just +1 pt ahead of Flux-2 Dev @bfl_ai at #3.
- #4 Qwen Image 2512 by @Alibaba_Qwen and #5 HiDream-O1 Image @HiDream_AI complete the top five, scoring 1128 and 1124.
The top six each come from a different lab, while Flux and Qwen have the greatest depth across the Top 15.
Today we published a technical blog post about Ideogram 4.0 — our goal is to enable more innovation and creativity.
It's a 9.3B Diffusion Transformer trained from scratch, paired with a frozen 8B VLM as text encoder. The nf4 checkpoint runs on a 24GB consumer GPU.
Thread 🧵
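A rough back-of-the-envelope check of the 24GB claim, as a sketch only. It counts weight storage alone and assumes 4 bits per weight for nf4 and 16 bits for bf16. The post does not say what precision the 8B text encoder runs at, so both cases are shown as assumptions; quantization metadata, activations, and any other components are ignored.

```python
def weight_gb(params_billion, bits_per_weight):
    """Approximate weight storage in GB (1 GB = 1e9 bytes).
    Ignores quantization constants, activations, and runtime overhead."""
    return params_billion * bits_per_weight / 8

dit_nf4 = weight_gb(9.3, 4)        # 9.3B diffusion transformer at 4 bits per weight
encoder_nf4 = weight_gb(8.0, 4)    # ASSUMPTION: text encoder also quantized to 4 bits
encoder_bf16 = weight_gb(8.0, 16)  # ASSUMPTION: text encoder kept at 16 bits

for label, total in [
    ("DiT nf4 + encoder nf4", dit_nf4 + encoder_nf4),
    ("DiT nf4 + encoder bf16", dit_nf4 + encoder_bf16),
]:
    print(f"{label}: {total:.2f} GB of weights vs. a 24 GB card")
```

Under either assumption the weights alone come in below 24 GB, which is consistent with the claim, though the real headroom depends on activations and offloading choices.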
Fun fact about Ideogram 4.0: we haven't even scaled the model yet.
We're hiring! My team and I are at @CVPR for 3 days. DM me, swing by the booth, or come to our happy hour tonight: https://t.co/iOlm7PXYyq
Ryan Dahl is the most cracked engineer I know. He started Node.js and is now the CEO of @deno_land. He spent a year doing research with me and Jon Shlens, and in 2017 we had a hunch that generative models are good for image upscaling.
Introducing Ideogram 4.0: the best open image model in the world.
Think it. Make it. Own it.
Download the weights, fine-tune on your own data, and run it on your hardware. Live on every Ideogram plan and the API today.