DeepInfra @deepinfra - Twitter Profile

DeepInfra

@DeepInfra

about 17 hours ago

Cosmos 3 Nano: https://t.co/th8eULvuGe

0

1

0

357

DeepInfra

@DeepInfra

about 17 hours ago

NVIDIA Cosmos 3 is live on DeepInfra. The first open world foundation model for physical AI that reasons before it generates. Built for robots, AVs, simulation, synthetic data generation.

1

5

1

0

590

DeepInfra retweeted

X Freeze

@XFreeze

5 days ago

Entire world: We need more GPUs Meanwhile, Jensen Huang:

505

13K

699

826

1M

DeepInfra retweeted

MiniMax (official) @MiniMax_AI

3 days ago

Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas - MiniMax Sparse Attention scales context to 1M - Natively Multimodal from Step Zero API: https://t.co/fHRdSV7BwZ Token Plan: https://t.co/BDCycxepZw 🚀New! MiniMax Code: https://t.co/GvB4YiB6Ul Weights & Tech Report in ~10 Days

MiniMax_AI's tweet photo. Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities

- Coding & Agentic Frontier: 59.0% SWE-Bench Pro, 66.0% Terminal Bench 2.1, 34.8% SWE-fficiency, 28.8% KernelBench Hard, 74.2% MCP Atlas
- MiniMax Sparse Attention scales context to 1M
- Natively Multimodal from Step Zero

API: https://t.co/fHRdSV7BwZ
Token Plan: https://t.co/BDCycxepZw
🚀New! MiniMax Code: https://t.co/GvB4YiB6Ul

Weights & Tech Report in ~10 Days

531

8K

1K

3K

3M

Who to follow

Ben Klayman

@benklayman

Former automotive journalist.

Yessen K

@yessenzhar

Co-founder @DeepInfra, ex @imoim

Northeastern CSSH

@NUCSSH

Northeastern University's College of Social Sciences and Humanities (CSSH) https://t.co/YrNzOsuJOX

DeepInfra

@DeepInfra

3 days ago

We are really excited about Nemotron 3 Ultra.

Artificial Analysis

@ArtificialAnlys

3 days ago

NVIDIA just announced the release of Nemotron 3 Ultra in Jensen Huang's Computex keynote: at 550B parameters (55B active), this is the largest Nemotron 3 model to date, and it is the most intelligent US open weights model We partnered with @nvidia to evaluate this model for intelligence and speed - these figures use the model’s BF16 weights, but as with Nemotron 3 Super the model will be made available in NVFP4 quantization as well for higher inference performance. ➤ New leader for US open weights intelligence: Nemotron 3 Ultra scores 48 on the Artificial Analysis Intelligence Index. This is well ahead of the next strongest US open weights models, Gemma 4 31B (39), Nemotron 3 Super (36) and gpt-oss-120b (33), but behind the Chinese-led open weights frontier (Kimi K2.6 at 54). ➤ Leading speed for its intelligence: on a pre-release @DeepInfra endpoint, Nemotron 3 Ultra served over 300 tokens per second. Peer models in its size class from China-based labs such as DeepSeek and Moonshot (Kimi) are generally served at speeds of 50-100 tokens per second in the market today. gpt-oss-120b is served at speeds similar to this level, but with significantly lower intelligence. ➤ Largest Nemotron 3 model so far: at approximately 550 billion total parameters and 90% sparsity, Nemotron 3 Ultra is significantly larger than its siblings and is the largest recent US open weights model release We’ll be sharing additional analysis and full benchmarks at release.

ArtificialAnlys's tweet photo. NVIDIA just announced the release of Nemotron 3 Ultra in Jensen Huang's Computex keynote: at 550B parameters (55B active), this is the largest Nemotron 3 model to date, and it is the most intelligent US open weights model

We partnered with @nvidia to evaluate this model for intelligence and speed - these figures use the model’s BF16 weights, but as with Nemotron 3 Super the model will be made available in NVFP4 quantization as well for higher inference performance.

➤ New leader for US open weights intelligence: Nemotron 3 Ultra scores 48 on the Artificial Analysis Intelligence Index. This is well ahead of the next strongest US open weights models, Gemma 4 31B (39), Nemotron 3 Super (36) and gpt-oss-120b (33), but behind the Chinese-led open weights frontier (Kimi K2.6 at 54).

➤ Leading speed for its intelligence: on a pre-release @DeepInfra endpoint, Nemotron 3 Ultra served over 300 tokens per second. Peer models in its size class from China-based labs such as DeepSeek and Moonshot (Kimi) are generally served at speeds of 50-100 tokens per second in the market today. gpt-oss-120b is served at speeds similar to this level, but with significantly lower intelligence.

➤ Largest Nemotron 3 model so far: at approximately 550 billion total parameters and 90% sparsity, Nemotron 3 Ultra is significantly larger than its siblings and is the largest recent US open weights model release

We’ll be sharing additional analysis and full benchmarks at release.

41

936

124

211

88K

0

5

0

471

DeepInfra retweeted

Supermicro

@Supermicro

20 days ago

CEO Charles Liang Keynote @ Supermicro Innovate!/COMPUTEX

48

1K

104

213

14M

DeepInfra retweeted

NVIDIA AI

@NVIDIAAI

3 days ago

Nemotron 3 Ultra is coming this week. ⌛️

106

3K

359

469

382K

DeepInfra

@DeepInfra

21 days ago

The right question, and one too few enterprises are asking. Thanks @realmtbman and @palebluenexus for having our co-founder @nikolaborisof on. Full episode: https://t.co/AZMuaTllzq

Yohann Calpu

@realmtbman

22 days ago

Enterprises ask "is your AI compliant?" The better question: who actually runs the inference? Nikola Borisov, co-founder of @DeepInfra ($107M Series B raise - including NVIDIA) on @palebluenexus: "You want to make sure you're not giving it to someone that will give it to someone that will give it to someone. And maybe the final inference happens in China."

1

2

1

0

2K

0

5

3

1

1K

DeepInfra

@DeepInfra

22 days ago

Apple: https://t.co/3M983dFoem

0

234

DeepInfra

@DeepInfra

22 days ago

"I wasn't sure what we'd build. I just wanted to work with my co-founders. We ended up deciding to do AI infrastructure. It was a great choice." Our CEO @nikolaborisof on Scaling Without Breaking podcast: why the team came before the idea. https://t.co/uGCMuPavaf Check it out on more platforms👇

1

4

0

1

649

DeepInfra retweeted

Inworld AI

@inworld_ai

30 days ago

Introducing Realtime TTS-2, a new generation of voice model built for realtime conversation. It is the first voice model that hears the conversation, takes natural-language voice direction, holds one voice identity across over 100 languages, and speaks like a person who is paying attention. The result is voice AI that feels as good as it sounds. Try it out: https://t.co/80xL7AJveV Learn More: https://t.co/PLUiAEFizP

106

782

163

776

322K

DeepInfra

@DeepInfra

30 days ago

Try it here: https://t.co/XAolgrszJf

1

2

0

274

DeepInfra

@DeepInfra

30 days ago

New on DeepInfra: Realtime TTS 2.0 from @inworld_ai • Prompt emotion + tone in plain English • Cross-lingual voices • Built for realtime apps $35 / 1M characters

1

5

1

0

741

DeepInfra

@DeepInfra

about 1 month ago

https://t.co/bwu0WihD7c

1

7

1

0

618

DeepInfra

@DeepInfra

about 1 month ago

DeepInfra has raised its $107M in Series B funding 🚀 AI is moving from training to production-scale deployment, and inference is becoming the system constraint. DeepInfra was built for this shift — scaling high-throughput inference for open-source and agent-driven workloads. Grateful to our investors and partners, co-led by @500GlobalVC and @gharik

DeepInfra's tweet photo. DeepInfra has raised its $107M in Series B funding 🚀

AI is moving from training to production-scale deployment, and inference is becoming the system constraint.

DeepInfra was built for this shift — scaling high-throughput inference for open-source and agent-driven workloads. Grateful to our investors and partners, co-led by @500GlobalVC and @gharik

8

54

10

12

12K

DeepInfra

@DeepInfra

about 1 month ago

More here: https://t.co/WCGWM7usUT

0

5

0

437

DeepInfra

@DeepInfra

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users