TheStage AI @TheStageAI - Twitter Profile

Pinned Tweet

25 days ago

TheStage AI Platform is now open to everyone. Automatically accelerate your models and download them to run in the cloud or on smartphones.

34

149

29

39

4M

TheStageAI retweeted

Kirill Solodskikh

@GarchFather

4 days ago

Gemma 4 E2B, compressed by @TheStageAI from 9.3 GB to 1.4 GB, is running on iPhone 16e with tool calls! The smallest and best-quality checkpoints, open-sourced! @GoogleDeepMind

3

22

3

24

134K

TheStage AI

@TheStageAI

4 days ago

Blog post: https://t.co/6gJMCWRXAo

0

5

0

68

TheStage AI

@TheStageAI

4 days ago

The smallest checkpoints for Gemma 4 E2B and E4B for local inference. Results for E2B:

size: 9.3 GB → 1.4 GB
speed: 113 tok/s on Apple M3
quality: -3% on IFEval
runs with: MLX, llama.cpp (coming)

Pareto-optimal, open source! Links to the blog post and GitHub repo ⬇️

@GoogleDeepMind @lmstudio @ollama @huggingface @ggerganov

2

17

5

10

140K

TheStage AI

@TheStageAI

4 days ago

Github: https://t.co/guNh6UNdOw

1

5

1

0

84

TheStage AI

@TheStageAI

17 days ago

@billyG881 Please wait a few days for the announcement 😀

0

1

0

14

TheStage AI

@TheStageAI

3 months ago

Proud to team up with @brilliantlabsAR and @neuphonicspeech on Halo’s on-device privacy engine. Coming to Brilliant Labs’ Halo smart glasses: real-time voice + vision, POV stays private. ANNA + GPU/NPU SDK + memory manager for wake word, STT, TTS, diarization. SDK demo 👇

6

25

9

6

2K

TheStage AI

@TheStageAI

17 days ago

@sebuzdugan It's not just an idea; the team is already applying it to model compression. You can check some benchmarks for compressed models here: https://t.co/vGOFcsBXrA This month we are releasing a lot of benchmarks and an ablation study.

0

99

TheStage AI

@TheStageAI

20 days ago

Try it yourself: https://t.co/6lPd1abudw

0

5

2

0

298

TheStage AI

@TheStageAI

22 days ago

@Liqui_Sniper Yes, we are adding onboarding templates. They will be released next week.

0

315

TheStage AI

@TheStageAI

about 2 months ago

@Nau__One We have local engines, so it can run fully on-device. We also provide ready-to-go containers for inference on your GPUs. We are SOC 2 compliant, and you can easily scan the container for vulnerabilities.

0

1

0

98

TheStage AI

@TheStageAI

about 2 months ago

Beyoncé heard cursing. TheWhisper heard Arsenal. The fastest Whisper in the world. Open-source real-time ASR. Top 5 on OpenASR benchmarks. 1800 RTFx. Built for live captions, transcription, and voice apps. See the repo

4

179

19

32

3M

TheStage AI

@TheStageAI

about 2 months ago

@Beyonce heard cursing. TheWhisper heard @Arsenal. Fastest open-source real-time ASR in the world. Top 5 on OpenASR. 1800 RTFx. Built for live captions, transcription, and voice apps. See the repo

0

2

0

75

TheStage AI

@TheStageAI

about 2 months ago

@Alacritic_Super Exactly!!!

0

1

0

450

TheStage AI

@TheStageAI

about 2 months ago

For AI engineers, latency is product. Wan 2.2 in Elastic Models now generates 5s of video in 34s on H100. Elastic Models is a library of accelerated open-source models. Also new: TheWhisper at 1800 RTFx on a single H100 and instant FLUX LoRA switching. Try it

14

572

42

70

8M

TheStage AI

@TheStageAI

about 2 months ago

@TvShowAU188166 Just follow these usage instructions: https://t.co/tauz3o97d6

0

1

0

398

TheStage AI

@TheStageAI

2 months ago

@Wendy_WendyU @brilliantlabsAR @neuphonicspeech 😎

0

1

0

13

TheStage AI

@TheStageAI

2 months ago

@DnuLkjkjh @brilliantlabsAR @neuphonicspeech The NPU is used not only for VAD; it's also used for transcription and, partially, for TTS. We are using heterogeneous inference to deliver the best speed and lowest power consumption.

0

2

0

18

TheStage AI

@TheStageAI

2 months ago

@billyG881 @brilliantlabsAR @neuphonicspeech Thank you! The SDK release is coming!

1

0

37

TheStage AI

@TheStageAI

2 months ago

@dimqtdl @brilliantlabsAR @neuphonicspeech Thank you!

0

2

0

14

TheStage AI

@TheStageAI

3 months ago

How do you make text-to-music run in real time in production? The model has to keep audio generation ahead of playback. Our new case study with @MireloAI shows how inference optimization delivered up to 2.4x higher throughput. See the full case study ↓

0

8

4

5

377
