Kirill Solodskikh

4 days ago

@googlegemma Add these gemma4 models! Smaller, great quality! https://t.co/tWJqORvDHv

0

50

4 days ago

@ivanfioravanti Cool! Add these smallest Gemma 4 models! https://t.co/aot6GYjZOo

1

4

0

57

4 days ago

@googlegemma

0

111

Who to follow

Automated Enterprise Inference Stack & Research Lab

Daily Moon

@dailymoonMedia

Your Daily Dose Of Crypto News ⚡️ | Tweets Are Not Financial Advice #btc

CRYPTO BULL

@cryptobull78

MAKE MEME GREAT AGAIN

4 days ago

Gemma4 E2B, compressed by @TheStageAI , from 9.3GB to 1.4GB, is running on iPhone 16e with tool calls! The smallest and the best quality checkpoints open-sourced! @GoogleDeepMind

3

22

3

24

135K

4 days ago

@TheStageAI @GoogleDeepMind @Prince_Canuma, you need to add this!

0

1

0

1

133

GarchFather retweeted

4 days ago

The smallest checkpoints for Gemma 4 E2B and E4B for local inference. Results for E2B: size: 9.3 GB → 1.4 GB speed: 113 tok/s on Apple M3 quality: -3% on ifEval runs with: MLX, llama.cpp (coming) Pareto-optimal, open source! Links to the blog post and GitHub repo ⬇️ @GoogleDeepMind @lmstudio @ollama @huggingface @ggerganov

TheStageAI's tweet photo. The smallest checkpoints for Gemma 4 E2B and E4B for local inference. Results for E2B:

size: 9.3 GB → 1.4 GB
speed: 113 tok/s on Apple M3
quality: -3% on ifEval
runs with: MLX, llama.cpp (coming)

Pareto-optimal, open source! Links to the blog post and GitHub repo ⬇️

@GoogleDeepMind @lmstudio @ollama @huggingface @ggerganov

2

17

5

10

140K

GarchFather retweeted

20 days ago

Try it yourself, https://t.co/6lPd1abudw

0

5

2

0

299

GarchFather retweeted

Recraft

@recraftai

23 days ago

Say hello to V4.1 This model is built for images that captivate you. Photorealism is more human, gradients are dreamier, and new illustration styles are now possible. Test it out in Recraft Studio today and see what you can create.

38

560

54

327

3M

23 days ago

@recraftai Very cool quality!! We are using Recraft studio!

0

4

0

577

25 days ago

@Nick_Davidov Thank you for your permanent support, Nick!

0

1

0

19

GarchFather retweeted

25 days ago

TheStage AI Platform is now open to everyone. Automatically accelerate your models and download them to run in the cloud or on smartphones.

34

149

29

39

4M

about 2 months ago

@TheStageAI @Beyonce , 😂

0

1

0

29

about 2 months ago

@HarryStebbings @antonosika AGI, $100B investment, Safe AGI, $500B investment, Very Aligned Safe AGI, $1T investment. If AGI becomes smarter than humans, how soon will it create something even smarter than itself?

0

932

GarchFather retweeted

about 2 months ago

Beyoncé heard cursing. TheWhisper heard Arsenal. The fastest Whisper in the world. Open-source real-time ASR. Top 5 on OpenASR benchmarks. 1800 RTFx. Built for live captions, transcription, and voice apps. See the repo

4

179

19

32

3M

about 2 months ago

Self-hosted AGI starts with inference infra teams can actually run. Well. Elastic Models v0.2.0 is much more self-serve: world’s fastest whisper-large-v3-turbo, Wan2.2 generating 5s of video in 34s on H100, and instant FLUX LoRA switching. Explore v0.2.0

1

6

0

4

221

2 months ago

Actually, comparing 1-bit with 16-bit has no sense. Everyone is using 4-bit weights with MLX. And the speed will be around 150-180 tok/s on M4 Pro. Moreover, 4-bit quantization in MLX can be done as block quantization what preserve quality for the most cases.

PrismML @PrismML

2 months ago

1-bit Bonsai 8B running locally on an M4 Pro (MLX) alongside a standard 16-bit 8B model. Same class of model, very different deployment profile: far lower memory use and substantially higher throughput.

12

442

22

133

93K

0

4

0

184

2 months ago

Open-source experiments dashboard for AI researchers. Cool comparison overlays across modalities. What add next? S3 integration, authentication, model registry? https://t.co/FMJTsq21Pf

GarchFather's tweet photo. Open-source experiments dashboard for AI researchers. Cool comparison overlays across modalities. What add next? S3 integration, authentication, model registry?

https://t.co/FMJTsq21Pf https://t.co/xJ7CxuUC6g

0

3

0

132