Atomic Local Ai Agent

25 days ago

New Google Gemma 4 12B claims near-26B performance - we tested both! We ran both models locally on one RTX 4090 and gave each the same task: write a self-contained HTML5 canvas animation with real physics, in one file, without libraries. Three scenes: a Galton board, two blocks colliding off a wall, and a chaotic triple pendulum.

Outputs:
Gemma 4 26B-A4B: 15 GB VRAM usage, 6.9k tokens, 138 tok/s
Gemma 4 12B: 9 GB VRAM usage, 8.9k tokens, 80 tok/s

Same Gemma 4 family, but the 26B-A4B won every scene and ran ~1.7x faster - on just 4B active params. The 12B stayed very close though, on 9 GB of VRAM instead of 15 GB - which makes it the ideal model for a 16 GB laptop.
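The ratios quoted in this post can be rechecked from its raw numbers. The following is a minimal sketch, assuming the rounded token counts (6.9k and 8.9k) and a constant token rate during generation; the dictionary keys and the helper `generation_seconds` are illustrative names, not something from the original post.

```python
# Figures as quoted in the post (one RTX 4090, same prompt for both models).
# Token counts are the rounded values from the post, so results are approximate.
runs = {
    "Gemma 4 26B-A4B": {"vram_gb": 15, "tokens": 6900, "tok_per_s": 138},
    "Gemma 4 12B": {"vram_gb": 9, "tokens": 8900, "tok_per_s": 80},
}


def generation_seconds(tokens: int, tok_per_s: float) -> float:
    """Approximate generation time, assuming a constant token rate."""
    return tokens / tok_per_s


large = runs["Gemma 4 26B-A4B"]
small = runs["Gemma 4 12B"]

# How much faster the 26B-A4B generates tokens than the 12B.
speed_ratio = large["tok_per_s"] / small["tok_per_s"]

# Fraction of the 26B-A4B's VRAM that the 12B needs.
vram_ratio = small["vram_gb"] / large["vram_gb"]

for name, run in runs.items():
    seconds = generation_seconds(run["tokens"], run["tok_per_s"])
    print(f"{name}: ~{seconds:.0f} s to generate {run['tokens']} tokens")

print(f"speed ratio: {speed_ratio:.2f}x, VRAM ratio: {vram_ratio:.0%}")
```

With these inputs the speed ratio comes out at about 1.7x, the 12B uses 60% of the VRAM of the 26B-A4B, and the estimated generation times are roughly 50 s and 111 s.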



AtomicNodes retweeted

Building @AtomicBot_ai (Local AI Agent 🦞) and @atomic_chat_hq (Local AI Models Engine). Previously @AtomicWallet, 15M users.

about 2 months ago

Multi-Token Prediction (MTP) for Qwen on llama.cpp! +40% performance! 90% acceptance rate. Running locally on a MacBook Pro M5 Max 64GB. We patched llama.cpp, quantized Qwen 3.6 27B into GGUF format with TurboQuant, and shipped MTP drafts on top. Benchmark, source code & models👇
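To put the 90% acceptance rate in context, here is a rough sketch of the standard speculative-decoding estimate for how many tokens one verification step yields. It assumes the 90% figure is a per-token acceptance probability, that acceptances are independent, and that the draft length is a free parameter (the post does not state it). The function name `expected_tokens_per_step` is hypothetical.

```python
def expected_tokens_per_step(acceptance: float, draft_len: int) -> float:
    """Expected tokens produced per verification step.

    Assumes each drafted token is accepted independently with probability
    `acceptance`, and that the main model always contributes one token
    (either a correction or a bonus token). This gives the geometric sum
    1 + a + a^2 + ... + a^draft_len.
    """
    if not 0.0 <= acceptance <= 1.0:
        raise ValueError("acceptance must be between 0 and 1")
    if draft_len < 0:
        raise ValueError("draft_len must be non-negative")
    if acceptance == 1.0:
        return float(draft_len + 1)
    return (1.0 - acceptance ** (draft_len + 1)) / (1.0 - acceptance)


# Draft lengths here are illustrative guesses, not values from the post.
for k in (1, 2, 3):
    print(k, round(expected_tokens_per_step(0.9, k), 3))
```

This is an upper bound on the speedup, since it ignores the cost of drafting and of verifying the extra positions. That overhead is one plausible reason a measured gain such as +40% sits below the idealized figure.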



AtomicNodes retweeted

about 2 months ago

Multi-Token Prediction (MTP) for llama.cpp! Running a local Gemma 4 model 1.5x faster. We patched llama.cpp and quantized the Gemma 4 assistant models into GGUF format. We ran tests on a MacBook Pro M5 Max. Gemma 26B with MTP drafts generates tokens 40% faster. Benchmarks, source code and models 👇



about 2 months ago

Introducing Multi-Token Prediction (MTP) for llama.cpp! Multi-Token Prediction gives a significant speedup without quality degradation. Instead of predicting one token at a time, MTP drafts several in parallel. We patched llama.cpp and quantized the Gemma 4 assistant models into GGUF format. We ran tests on a MacBook Pro M5 Max. Gemma 26B with MTP drafts generates tokens 40% faster.
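The draft-then-verify idea behind that claim can be illustrated with a toy example. This is only a sketch under stated assumptions, not the patched llama.cpp code: `target_next` and `draft_next` are made-up stand-ins for the main model and the drafter, decoding is greedy, and verification is simulated position by position where a real engine would check all drafted positions in one batched pass.

```python
from typing import List, Tuple

Token = int


def target_next(ctx: List[Token]) -> Token:
    """Toy stand-in for the main model: a deterministic greedy next token."""
    return (ctx[-1] * 3 + len(ctx)) % 11


def draft_next(ctx: List[Token]) -> Token:
    """Toy stand-in for the drafter: wrong whenever len(ctx) is a multiple of 4."""
    guess = target_next(ctx)
    return guess if len(ctx) % 4 else (guess + 1) % 11


def greedy_decode(prompt: List[Token], n_new: int) -> List[Token]:
    """Baseline: one main-model call per generated token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(target_next(out))
    return out


def speculative_decode(
    prompt: List[Token], n_new: int, draft_len: int = 3
) -> Tuple[List[Token], int]:
    """Draft several tokens, keep the prefix the main model agrees with."""
    out = list(prompt)
    goal = len(prompt) + n_new
    steps = 0
    while len(out) < goal:
        steps += 1
        # 1) Draft: the cheap model proposes draft_len tokens in a row.
        draft: List[Token] = []
        for _ in range(draft_len):
            draft.append(draft_next(out + draft))
        # 2) Verify: compare each drafted token with the main model's choice.
        accepted: List[Token] = []
        for tok in draft:
            expected = target_next(out + accepted)
            if tok != expected:
                accepted.append(expected)  # correct the first miss, then stop
                break
            accepted.append(tok)
        else:
            # Every draft was accepted: the main model adds one bonus token.
            accepted.append(target_next(out + accepted))
        out.extend(accepted)
    return out[:goal], steps


tokens, steps = speculative_decode([1, 2], 20)
print(tokens == greedy_decode([1, 2], 20), steps)
```

Because every kept token is exactly what the main model would have chosen, the output matches plain greedy decoding, which is the sense in which drafting does not degrade quality. The gain comes from needing fewer verification steps than generated tokens.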


AtomicNodes retweeted

2 months ago

Compared Qwen3.6 35B and 27B under the same conditions with Google TurboQuant.

Device: MacBook Pro M5 Max, 64GB RAM

Output characteristics:
Qwen3.6 35B: 6672 tokens, 2m 10s, 65 tok/s
Qwen3.6 27B: 7344 tokens, 5m 22s, 24 tok/s

Conclusion: Both models were asked to draw waves using HTML. The 35B responded quickly, but the result feels weak and messy. The 27B took more time and delivered a much cleaner and more consistent result, because it is built for thinking and planning, so it works better on tasks that need structure. Overall, the 27B is the better choice for tasks where planning matters, while the 35B is more suitable for everyday use when you just need a fast response.
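The token counts and durations above also allow an end-to-end throughput check. A small sketch, assuming the durations are total wall-clock time for the whole response; the post does not say how either the durations or the tok/s figures were measured, and `parse_duration` and `end_to_end_rate` are illustrative helpers.

```python
def parse_duration(text: str) -> int:
    """Convert a duration such as '2m 10s' into whole seconds."""
    unit_seconds = {"m": 60, "s": 1}
    total = 0
    for part in text.split():
        value, unit = int(part[:-1]), part[-1]
        total += value * unit_seconds[unit]
    return total


def end_to_end_rate(tokens: int, duration: str) -> float:
    """Tokens divided by total wall-clock seconds."""
    return tokens / parse_duration(duration)


# (tokens, duration, reported tok/s) as quoted in the post.
results = {
    "Qwen3.6 35B": (6672, "2m 10s", 65),
    "Qwen3.6 27B": (7344, "5m 22s", 24),
}

for name, (tokens, duration, reported) in results.items():
    rate = end_to_end_rate(tokens, duration)
    print(f"{name}: {rate:.1f} tok/s end to end, {reported} tok/s reported")
```

The end-to-end rates come out lower than the reported ones (about 51 vs 65 tok/s and about 23 vs 24 tok/s). One possible reading is that the reported figure covers generation only while the duration also includes prompt processing, but that is a guess.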


AtomicNodes retweeted

Atomic Mail

@atomic_mail

2 months ago

Google Chrome invited Atomic Mail to test how the built-in @GeminiApp can improve private email. Now you can run Atomic Mail AI features with local models on your device:
– 100% private
– Your data stays on your laptop
– Zero cost
Joint case study with @ChromiumDev coming soon – follow us! Guide + link in the comments.


2 months ago

Download the macOS app or run in the cloud: https://t.co/8ibfBRGkW4


AtomicNodes retweeted

atomicbot.ai

@atomicbot_ai

2 months ago

Hermes Agent by @NousResearch (100k+ ⭐) now inside Atomic Bot:
– Free local models: Qwen, Gemma, or
– Use your API keys for any provider
– Dashboard, terminal, logs and file explorer
– Private and open source
Download the macOS app or run in the cloud👇


AtomicNodes retweeted

atomicbot.ai

@atomicbot_ai

2 months ago

You can run Hermes Agent by @NousResearch with Kimi K2.6 on an @atomicbot_ai VPS 🌑 @Kimi_Moonshot just dropped a new open-source coding model. We asked Hermes to build and deploy a game. It did incredibly well!


AtomicNodes retweeted