We released iOS app for Hermes Agent ๐ฑ
Connect to your self-hosted agent over Tailscale, Cloudflare Tunnel or ngrok. Or deploy on a VPS. Run tasks and manage your agent from anywhere.
New Google Gemma 4 12B claims near-26B performance - we tested both!
We ran both models locally on one RTX 4090 and gave each the same task: write a self-contained HTML5 canvas animation with real physics in one file without libraries. Three scenes - a Galton board, two blocks colliding off a wall, and a chaotic triple pendulum
Outputs:
Gemma 4 26B-A4B: 15 GB VRAM usage, 6.9k tokens, 138 tok/s
Gemma 4 12B: 9 GB VRAM usage, 8.9k tokens, 80 tok/s
Same Gemma 4 family, but the 26B-A4B won every scene and ran ~1.7x faster - on just 4B active params. The 12B stayed very close though, on almost half the VRAM - which makes it the ideal model for a 16 GB laptop
MiniMax M3 turned a napkin sketch into a playable game
We handed MiniMax M3 a hand-drawn draft of a Doodle Jump style platformer. It read the elements off the draft, wrote the logic, drew the interface and shipped it as one self-contained HTML game
Input: 6,920 tokens
Output: 9,933 tokens
Cost: $0.028
@MiniMax_AI drops M3 on @huggingface next week
We released iOS app for Hermes Agent ๐ฑ
Connect to your self-hosted agent over Tailscale, Cloudflare Tunnel or ngrok. Or deploy on a VPS. Run tasks and manage your agent from anywhere.
@itsthedonhashim@testingcatalog Tailscale puts your phone and VPS on the same private encrypted network. Hermes runs on the VPS, your phone connects like it's local
@RaafatMS1@testingcatalog same behavior as core Hermes, we don't override session recovery logic. waiting on upstream updates from @Teknium on that one. what's your VPS setup?
@Tracklish@gladkos you can store your API keys in the cloud dashboard and they'll sync to the mobile app. sign in with the same account and you're good to go ๐