Inspired by the incredible real-time AI vision demos and ngxson's live SmolVLM + llama.cpp project, I wanted to see what was possible on Apple Silicon!
Super excited to share my SmolVLM Real-Time Webcam Demo with MLX-VLM!
It gives you 24/7 cloud uptime without leaving your computer on, and it's super cheap ($8-10/mo) while scaling up to 48 vCPU/48GB RAM.
Best of all, you can pair it with your other Railway services, so the Hermes Agent has more to integrate with, and you can let it autonomously manage your web apps, automate workflows, and handle customer emails.
Headless OAuth was the other one. Browser callback flows can't survive in a container.
Device code (RFC 8628) running in an xterm panel does. Same flow you'd use signing in on a TV.
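
For anyone who hasn't used it, here is a rough sketch of the device code flow in Python, standard library only. It is not the code from this project. The endpoint URLs and client ID are placeholders you'd get from your provider, and `post_form` and `device_login` are names made up for the illustration. The grant type string, the `authorization_pending` / `slow_down` errors, and the 5-second default interval come from RFC 8628.

```python
import json
import time
import urllib.error
import urllib.parse
import urllib.request

# Grant type defined by RFC 8628 for the token polling request.
GRANT_TYPE = "urn:ietf:params:oauth:grant-type:device_code"


def post_form(url, fields):
    """POST form-encoded fields and return the decoded JSON body.

    OAuth errors usually arrive as HTTP 400 with a JSON body, so read those too.
    """
    data = urllib.parse.urlencode(fields).encode()
    req = urllib.request.Request(url, data=data, headers={"Accept": "application/json"})
    try:
        with urllib.request.urlopen(req) as resp:
            return json.load(resp)
    except urllib.error.HTTPError as err:
        return json.load(err)


def device_login(device_url, token_url, client_id, scope="",
                 post=post_form, sleep=time.sleep, show=print):
    """Run the device flow and return the token response as a dict.

    device_url, token_url and client_id are provider-specific (assumed here).
    post, sleep and show are injectable so the flow can be tested offline.
    """
    fields = {"client_id": client_id}
    if scope:
        fields["scope"] = scope
    grant = post(device_url, fields)

    # No browser callback needed: the user types the code on any other device.
    show(f"Open {grant['verification_uri']} and enter code {grant['user_code']}")

    interval = grant.get("interval", 5)  # RFC 8628 default when omitted
    deadline = time.monotonic() + grant["expires_in"]
    while time.monotonic() < deadline:
        sleep(interval)
        reply = post(token_url, {
            "grant_type": GRANT_TYPE,
            "device_code": grant["device_code"],
            "client_id": client_id,
        })
        if "access_token" in reply:
            return reply
        error = reply.get("error")
        if error == "slow_down":
            interval += 5  # the spec asks clients to back off by 5 seconds
        elif error != "authorization_pending":
            raise RuntimeError(f"device login failed: {error}")
    raise TimeoutError("device code expired before the user approved it")
```

In a container you'd call `device_login(...)` with your provider's two endpoints and let the default `show=print` write the URL and code to the terminal panel. Nothing in the flow needs a port opened back into the container.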