introducing https://t.co/45y9MyH2cQ
- inference server based on @huggingface transformers.js
- OpenAI-compatible
- runs on mac, linux & win via cuda, coreml, dml, webgpu, wasm, cpu
- tested llms from @liquidai, @Alibaba_Qwen & @GoogleDeepMind
- embeddings & speech-to-text
- works with @NousResearch Hermes
- built in ts
- open source as MIT
- the first ever project from @runpod labs
https://t.co/SzB0IZ0bdZ
Did you know it’s possible to summon hundreds of people with crippling AI psychosis on this site? All you have to do is add a hashtag to your post. #keep4o