Today, we’re excited to introduce Miso One, the most emotive voice model in the world.
Miso One is an 8-billion-parameter text-to-speech model for highly expressive speech generation. It emotes like a human and responds faster than a human, with just 110 milliseconds of latency.
We’ve open-sourced the model weights, with API access coming soon.
Hear how Miso One sounds in the thread below.
“when i choose to see the good side of things, i'm not being naive. it is strategic and necessary. it's how I've learned to survive through everything”
actually you know what
why the FUCK haven't we made blenders quieter, we found a way to silence guns, SURELY someone can silence a blender
please
they're so fucking loud