One performance, infinite voices.
Voice Conversion is now live on Hume’s creator studio and API!
Generate the same pacing, pronunciation, and intonation with one recording across any voice you choose.
Hear it for yourself ⬇️
Introducing Octave 2: our next-generation multilingual text-to-speech model
What’s new:
- Fluent in 11+ languages
- 40% faster (<200ms latency) & 50% cheaper than Octave 1
- Multi-speaker conversation
- More reliable pronunciation
- New voice conversion & phoneme editing capabilities
For the month of October, we’re offering 50% off our Creator plan - use code OCTAVE2 at checkout!
SambaNova 🤝 @Hume_AI
Imagine if the world's most realistic voice AI was also super fast, ridiculously smart, and crazy affordable 🤯
Well, we made it happen... and you can see it (& hear it 😉) for yourself👇
You can use Cerebras' gpt-oss-120b to build realistic speech-to-speech voice interfaces with emotion via @hume_ai's new EVI 3. Perfect for your next voice ai project!
Link to try below 👇
2024: Voice Cloning
2025: What about personality cloning?
Hume’s voice AI can now not only mimic your voice but also speaking style and language.
It’s now available via our TTS and new speech-to-speech model, EVI 3, which is also launching today.
Meet EVI 3, another step toward general voice intelligence.
EVI 3 is a speech-language model that can understand and generate any human voice, not just a handful of speakers. With this broader voice intelligence comes greater expressiveness and a deeper understanding of tune, rhythm, timbre, and speaking style.