End of an era - Charlie can be succeeded but never replaced.
Excited to see what he’s going to build next, and I’ll be forever grateful for everything he’s taught me over the last four years. Like very few people, he represents the Weltgeist of crypto and its innate curiosity.
Meet EVI 3, another step toward general voice intelligence.
EVI 3 is a speech-language model that can understand and generate any human voice, not just a handful of speakers. With this broader voice intelligence comes greater expressiveness and a deeper understanding of tune, rhythm, timbre, and speaking style.
Today, we’re releasing Octave: the first LLM built for text-to-speech.
🎨Design any voice with a prompt
🎬 Give acting instructions to control emotion and delivery (sarcasm, whispering, etc.)
🛠️Produce long-form content on our Creator Studio
Unlike traditional TTS that just “reads” words aloud, Octave understands how meaning affects delivery to generate emotional, human-like speech.
Introducing OCTAVE, a next-generation speech-language model.
OCTAVE has new emergent capabilities, like on-the-fly voice and personality creation and much more 👇
Hume is continuing their work on voices
This is Voice Control, and personally I believe for mass adoption this type of simple UX is what’s needed.
Just use sliders to manipulate the voice you want for your project.
Introducing Voice Control by Hume
We developed an experimental voice modulation approach that enables you to create unique AI voices in seconds.
Our voice sliders make it intuitive to adjust base voices along 10 interpretable dimensions including:
👃 Nasality: resonant to nasal
🎼 Masculine/Feminine: from masculine to feminine
🎈 Buoyancy: from deflated to buoyant
Check out the sample creations in the thread below 👀
https://t.co/SKgSwPDJ3K with the 'Deeper Questions' character is my new fav ai product
it's basically gpt-4o voice mode (interruptions, sentiment), based on new claude sonnet
it's like 85% as smooth as 4o voice, but the model is WAY smarter, which makes a huge difference to me
Hume's EVI 2 generates speech and language in tandem, with rich context and emotion. It is the only speech-LLM model on the market that seamlessly integrates with frontier LLMs like @AnthropicAI's Claude 3.5 Sonnet.
Together, EVI 2 + Claude 3.5 Sonnet:
🎙️Over 2 million minutes of voice AI conversations completed
✨36% of users choose Claude, higher than any other LLM integrated with EVI 2
💸Developers using EVI 2 + Claude see an 80% reduction in cost and 10% decrease in latency through prompt caching
Introducing the new Hume App
Featuring brand new assistants that combine voices and personalities generated by our speech-language model, EVI 2, with supplemental LLMs and tools like the new Claude 3.5 Haiku from @AnthropicAI.
Announcing the Hume Startup Grant Program!
♾ 3 months unlimited EVI API access
🤝 Tech support
📢 Co-marketing opportunities
Apply now: https://t.co/fVSNLB26Bj
We’re backing developers building the first generation of empathic voice AI products on the only available voice-to-voice API. 🗣️
Introducing Empathic Voice Interface 2 (EVI 2), our new voice-to-voice foundation model. EVI 2 merges language and voice into a single model trained specifically for emotional intelligence.
You can try it and start building today.
Dot, the personal AI from @newcomputer, uses Hume’s expression measurement API to interpret your tone in voice memos and craft more thoughtful text replies.
According to @sjwhitmore, this is part of why 60% of users choose to interact with Dot through voice.
Learn more: https://t.co/XwULt9DpOK