Alice Baird

over 7 years ago

https://t.co/xxP6dm66Wf

Aliceebaird retweeted

16 days ago

Today, voice models have no problem generating “angry” or “sad” expressions.

But ask for:
→ bored + fast
→ joy + shy
→ disappointment + confident
…and most systems collapse into stereotypes.

Our latest research blog explores why this happens — and how disentangling emotion from voice at the data layer improves expressive control. Read more below!

2 months ago

Excited to be co-organizing the ACII 2026 DaiKon Workshop & Challenge! Reach out to register a team: [email protected]

2 months ago

We’re excited to launch the 2026 ACII Dyadic Contest (DaiKon) Workshop & Challenge—a new benchmark for modeling emotional influence in dyadic dialogue. Explore a sample of our conversational audio dataset: 945 sessions, 743 hours, across 5 languages. Submissions due May 25. We look forward to your participation!

8 months ago

Octave 2, now multilingual, so huge! 🥳🔥

8 months ago

Introducing Octave 2: our next-generation multilingual text-to-speech model

What’s new:
- Fluent in 11+ languages
- 40% faster (<200ms latency) & 50% cheaper than Octave 1
- Multi-speaker conversation
- More reliable pronunciation
- New voice conversion & phoneme editing capabilities

For the month of October, we’re offering 50% off our Creator plan - use code OCTAVE2 at checkout!

Aliceebaird retweeted

10 months ago

Use OpenAI's new open source model with any voice, cloned or designed!

Aliceebaird retweeted

11 months ago

2024: Voice Cloning
2025: What about personality cloning?

Hume’s voice AI can now not only mimic your voice but also speaking style and language. It’s now available via our TTS and new speech-to-speech model, EVI 3, which is also launching today.

11 months ago

@hume_ai 🔥 so good!

Aliceebaird retweeted

Linus ✦ Ekenstam

@LinusEkenstam

about 1 year ago

This had me floored. This isn’t just a talking model. It understands and expresses voice like a human, across any accent, tone, or style. It doesn’t just speak. It performs. Nervous stammer? Confident debate? A whispered secret? It's all in there...

about 1 year ago

Huge! 💚

about 1 year ago

Meet EVI 3, another step toward general voice intelligence. EVI 3 is a speech-language model that can understand and generate any human voice, not just a handful of speakers. With this broader voice intelligence comes greater expressiveness and a deeper understanding of tune, rhythm, timbre, and speaking style.

Aliceebaird retweeted

VentureBeat

@VentureBeat

over 1 year ago

Hume launches new text-to-speech model Octave that generates custom AI voices with adjustable emotions https://t.co/DbFejU2awV

over 1 year ago

Wow, an incredible achievement by the entire team at Hume! 💚

over 1 year ago

Today, we’re releasing Octave: the first LLM built for text-to-speech.

🎨Design any voice with a prompt
🎬 Give acting instructions to control emotion and delivery (sarcasm, whispering, etc.)
🛠️Produce long-form content on our Creator Studio

Unlike traditional TTS that just “reads” words aloud, Octave understands how meaning affects delivery to generate emotional, human-like speech.

Aliceebaird retweeted

over 1 year ago

Octave’s first-of-its-kind voice intelligence speaks for itself 😉 In a blind study, Octave outperformed ElevenLabs Voice Design:

🔊71.6% preferred Octave’s audio quality
🗣️51.7% found Octave more natural
🎯57.7% said Octave better matched voice descriptions https://t.co/iKGc7ZUQgD

Aliceebaird retweeted

over 1 year ago

Better AI voices coming soon...

Aliceebaird retweeted

over 1 year ago

Introducing OCTAVE, a next-generation speech-language model. OCTAVE has new emergent capabilities, like on-the-fly voice and personality creation and much more 👇

Aliceebaird retweeted

over 1 year ago

Hume's EVI 2 generates speech and language in tandem, with rich context and emotion. It is the only speech-LLM model on the market that seamlessly integrates with frontier LLMs like @AnthropicAI's Claude 3.5 Sonnet.

Together, EVI 2 + Claude 3.5 Sonnet:
🎙️Over 2 million minutes of voice AI conversations completed
✨36% of users choose Claude, higher than any other LLM integrated with EVI 2
💸Developers using EVI 2 + Claude see an 80% reduction in cost and 10% decrease in latency through prompt caching

Aliceebaird retweeted

over 1 year ago

Loneliness and cognitive decline are major health challenges for older adults. https://t.co/KroZsgBrbo integrated Hume’s Empathic Voice Interface (EVI) into their digital companionship app, with powerful results:

🧠88% reported increased mental stimulation
🫂90% experienced reduced loneliness

All from just a few 15-minute sessions over 5 weeks.

Read the case study: https://t.co/sEcebD78Rt

Aliceebaird retweeted

over 1 year ago

EVI 2 is available now 👉 API https://t.co/Kw2o7HVgoR & live demo https://t.co/0zBH3qR39D

Aliceebaird retweeted

almost 2 years ago

Thanks for taking the time to chat with EVI 2 @willknight → https://t.co/0zBH3qRAZb

almost 2 years ago

So proud of our team! See what we’ve been up to: https://t.co/STDnqXp7hD

almost 2 years ago

Introducing Empathic Voice Interface 2 (EVI 2), our new voice-to-voice foundation model. EVI 2 merges language and voice into a single model trained specifically for emotional intelligence. You can try it and start building today.

Aliceebaird retweeted