Hume AI

Verified account

@hume_ai

The empathic AI research lab. Providing the open source models, datasets, and evaluation APIs to embed emotional intelligence into your models.

New York, NY

Joined March 2021

25 Following

23.2K Followers

701 Posts

Pinned Tweet

4 months ago

Today we're releasing our first open source TTS model, TADA! TADA (Text Audio Dual Alignment) is a speech-language model that generates text and audio in one synchronized stream to reduce token-level hallucinations and improve latency. This means: → Zero content hallucinations across 1,000+ test samples → 5x faster than similar-grade LLM-based TTS → Fits much longer audio: 2,048 tokens cover ~700 seconds with TADA vs. ~70 seconds in conventional systems → Free transcript alongside audio with no added latency

102

3K

310

3K

269K

about 1 month ago

https://t.co/Fy0OZYrUtX

0

6

0

2

580

about 1 month ago

Today, voice models have no problem generating “angry” or “sad” expressions. But ask for: → bored + fast → joy + shy → disappointment + confident …and most systems collapse into stereotypes. Our latest research blog explores why this happens — and how disentangling emotion from voice at the data layer improves expressive control. Read more below!

hume_ai's tweet photo. Today, voice models have no problem generating “angry” or “sad” expressions.

But ask for:
→ bored + fast
→ joy + shy
→ disappointment + confident
…and most systems collapse into stereotypes.

Our latest research blog explores why this happens — and how disentangling emotion from voice at the data layer improves expressive control. Read more below!

2

31

3

12

1K

3 months ago

@derek1159 We are too!

0

0

0

0

84

Who to follow

Verified account

Unleash your imagination—chat, create, explore.

Verified account

🚀 Turn text into videos with AI voices 🔥 Convert tweets into videos 🎥 Transform ideas and blog articles into videos 🎙️ Lifelike text to speech voices

Databricks AI Research

Verified account

We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.

3 months ago

We’re excited to launch the 2026 ACII Dyadic Contest (DaiKon) Workshop & Challenge—a new benchmark for modeling emotional influence in dyadic dialogue. Explore a sample of our conversational audio dataset: 945 sessions, 743 hours, across 5 languages. Submissions due May 25. We look forward to your participation!

hume_ai's tweet photo. We’re excited to launch the 2026 ACII Dyadic Contest (DaiKon) Workshop & Challenge—a new benchmark for modeling emotional influence in dyadic dialogue.

Explore a sample of our conversational audio dataset: 945 sessions, 743 hours, across 5 languages.

Submissions due May 25. We look forward to your participation!

6

39

9

21

3K

3 months ago

This dataset at scale is also available in its original dual-channel format. Explore more of our training datasets at https://t.co/C5p5lAtDYl

1

4

0

4

1K

3 months ago

Register and find out more at https://t.co/DyEf9ereL1

1

7

1

4

1K

3 months ago

Find more here 👇 https://t.co/RBhWViOHhW https://t.co/PQGfrpfQhY

1

20

2

24

2K

3 months ago

Today, we're shipping MLX support for TADA, our open-source text-to-speech model, which means the entire pipeline (LLM, flow-matching, and decoder) can now run locally on any Apple Silicon device. We're seeing a 45% reduction in memory usage and a 10x speed-up when using it quantized. With these improvements, you can use TADA on-device for OpenClaw or any personal chatbot. If you own a MacBook, Mac Mini, or Mac Studio, record a 10-second clip of any voice, type any text, and get high-quality, natural and expressive speech in real-time. Completely offline, completely free.

10

303

28

348

24K

4 months ago

@fffiloni @huggingface @Gradio Great work!

0

0

0

0

125

hume_ai retweeted

4 months ago

I made a @huggingface Space @Gradio demo for TADA to make the paper’s workflow easier to explore. The original demo was a bit confusing, so this one is more guided and helps you understand what’s really going on — and in what order the pipeline is supposed to work.

3

54

6

51

7K

4 months ago

@wuzhu_ @huggingface @grok It is not fine-tuned for languages outside of English, though it has some multilingual capabilities!

0

0

0

0

15

4 months ago

Today we're releasing our first open source TTS model, TADA! TADA (Text Audio Dual Alignment) is a speech-language model that generates text and audio in one synchronized stream to reduce token-level hallucinations and improve latency. This means: → Zero content hallucinations across 1,000+ test samples → 5x faster than similar-grade LLM-based TTS → Fits much longer audio: 2,048 tokens cover ~700 seconds with TADA vs. ~70 seconds in conventional systems → Free transcript alongside audio with no added latency

102

3K

310

3K

269K

4 months ago

@CastrikoEdit It is not fine-tuned for languages outside of English, though it has some multilingual capabilities!

0

1

0

0

215

4 months ago

@overbeight Please try again and let us know if you're still seeing this!

0

0

0

0

416

4 months ago

@_NeutralFan_ It is not fine-tuned for languages outside of English, though it has some multilingual capabilities!

0

0

0

0

241

4 months ago

@rajbreno It is not fine-tuned for languages outside of English, though it has some capabilities in other languages.

0

1

0

1

416

4 months ago

@brubbleR Try again and let us know if you're running into issues!

1

0

0

0

192

4 months ago

@0xecall Please try again and let us know if you're still running into this issue!

0

0

0

0

284

4 months ago

Read our blog: https://t.co/KqNp4jD0lV

1

57

5

35

11K

4 months ago

Try the model: https://t.co/kY2sSifaXq

3

157

14

110

37K

Last Seen Users on Sotwe

Trends for you

Most Popular Users