Dan Lyth @danlyth - Twitter Profile

28 days ago

Maya and Miles are joined by Simone and Charlie, and have far greater utility than they used to, while still being extremely conversational and low-latency. More info in the blog: https://t.co/8yGu9c1jSO

0

112

Dan Lyth @danlyth

28 days ago

Sesame preview now available on iOS ❤️

Sesame

@sesame

28 days ago

A collection of personal agents, crafted for everyday conversation. Preview available now on iOS. https://t.co/Hq5ZXAqPvX

32

340

36

121

185K

2

8

0

623

Dan Lyth @danlyth

about 1 year ago

@dscape Ah, sorry, fixed now

0

15

Dan Lyth @danlyth

about 1 year ago

@dscape Hey, DMs are open :)

1

0

26

Who to follow

Neil Zeghidour

@neilzegh

CEO @GradiumAI. Founder of @kyutai_labs. Invented neural codecs and audio LLMs. Prev. Google DeepMind/Brain, Meta, Toha Heavy Industries.

Stefan Lattner

@deeplearnmusic

Sony Computer Science Laboratories, Paris

Ethan Manilow

@ethanmanilow

universal musical approximator, research @GoogleDeepMind / @GoogleMagenta

Dan Lyth @danlyth

over 1 year ago

@sesame @_apkumar https://t.co/qzwEUfOLi9

0

1

0

1

446

Dan Lyth @danlyth

over 1 year ago

We're a small team @sesame, but looking for great people to join. Check out this podcast with @_apkumar to get a feel for how we work. More info at https://t.co/BD2PsOWlhk

Anjney Midha

@AnjneyMidha

over 1 year ago

the core research team behind the sesame voice model is <9 ppl as @_apkumar walked through in our latest 1.5 hr podcast, talent density beats team size most days

8

170

19

79

36K

1

3

0

3

608

Dan Lyth @danlyth

over 1 year ago

@realmrfakename @sesame https://t.co/vxf29q0YBv https://t.co/GUhV3lJbOi https://t.co/BPN0hMWZEK ❤️

2

7

1

2

245

Dan Lyth @danlyth

over 1 year ago

We're just getting started at @sesame. Check out the demo here: https://t.co/yCyyNKsiw7

Brendan Iribe

@brendaniribe

over 1 year ago

We’re exploring a future where the computer isn’t just a tool—it’s a partner with a truly natural voice and personality. No big claims, just early work we’re excited to share. @sesame

108

784

80

223

303K

6

15

0

2

2K

Dan Lyth @danlyth

over 1 year ago

@realmrfakename @sesame Yeah, we’re open-sourcing one of the base models (not fine-tuned with the voices you hear in the demo) in the next two weeks. Will be here: https://t.co/BPN0hMXxui

7

36

7

20

6K

Dan Lyth @danlyth

over 1 year ago

@reach_vb @realmrfakename @sesame Hey, sounds good vb, will be in touch.

1

3

0

203

Dan Lyth @danlyth

over 1 year ago

Kind words from @seanhollister 🙏. But we've still got a long way to go...

The Verge

@verge

over 1 year ago

Sesame is the first voice assistant I’ve ever wanted to talk to more than once https://t.co/TddtBZUZPY

0

25

6

5

18K

0

3

1

0

493

danlyth retweeted

Brendan Iribe

@brendaniribe

over 1 year ago

And we’re building hardware.

14

104

4

8

8K

Dan Lyth @danlyth

over 1 year ago

Nice overview of some of the things we've been working on @sesame. Always a pleasure working with @justLV.

Justin Alvey

@justLV

over 1 year ago

Excited to share a peek of what I’ve been working on We @sesame believe voice is key to unlocking a future where computers are lifelike Here’s an early preview you can try! 👇 We’ll be open sourcing a model, and yes… we’re building hardware! 🧵

187

2K

250

963

453K

1

7

0

1

404

Dan Lyth @danlyth

over 1 year ago

Delighted to share a little glimpse of what we've been working on @sesame

Sesame

@sesame

over 1 year ago

At Sesame, we believe in a future where computers are lifelike. Today we are unveiling an early glimpse of our expressive voice technology, highlighting our focus on lifelike interactions and our vision for all-day wearable voice companions. https://t.co/Edp8V8urgC

489

6K

917

3K

2M

0

7

0

410

Dan Lyth @danlyth

about 2 years ago

@FluorescentGrey @stableaudio @StabilityAI @iScienceLuvr @jordiponsdotme @ednewtonrex @harmonai_org @zqevans @chrlaf

0

1

0

110

Dan Lyth @danlyth

about 2 years ago

@Dorialexander @pleiasfr @huggingface This is awesome. Do you know approximately how many hours this comes to?

0

2

0

119

Dan Lyth @danlyth

about 2 years ago

Excellent work by @sanchitgandhi99 and @yoachlacombe reproducing the text-description-to-speech model I developed while at @StabilityAI 👏❤️

Sanchit Gandhi @sanchitgandhi99

about 2 years ago

Introducing Parler-TTS: an inference and training library for high-quality, controllable text-to-speech (TTS) models 🗣️ To fuel the development of open-source TTS research, we are open-sourcing all datasets, training code and our first iteration checkpoint: Parler-TTS Mini v0.1

13

601

128

433

72K

0

65

6

20

15K

Dan Lyth @danlyth

over 2 years ago

@HubertSiuzdak Congrats, sounds great! Any code/paper?

1

0

171

Dan Lyth @danlyth

over 2 years ago

@erogol I was wondering the same thing, there’s not a lot of detail on that.

0

1

0

49

Dan Lyth @danlyth

over 2 years ago

Moving beyond naturalness and WER, they propose a set of sentences that test the model’s ability to deal with compound nouns, emotions, foreign words, paralinguistics (e.g. whispering if the text requires it) etc. etc. The full test set is included in the appendix. 👏 2/7

danlyth's tweet photo. Moving beyond naturalness and WER, they propose a set of sentences that test the model’s ability to deal with compound nouns, emotions, foreign words, paralinguistics (e.g. whispering if the text requires it) etc. etc.

The full test set is included in the appendix. 👏

2/7 https://t.co/HCOFC43O0A

1

5

0

649

Dan Lyth @danlyth

over 2 years ago

There are a bunch of other interesting elements to this work, and it’s worth a read. Plenty of examples on the demo site too. Nice work Mateusz Łajszczak, @guillecambara, Yang Li, and all the other contributors. https://t.co/VV2SwDmidD

1

4

0

378

Dan Lyth @danlyth

over 2 years ago

The speech “de-tokenizer” (or decoder) is a convolutional model that’s streamable and 3x faster than their diffusion-based baseline (and also sounds better). It’s built around BigVGAN. 6/7

danlyth's tweet photo. The speech “de-tokenizer” (or decoder) is a convolutional model that’s streamable and 3x faster than their diffusion-based baseline (and also sounds better).
It’s built around BigVGAN.

6/7 https://t.co/IG3clZ8R0x

2

3

0

1

520

Dan Lyth

@danlyth

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users