Amphion @realamphion - Twitter Profile

Amphion @realamphion

over 1 year ago

📚 Explore Now! 🌐 Dataset: https://t.co/NMR7C6LOrZ 📄 Paper: https://t.co/ODeVXYzsCP

0

6

0

2

372

Amphion @realamphion

over 1 year ago

🚀Introducing Emilia-Large: 200K+ Hours of Open-Source Speech Data! We’re excited to release Emilia-Large, the largest TTS pretraining datasets! With 200K+ hours of multilingual speech data, fully open-source. It is ready to use for #TTS and #SpeechLM.

realamphion's tweet photo. 🚀Introducing Emilia-Large: 200K+ Hours of Open-Source Speech Data!
We’re excited to release Emilia-Large, the largest TTS pretraining datasets! With 200K+ hours of multilingual speech data, fully open-source. It is ready to use for #TTS and #SpeechLM. https://t.co/YqY5U0YgsE

2

59

12

22

6K

Amphion @realamphion

over 1 year ago

✨ What’s New? - 2x Scale: Expanded the original Emilia dataset from 101K to 200K+ hours with the new Emilia-YODAS dataset. - Low-Resource Boost: Enhanced support for languages like German, French, and Japanese. - Commercial Use: Emilia-YODAS is released under CC-BY

Amphion @realamphion

over 1 year ago

🚀Introducing Emilia-Large: 200K+ Hours of Open-Source Speech Data! We’re excited to release Emilia-Large, the largest TTS pretraining datasets! With 200K+ hours of multilingual speech data, fully open-source. It is ready to use for #TTS and #SpeechLM.

2

59

12

22

6K

1

11

1

3

583

realamphion retweeted

𝚐𝔪𝟾𝚡𝚡𝟾 @gm8xx8

over 1 year ago

Metis: A Foundation Speech Generation Model with Masked Generative Pre-training paper: https://t.co/OESKnCYA38 demo: https://t.co/yz8DGDOi1c

0

6

5

0

516

Amphion @realamphion

over 1 year ago

@234Sagyboy @discord @discordbots @DiscordBotDevs @_akhaliq @huggingface @reach_vb @Gradio Thanks for the feedback

0

2

0

48

Amphion @realamphion

over 1 year ago

🚀🚀🚀 MaskGCT! In addition to the HuggingFace demo: https://t.co/2mCZA9GLzD you can also join the discord space to play: https://t.co/22IxeaVRq5 Also pre-generated samples: https://t.co/LXiFEiz3Ax @discord @discordbots @DiscordBotDevs @_akhaliq

realamphion's tweet photo. 🚀🚀🚀 MaskGCT!
In addition to the HuggingFace demo: https://t.co/2mCZA9GLzD

you can also join the discord space to play: https://t.co/22IxeaVRq5

Also pre-generated samples: https://t.co/LXiFEiz3Ax

@discord @discordbots @DiscordBotDevs @_akhaliq https://t.co/oWWtkyRZjr

1

8

0

3

523

realamphion retweeted

Sylvain Filoni

@fffiloni

over 1 year ago

Sorry to interrupt but, YES, MaskGCT TTS works for French language ! I have not tested with other latin languages yet, but my guess is that it should work too 🤗

6

121

17

84

12K

realamphion retweeted

Sylvain Filoni

@fffiloni

over 1 year ago

I've added the MaskGCT TTS @gradio API to the Echo Mimic Space, so you can directly clone your voice before generating portrait generation 🤗 Try it —› https://t.co/vSWUt0lbEL

fffiloni's tweet photo. I've added the MaskGCT TTS @gradio API to the Echo Mimic Space, so you can directly clone your voice before generating portrait generation 🤗

Try it —› https://t.co/vSWUt0lbEL https://t.co/y4caaAitVz

8

185

38

147

15K

Amphion @realamphion

over 1 year ago

GitHub: https://t.co/zr1hJCWRxa MaskGCT: https://t.co/ggVXGAQIiC

0

2

0

2

393

Amphion @realamphion

over 1 year ago

🔥🔥🔥MaskGCT is hot, making Amphion on the list of GitHub Trending again! > SoTA TTS model > Zero-shot cloning > Emotional TTS > Multilingual, now supporting English and Chinese > Fully non-autoregressive and duration controllable Try in HF and https://t.co/FvmcJ5pm6z

3

34

6

28

9K

Amphion @realamphion

over 1 year ago

HF demo: https://t.co/2mCZA9GLzD

1

2

1

2

402

realamphion retweeted

Vaibhav (VB) Srivastav

@reach_vb

over 1 year ago

Fuck yeah! MaskGCT - New open SoTA Text to Speech model! 🔥 > Zero-shot voice cloning > Emotional TTS > Trained on 100K hours of data > Long form synthesis > Variable speed synthesis > Bilingual - Chinese & English > Available on Hugging Face Fully non-autoregressive architecture: > Stage 1: Predicts semantic tokens from text, using tokens extracted from a speech self-supervised learning (SSL) model > Stage 2: Predicts acoustic tokens conditioned on the semantic tokens. Synthesised: "Would you guys personally like to have a fake fireplace, an electric one, in your house? Or would you rather have a real fireplace? Let me know down below. Okay everybody, that's all for today's video and I hope you guys learned a bunch of furniture vocabulary!" TTS scene keeps getting lit! 🐐

40

1K

149

1K

139K

Amphion @realamphion

over 1 year ago

@reach_vb Thanks to @Reach_Vbarrels A demo from the community: https://t.co/bIRkNKWJud

0

1

100

Amphion @realamphion

over 1 year ago

@Alice2848126245 from the speech prompt

1

0

62

Amphion @realamphion

over 1 year ago

🚀🚀🚀 A Zero-Shot TTS model MaskGCT (Masked Generative Codec Transformer) is open-sourced in Amphion now. Trained with Emilia. Only needs 5 sec speech to clone Paper: https://t.co/OdoQ3niCeY HF: https://t.co/2mCZA9GLzD Discord: https://t.co/FvmcJ5pm6z Watch the demo by MaskGCT

8

75

32

62

12K

Amphion @realamphion

over 1 year ago

@Sathees89347227 In a few weeks

1

2

0

90

Amphion @realamphion

over 1 year ago

@cosmic_spec Internal version is about 500ms

0

1

0

90

Amphion @realamphion

over 1 year ago

@D3crypTor_X Yes, you can control it

1

0

17

Amphion @realamphion

over 1 year ago

@ElonMuskAOC Your new AI interview 😂

1

0

1

113

Amphion @realamphion

over 1 year ago

@mohamed17381489 Yes, it supports multi-lingual. We are going to release another checkpoint that supports 6 languages soon.

0

3

0

153

Amphion

@realamphion

Last Seen Users on Sotwe

Trends for you

Most Popular Users