Amu @akv_13 - Twitter Profile

Amu

@akv_13

about 1 year ago

@Devon07979992 What milestone would you say this hit?

1

0

118

Amu

@akv_13

about 1 year ago

Voice most important modality after text. Speech-LLMs are the future. Here is a chronological list of the important Open Source milestones in the Speech-LLM space 📖. A thread 🧵 ...

7

34

4

16

6K

Amu

@akv_13

about 1 year ago

@dctanner @uselayercode Awesome! Excited to see it!

0

1

0

162

Amu

@akv_13

about 1 year ago

@kadirnardev was trying to decide between spark and llasa :)

0

1

0

162

Amu

@akv_13

about 1 year ago

Qwen 2.5 Omni (03/2025) “No catastrophic forgetting in e2e speech” https://t.co/wQuaZn9N22

0

3

0

513

Amu

@akv_13

about 1 year ago

Orpheus (03/2025) “Finally OS speech >= closed source” https://t.co/MJCejfCiJH

1

2

0

612

akv_13 retweeted

Elias @Eliasfiz

about 1 year ago

We also sponsor visas! Apply by emailing us: jobs[at]canopylabs[dot]ai

5

434

11

308

57K

akv_13 retweeted

Thomas Wolf

@Thom_Wolf

about 1 year ago

Orpheus a SOTA text-to-speech model https://t.co/yLObMCEpS2

1

48

3

21

6K

akv_13 retweeted

Elias @Eliasfiz

about 1 year ago

OrpheusTTS is now at 200k downloads on Hugging Face with 4.7k stars and 400 forks on Github! The OS community has been crushing it

3

95

4

31

5K

Amu

@akv_13

about 1 year ago

@m4zas24 Hi - we'd appreciate any suggestions - please feel free to post them here/dm me/ or put them in a github discussion!

0

60

Amu

@akv_13

about 1 year ago

We designed Orpheus to be easily fine tuned. On Hugging Face 🤗 there are over 200 finetunes of our model. People have made Orpheus: - Speak dozens of languages 🌎 - Create personalised voice interfaces 🔊 - Recreate historical/fictional characters📜

Elias @Eliasfiz

about 1 year ago

Today, we’re launching Orpheus, an open-source TTS model that exceeds the capabilities of both open and closed-source models such as ElevenLabs and OpenAI! (1/6)

183

4K

388

4K

629K

3

108

17

67

6K

akv_13 retweeted

Philip Kiely

@philipkiely

about 1 year ago

Deploying and vibe checking Orpheus TTS, an open-source model for generating speech. Our implementation supports up to 48 concurrent real-time users per H100 GPU!

4

20

4

6

34K

Amu

@akv_13

about 1 year ago

Excited to bring optimised inference for Orpheus to everyone!

Elias @Eliasfiz

about 1 year ago

People told us they want Orpheus TTS in production. So we partnered with @baseten as our preferred inference provider! Baseten runs Orpheus with: •⁠ ⁠Low latency (<200 ms TTFB) •⁠ ⁠High throughput (up to 48 real-time streams per H100) •⁠ ⁠Secure, worldwide infra

Eliasfiz's tweet photo. People told us they want Orpheus TTS in production.

So we partnered with @baseten as our preferred inference provider!

Baseten runs Orpheus with:

•⁠ ⁠Low latency (<200 ms TTFB)
•⁠ ⁠High throughput (up to 48 real-time streams per H100)
•⁠ ⁠Secure, worldwide infra https://t.co/8E27BAXKnS

15

164

15

119

18K

1

11

1

2

784

akv_13 retweeted

Jack

@jackndwyer

about 1 year ago

Building an AI app with $1/hr AI voice, memory, tool-calling, phone—that's what you use Gabber for But time to value shouldn't be hours, it should be seconds Build a sample AI companion in 20 seconds, try it, tune it, then go live with it (last part coming soon)

3

29

3

12

2K

akv_13 retweeted

SebastianBoo

@SebastianB929

about 1 year ago

Thanks for the OrpheusTTS release! Great models, easy to finetune and you can even livestream using vllm or sglang with a single RTX 3090 with FP8 quantization. INT4 should be even faster.😃