Brandon Yang

15 days ago

@elipughresearch congrats!!

0

4

0

268

Building ai systems, Stanford cs phd @hazyresearch, Incoming assistant professor @caltech, Leading the frontier performance research team @togethercompute

15 days ago

@Shrey2809 👨‍🍳

0

3

0

69

Who to follow

Simran Arora

@simran_s_arora

Omar Khattab

@lateinteraction

asst professor @MIT CSAIL @nlp_mit. https://t.co/VgyLxl0VZz, https://t.co/ZZaSzaRIOF (@DSPyOSS), RLMs, GEPA, Pedagogical RL

Chaitanya K. Joshi

@chaitjo

AI researcher excited about biomolecule design 🧬 Postdoc @Stanford @RDasLab PhD student @Cambridge_Uni Prev. FAIR @AIatMeta @PrescientDesign @MRC_LMB

15 days ago

Ink-2 is our first streaming ASR model, built specifically for realtime voice agents - and it's #1 on @ArtificialAnlys on day 1! It's rare for models to be #1 on the first try on a new benchmark, since model development is iterative and there's so much that goes into understanding quality. We've seen great results internally and can't wait for everyone to try it!

15 days ago

Cartesia Ink-2 debuts as #1 for accuracy on the brand-new streaming speech-to-text leaderboard from @ArtificialAnlys! We designed Ink-2 from the ground up for voice agents - with low latency, eager transcripts, and semantic endpointing.

9

126

38

49

63K

0

26

2

1

878

15 days ago

@ArtificialAnlys Congrats on the launch! Very excited to see more realtime streaming benchmarks!

0

6

0

112

15 days ago

@krandiash i thought this was deb's joke

0

2

0

389

21 days ago

@_albertgu and yet he's still not happy

0

9

0

61

21 days ago

@elipughresearch :fat_yoshi:

0

1

0

47

21 days ago

@fluorane gud

0

1

0

108

21 days ago

@chongz this does not feel like the shilling that was asked for

2

7

0

194

21 days ago

@LLMenjoyer the gifs are incredible

0

2

0

80

21 days ago

@ZhunLiu3 👀

0

3

0

68

bclyang retweeted

21 days ago

Sonic 3.5 is now the #1 text to speech model on the @ArtificialAnlys leaderboard! You no longer have to trade off quality and latency - Sonic 3.5 also has the fastest time to first audio at 82ms end to end. See full benchmark results 👇

5

85

21

16

10K

21 days ago

Sonic 3.5 is now the #1 TTS model on @ArtificialAnlys, an independent benchmark of TTS quality! It's also the fastest model with 82ms end to end latency - it's always been our dream to build realtime voice with no trade-offs. Building great models comes from getting the fundamental right - infrastructure, architecture, data, and evals - and I'm proud to see the hard work for the team recognized!

Artificial Analysis

@ArtificialAnlys

21 days ago

Cartesia’s Sonic-3.5 takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Inworld Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, including 9 Indian languages, with 500+ voices available out of the box. The model has been highly preferred among voters in the TTS Arena, with its demonstrated naturalness and accurate transcript following. Key takeaways: ➤ Quality: Sonic-3.5 has an Elo score of 1,218 (+16/-16) based on 1,144 arena appearances, placing it ahead of Inworld Realtime TTS 1.5 Max at 1,194 and Gemini 3.1 Flash TTS at 1,209 ➤ Pricing: Sonic-3.5 is priced at $39/1M characters, a premium compared to Gemini 3.1 Flash TTS at $18.3/1M characters, and Inworld Realtime TTS 1.5 Max at $35/1M characters ➤ Speed: 105.5 characters per second, compared to 205 characters per second for Inworld Realtime TTS 1.5 Max and 26.3 characters per second for Gemini 3.1 Flash TTS See more details and listen to samples below 🧵

ArtificialAnlys's tweet photo. Cartesia’s Sonic-3.5 takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Inworld Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS

Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, including 9 Indian languages, with 500+ voices available out of the box. The model has been highly preferred among voters in the TTS Arena, with its demonstrated naturalness and accurate transcript following.

Key takeaways:
➤ Quality: Sonic-3.5 has an Elo score of 1,218 (+16/-16) based on 1,144 arena appearances, placing it ahead of Inworld Realtime TTS 1.5 Max at 1,194 and Gemini 3.1 Flash TTS at 1,209

➤ Pricing: Sonic-3.5 is priced at $39/1M characters, a premium compared to Gemini 3.1 Flash TTS at $18.3/1M characters, and Inworld Realtime TTS 1.5 Max at $35/1M characters

➤ Speed: 105.5 characters per second, compared to 205 characters per second for Inworld Realtime TTS 1.5 Max and 26.3 characters per second for Gemini 3.1 Flash TTS

See more details and listen to samples below 🧵

18

266

54

90

110K

3

86

5

20

5K

bclyang retweeted

22 days ago

NYC happy hour with @awscloud Thurs, June 4 · 5:30pm · NY tech week RSVP in comments 👇

1

7

1

0

2K

bclyang retweeted