Did xAI just mass-murder the entire voice AI industry? 🤯
Grok just launched two voice APIs. Speech-to-Text and Text-to-Speech.
Built on the same stack powering Tesla cars and Starlink support.
And priced at 10x cheaper than ElevenLabs.
Speech-to-Text: $0.10/hr batch. $0.20/hr streaming.
Text-to-Speech: $4.20 per million characters.
25+ languages. Real-time streaming. Speaker diarization.
Already outperforming ElevenLabs, Deepgram, and AssemblyAI on word error rate.
TTS ships with expressive tags like [laugh], [sigh], <whisper>, <emphasis>.
Voices that don't sound like robots reading a script.
ElevenLabs spent years building a voice AI company.
xAI built voice AI for cars and satellites.
Last month my intern asked for help with a Kubernetes error.
He was stuck on a YAML file.
He looked desperate.
I make $275,000 a year.
I haven't written a line of code since 2017.
I don't even know what a "pod" is.
But I didn't tell him that.
I leaned back in my Herman Miller chair.
I said, "Stop trying to code. Start prompting."
I told him to paste the error into ChatGPT.
He did.
The AI told him to delete the cluster.
He did.
Production went down instantly.
The CEO called me screaming.
I didn't panic.
I told the CEO we were "testing our disaster recovery protocols."
He was impressed by my foresight.
I got a bonus.
The intern got fired.
Innovation requires sacrifice.
Just not mine.
🚨 ElevenLabs just killed every transcription tool on the market.
They dropped Scribe v2 Realtime and it's not just another speech-to-text model.
This thing processes audio in real-time with zero lag, handles multiple speakers, and actually understands context not just words.
Here's how with real examples:
Full thread 🧵
gpt-oss is a big deal; it is a state-of-the-art open-weights reasoning model, with strong real-world performance comparable to o4-mini, that you can run locally on your own computer (or phone with the smaller size). We believe this is the best and most usable open model in the world.
We're excited to make this model, the result of billions of dollars of research, available to the world to get AI into the hands of the most people possible. We believe far more good than bad will come from it; for example, gpt-oss-120b performs about as well as o3 on challenging health issues. We have worked hard to mitigate the most serious safety issues, especially around biosecurity. gpt-oss models perform comparably to our frontier models on internal safety benchmarks.
We believe in individual empowerment. Although we believe most people will want to use a convenient service like ChatGPT, people should be able to directly control and modify their own AI when they need to, and the privacy benefits are obvious.
As part of this, we are quite hopeful that this release will enable new kinds of research and the creation of new kinds of products. We expect a meaningful uptick in the rate of innovation in our field, and for many more people to do important work than were able to before.
OpenAI’s mission is to ensure AGI that benefits all of humanity. To that end, we are excited for the world to be building on an open AI stack created in the United States, based on democratic values, available for free to all and for wide benefit.
Bitcoin Core developers are about to merge a change that turns Bitcoin into a worthless altcoin, and no one seems to care to do anything about it.
I've voiced objections, lost sleep over this, and despite clear community rejection of the PR it's moving.
https://t.co/dMcQ4g8RjV
We just shipped chapter 4 of The Anatomy of Go!
For those who have already purchased the book at the discounted price; simply click the link in the next tweet, login and download the latest version completely free.
Chapter 4 adds 100+ new pages and digs into how Structs, Interfaces and Generics work under the hood.
I remain confident that no Go other book goes to the depth that this one does and everyone who reads it (including me) will become a better Go engineer after reading it.
If you haven't picked up a copy, you can still get it at a discounted price whilst it's still in early access.
@ivanfioravanti Using RedCraft Flux for txt2img and new LTX-Video distilled model for img2vid.
Running inside ComfyUI.
Here is the workflow with LTX I used as a base https://t.co/IfD2UkyJZ3
Quite slow, but we are getting there.
LTX video gen is real time on H100 with the distilled model.
BIG NEWS
@Kling_ai 2.0 is here!
It brings fluid and dynamic motion with fantastic prompt understanding.
I needed a use case to showcase how awesome it is, so I made a short film.
Enjoy your trip to Mars:
NEW: Scientists have brought back dire wolves using ancient DNA, with the first born on October 1, 2024, over 10,000 years after their extinction
The genome was reconstructed by Colossal from ancient DNA found in fossils
The fossils date back 11,500 and 72,000 years
Colossal Biosciences said: "This moment marks not only a milestone for us as a company but also a leap forward for science, conservation, and humanity. From the beginning, our goal has been clear:
To revolutionize history and be the first company to use CRISPR technology successfully in the de-extinction of previously lost species.
By achieving this, we continue to push forward our broader mission on—accepting humanity’s duty to restore Earth to a healthier state."
@forgebitz I used to fear that the new generation of programmers would be smarter than me and quickly replace me in a few years. Thanks to GPT for this generation of eternal Juniors.
@jackfriks@supabase 😎 Engineers before:
I'll set up iptables, add captcha, register only after email confirmation
🤓Engineers now:
*post to X* cry cry please stophhh
Last night I used 4 cameras to capture the ultimate HDR view of the lunar eclipse
This 300GB image shows the lunar surface in extreme detail, while revealing all rich color that was projected onto the surface
See the full res crop or get a print in the reply to this post
wow..
Sesame just open sourced its 1B base AI model
CSM (Conversational Speech Model)
- trained on 1 MILLION hours of data
- voice cloning
- based on llama architecture
- real-time synthesis
- free to try and download now
Link in comments.