We learn to speak before we learn to read.
Voice is the most natural interface we have.
We just raised a $100M to make building voice AI as easy as a web app.
Most voice AI agents forget you the second you hang up. No name, no history, no idea what you asked last time.
We gave a LiveKit voice agent persistent memory using @MongoDB Atlas Vector Search. RAG, hybrid rankFusion recall, and a profile that loads before the agent says hello.
Full walkthrough and starter kit below.
Introducing the LiveKit C++ SDK.
Realtime audio, video, and data tracks for C++ apps, with the same low-latency transport our other clients use. Built for the C++ stacks behind robotics, autonomous vehicles, and high-performance media pipelines.
https://t.co/T6NuISyRqb
Honored to be included in @Redpoint’s 2026 InfraRed 100 list alongside the most promising private companies in AI infrastructure.
What we’ve built is a reflection of the customers we get to work with, from SAP and OpenAI to thousands of teams shipping voice agents every day.
https://t.co/wCe4VsLBxq
Add a face to your voice agent.
LiveAvatar by @HeyGen is now supported in LiveKit Agents.
Add a realtime human avatar to your agent without rebuilding the conversation loop.
Your LiveKit agent still owns the room, turn-taking, model orchestration, and voice pipeline. LiveAvatar renders the synchronized face and video stream.
Useful for product demos, onboarding, tutoring, and support agents that need a visual layer.
Congrats to the @cartesia team! Sonic-3.5 just took the #1 spot on the Artificial Analysis Speech Arena and raised the bar for realtime voice generation.
It’s live on LiveKit inference today. Try it with a single line of config and ship the most natural sounding agents.
Cartesia’s Sonic-3.5 takes the #1 spot on the Artificial Analysis Speech Arena Leaderboard, surpassing Inworld Realtime TTS 1.5 Max and Google’s Gemini 3.1 Flash TTS
Sonic-3.5 is the latest TTS model from @cartesia . It supports 42 languages, including 9 Indian languages, with 500+ voices available out of the box. The model has been highly preferred among voters in the TTS Arena, with its demonstrated naturalness and accurate transcript following.
Key takeaways:
➤ Quality: Sonic-3.5 has an Elo score of 1,218 (+16/-16) based on 1,144 arena appearances, placing it ahead of Inworld Realtime TTS 1.5 Max at 1,194 and Gemini 3.1 Flash TTS at 1,209
➤ Pricing: Sonic-3.5 is priced at $39/1M characters, a premium compared to Gemini 3.1 Flash TTS at $18.3/1M characters, and Inworld Realtime TTS 1.5 Max at $35/1M characters
➤ Speed: 105.5 characters per second, compared to 205 characters per second for Inworld Realtime TTS 1.5 Max and 26.3 characters per second for Gemini 3.1 Flash TTS
See more details and listen to samples below 🧵
For AI avatars that feel engaged while your users are speaking, with eye contact, movements, and expressions generated live from a single reference image, check out @runwayml Characters.
Add one to a LiveKit voice agent with three lines of code.
"Building for enterprise isn't just about having the right AI model. It's about having a stack you can stand behind when a customer calls at 9am on a Monday with a problem."
Finn zur Mühlen, Co-founder of telli, on running 30k+ daily calls on LiveKit + @ai_coustics: https://t.co/gfgOWqghxN
Ship a voice agent on any website with a single script tag.
The widget supports voice, video, screen share, and text chat. Configure branding, capabilities, and per-visitor context from the LiveKit Cloud dashboard. Works on Shopify, Webflow, WordPress, or any custom site.
Already built a @LangChain agent? You don't have to rebuild it for voice.
With the LangChain plugin for LiveKit Agents, you can connect it to a realtime voice pipeline, complete with speech-to-text, text-to-speech, and the infrastructure to deploy it at scale.
Your outbound phone agent has 1-2 seconds to figure out if it's talking to a person, a voicemail, or an IVR.
We shipped Answering Machine Detection (AMD) in LiveKit Agents to do that for you so your agent knows when to keep talking, leave a message, use the keypad, or hang up.
“Voice makes AI feel less like a tool and more like a natural part of the experience…LiveKit gives us the scalable foundation to bring those voice experiences to life at enterprise scale, without sacrificing flexibility.” Jonathan von Rüden, @SAP's Chief AI Officer.
Add a face to your voice agent.
LiveAvatar by @HeyGen is now supported in LiveKit Agents.
Add a realtime human avatar to your agent without rebuilding the conversation loop.
Your LiveKit agent still owns the room, turn-taking, model orchestration, and voice pipeline. LiveAvatar renders the synchronized face and video stream.
Useful for product demos, onboarding, tutoring, and support agents that need a visual layer.
Voice cloning is now available on LiveKit Inference. We’re launching with @inworld_ai and @cartesia.
Clone a voice once and use it across multiple TTS providers, with automatic fallback to the same voice if a provider fails mid-call.
Free to create and available on all paid plans today.