I'm a co-founder at Bytes AI.
we build voice AI that answers the phone for restaurants. takes the order, upsells, sends it to the POS, dispatches delivery.
live in 1000+ restaurants.
Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents.
Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold.
Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.
We’ve cut ElevenAPI and ElevenAgents pricing for self-serve developers.
From today, prices are lower and you can pay as you go:
- Text to Speech is now up to 55% lower cost
- Speech to Text is now up to 45% lower cost
- Agents is now up to 20% lower cost
Performance, quality, and support remain unchanged.
Introducing hallucination correction. We have reduced hallucination by 70%. Giga's hallucination rate is at ~1%. Better than the best frontier models.
Deploy AI your customers can trust.
@GigaAI This is a solid move towards making voice agents more reliable. You need a smarter model to monitor their behavior in realtime.
We’ve also implemented a similar pattern for our restaurant voice ai and has improved behavior by 8x
We just shipped VIVA 2.0 at @krispHQ — launching live at @twilio Signal in SF.
8 years. A trillion minutes of real-world voice. This is what it all built toward.
Voice Infrastructure for Voice AI Agents 🧵
I'll be sharing what we learn building this from here. voice AI in production is genuinely weird, and a lot of what we figure out doesn't exist on the internet yet.
I'm a co-founder at Bytes AI.
we build voice AI that answers the phone for restaurants. takes the order, upsells, sends it to the POS, dispatches delivery.
live in 1000+ restaurants.