We ran the Morphiq Bench voice-agent benchmark.
The question: when a coding agent is asked to build a voice-agent feature, which provider does it actually choose?
We tested Claude Code and Codex across 1,340 repositories and 3 development intents:
• inbound call
• outbound call
• voice web widget
Top providers selected:
1. @Vapi_AI
2. @livekit
3. @ElevenLabs
4. @retellai
5. @pipecat_ai
6. @usebland
This isn’t a generic “best voice agent company” ranking.
It measures which providers agents are most likely to pick when implementing voice-agent workflows in code.
Would be happy to walk through the methodology or share what drove the results with any of the teams here.
Introducing Morphiq Affinity Bench
The first agent adoption benchmark for developer tools.
For years, developer-tool distribution was built around humans
At @TryMorphiq We ask the next question:
When agents code, what tools do they choose?
Morphiq Affinity Bench is now live.
Everyone at YC hackathon built with @browser_use computer use agents, etc
we built ���𝗵𝗼𝗻𝗲 𝘂𝘀𝗲
guess which one people actually use 8 hours a day?
we fork Browser Use and add support to just using:
'from browser_use import phone_use'
u can install these as an skill on openclaw, claude code etc
thanks to: @AmaruEscalante @CohenAiden99429
Everyone's building browser use. We built phone use for the @browser_use hackathon at YC. All night. No sleep. And it's actually unhinged.
Here's the idea: AI agents that can test apps like real humans. But not generic humans, specific personas. A 21-year-old gym bro testing a fitness app. A 45-year-old accountant testing banking software. Each with different behaviors, tech comfort levels, pain points.
We built a full stack for it.
Parallel testing across multiple devices simultaneously.
Result: structured bug reports with severity ratings, user feedback loops, and actionable insights. Not just "the app broke", but "users like this demographic get stuck here for this reason."
@AndresDevX @CohenAiden99429
Everyone at YC hackathon built with @browser_use computer use agents, etc
we built 𝗽𝗵𝗼𝗻𝗲 𝘂𝘀𝗲
guess which one people actually use 8 hours a day?
we fork Browser Use and add support to just using:
'from browser_use import phone_use'
u can install these as an skill on openclaw, claude code etc
thanks to: @AmaruEscalante @CohenAiden99429
dev twitter used to be:
- react devs
- vue devs
- backend bros
- frontend gang
now it's:
- Setup openclaw
- Orchestrate Swarms of Coding Agents
- Rate Limits on Claude Code
- still manually debugging (lol)
the social graph reorganized around methodology, not stack