Ben Carr

Verified account

@benatanam

building realtime avatars @Anam__ai

London, England

Joined November 2024

82 Following

754 Followers

431 Posts

Pinned Tweet

4 months ago

Introducing cara-3, the fastest real-time avatar model on the market. Cara model delivers unmatched realism with sub-180ms response times, setting a new industry standard. 70% of users prefer video over voice. Every pixel is generated in real time, unlocking natural eye movement, micro-expressions, and emotional subtlety so each conversation feels real. Comment "CARA" for 500 free credits.

500

1K

135

1K

406K

8 days ago

@snewmanpv how many people are training with fp8, nevermind fp4?

0

0

0

0

391

21 days ago

it appears our latest model has a hidden jack nicholson dimension 🔪

benatanam's tweet photo. it appears our latest model has a hidden jack nicholson dimension 🔪 https://t.co/BAeemfpZgu

0

1

1

0

64

benatanam retweeted

Harry Coultas Blum

21 days ago

Open source Jarvis that runs on a single GPU Today we're releasing the vui stack. A local voice agent that you can chat with in real time, with tools and can run claude to do more complex tasks. Inside this stack is the new vui nano model, a 300M TTS model that can render audio in reply to what you've said and supports a variety of non speech sounds. vui nano speaks with you, not at you. The stack can run on as little as 6GB of vram. Voice cloning supported with prompts of up to 5 minutes. The longer the better. A voice for your openclaw with our v1/realtime endpoint. I have developed this on my own so would love to get the communities feedback and help improving it. Please retweet this so that everyone knows they can have their own private Jarvis

5

32

12

15

2K

21 days ago

@harrycblum Sounding great. Nice work.

0

1

0

0

96

about 1 month ago

@harrycblum Nice

0

1

0

0

86

about 1 month ago

.@getstream_io x @anam__AI is live: add interactive avatars using the @visionagents_ai framework Read more: https://t.co/0cDeezSNcc

0

6

0

1

170

about 1 month ago

Anam is now integrated with Stream’s Vision Agents 🤙 Stream gives you the realtime multimodal agent framework: calls, state, orchestration, audio/video pipeline. Anam now gives the agent a live avatar in the call. This setup opens the door for "scene switching". The avatar starts on a neutral background, then changes based on the conversation: * ask for a recipe → kitchen * ask about weather → studio * next user turn → back to neutral It’s a relatively small thing technically: intercept the Anam video frames, chroma-key the green screen, and swap in a background based on tool calls / transcript callbacks. But it changes the feel a lot. The agent isn’t just talking over video, its environment can react too. Thanks to the Stream team for leading on the integration! docs: https://t.co/sOkjkeYmvL cc @visionagents_ai @neevash @Anam__ai

1

9

1

3

314

benatanam retweeted

2 months ago

Is autoresearch really better than classic hyperparameter tuning? We did experiments comparing Optuna & autoresearch. Autoresearch converges faster, is more cost-efficient, and even generalizes better: 🧵(1/6)

zhengyaojiang's tweet photo. Is autoresearch really better than classic hyperparameter tuning?

We did experiments comparing Optuna & autoresearch.
Autoresearch converges faster, is more cost-efficient, and even generalizes better: 🧵(1/6) https://t.co/UZvUUchQK1

26

1K

115

1K

136K

benatanam retweeted

3 months ago

Huge fans @lennysan and @bcherny at Anam. Enjoyed Boris's recent episode on Lenny's Podcast on the future of coding with AI so much we put together a demo of adding an Anam face to Claude Code.

1

4

3

1

278

3 months ago

Small change in the stack, big change in the 15% of sessions that needed it most.

0

0

0

0

48

3 months ago

~15% of users hit unstable connections during interactive avatar sessions. Most never told us. The session just quietly got worse. We shipped adaptive bitrate. Every Anam session now adjusts to network conditions in real time.

1

6

2

1

190

3 months ago

This is table-stakes infrastructure for real-time platforms like Agora and LiveKit. Now it runs on every session. The difference: conversations that used to stutter or freeze now stay smooth. Sessions run longer. Users don't bounce.

1

0

0

0

64

3 months ago

@Cloudflare @incident_io @attio @ElevenLabsDevs ^

0

0

0

0

51

3 months ago

Anam is part of the relaunched AI Startup Pack by Fin. We're in good company, alongside @ElevenLabs, @Cloudflare, @incident_io, @Attio, and more. Build with interactive avatars that respond in real time, look realistic, and deploy via API. No upfront cost for 7 months.

benatanam's tweet photo. Anam is part of the relaunched AI Startup Pack by Fin.

We're in good company, alongside @ElevenLabs, @Cloudflare, @incident_io, @Attio, and more.

Build with interactive avatars that respond in real time, look realistic, and deploy via API. No upfront cost for 7 months. https://t.co/89C6kqGnAi

2

13

2

2

362

3 months ago

@Cloudflare @incident_io @attio Check it out here: https://t.co/2W5kwoGT21

0

0

0

0

47

3 months ago

pip install pipecat-anam Repo and working example: https://t.co/zaBqDobkuH Thanks @kwindla and team @pipecat_ai

0

0

0

0

46

3 months ago

Our pipecat contribution just got merged. Anam is now listed as an official community video integration in the pipecat ecosystem. In case you don't know, pipecat is Daily's open-source framework for building real-time voice agents. 10.5k GitHub stars, used by NVIDIA, Mercor, Descript. We built a video service that takes TTS audio from the pipeline, streams it to Anam over WebRTC, and returns a synchronized interactive avatar face in real time. The avatar speaks, reacts, handles interrupts natively.

1

5

0

0

168

4 months ago

@TheFuturist2045 already possible with our mcp : ) https://t.co/Akk3VZeDW8

0

0

0

0

15

4 months ago

@danielbigham which LLM were you using? we offer a few, perhaps we should switch our default...

0

0

0

0

2

Last Seen Users on Sotwe

Trends for you

Most Popular Users