Justin Uberti @juberti - Twitter Profile

Pinned Tweet

29 days ago

New post on the OpenAI eng blog from two engineers on our Realtime AI team here in Seattle, outlining how we designed our v2 realtime infra and how we've optimized it for easy scalability and low latency. Check it out! https://t.co/MuGbCumHAY

2

35

3

8

4K

juberti retweeted

Farza 🇵🇰🇺🇸

@FarzaTV

5 days ago

Watch me control my computer with just my voice. This is the future of operating systems. No hands. GPT-Realtime 2.0 is very, very underrated. Demo:

925

14K

823

8K

4M

Justin Uberti

@juberti

13 days ago

If you’re using gpt-realtime-whisper as the ASR model in a cascade pipeline, you can try this as an end-of-utterance signal.

Justin Uberti

@juberti

14 days ago

@xanderberkein there's nothing exactly like that right now, although punctuation + timeout on deltas would probably work quite well

1

0

2

4K

3

17

0

11

4K

Justin Uberti

@juberti

14 days ago

@xanderberkein there's nothing exactly like that right now, although punctuation + timeout on deltas would probably work quite well

1

0

2

4K

Who to follow

Pion

@_pion

The Open Source, Cross Platform Stack for RTC. Pure Go implementations of WebRTC, TURN, DTLS and more. https://t.co/2C44MIUcsi

Tsahi Levent-Levi

@tsahil

Consulting ⭐ Training ⭐ Marketing outreach ⭐ Expert in WebRTC and Programmable Communications

Philipp Hancke

@HCornflower

rtc@meta. Still cheering for @jitsinews and @NVIDIAGFN. Opinions are my own and I have plenty of them.

Justin Uberti

@juberti

28 days ago

Guess who's back, back again. Whisper, but now with realtime streaming. Check out the new gpt-realtime-whisper transcription model in my https://t.co/b2UTuSxhOI demo.

juberti's tweet photo. Guess who's back, back again. Whisper, but now with realtime streaming. Check out the new gpt-realtime-whisper transcription model in my https://t.co/b2UTuSxhOI demo. https://t.co/ONusKRSRD2

6

43

7

9

5K

Justin Uberti

@juberti

16 days ago

@xanderberkein The internal VAD should be much more robust than the previous server VAD. Or am I misunderstanding your question?

1

0

171

juberti retweeted

Richard Sutton

@RichardSSutton

17 days ago

The bitter lesson in 26 words: Don’t be distracted by human knowledge, as AI has been historically. Instead focus on methods for creating knowledge that scale with computation, like search and learning.

136

7K

973

3K

571K

Justin Uberti

@juberti

17 days ago

@xanderberkein This model has internal VAD, no need to set turn_detection.

1

0

165

Justin Uberti

@juberti

21 days ago

@rown congrats on the research preview! Looks like a promising direction 😁

2

18

0

2K

Justin Uberti

@juberti

22 days ago

@syllogistic LOL, the future is hard to predict. Nice demo here, would love to learn more about how it works.

2

1

0

160

juberti retweeted

Newcomer

@NewcomerMedia

27 days ago

.@sama posted this week that he's "pretty excited for voice models to get great." But when will that happen? What @OpenAI's @juberti told @jameswilsterman at Cerebral Valley Voice:

0

3

1

2

1K

Justin Uberti

@juberti

27 days ago

@yi_ding there was an issue if you specified `null` for `turn_detection` which led to a bad state. I removed `turn_detection` altogether and the problem went away.

1

0

43

Justin Uberti

@juberti

27 days ago

@yi_ding this is a good question, we are debugging on the server side now

1

0

42

Justin Uberti

@juberti

27 days ago

cool realtime UX demo

Levin Stanley

@levinstanley

28 days ago

Using @OpenAI gpt-realtime-2 to get a glimpse of future voice-first experiences. A market dashboard you don’t click through. You direct it. Say, “Focus on Apple,” and the whole interface changes. Ask, “How did it do over the last 30 days?” and the chart updates. Say, “Go back,” and the market view returns. No menus. No filters. No hunting around. Just intent. What makes this model especially interesting is the interaction loop: you can interrupt it, add more context, change direction, and it keeps reasoning in real time while updating the experience around you. The interface doesn’t ask you to navigate. It just takes you there.

23

615

37

577

98K

0

12

0

5

2K

Justin Uberti

@juberti

27 days ago

@levinstanley @FonsoWealthy @OpenAI try speed=1.2

1

3

0

53

juberti retweeted

Justin Uberti

@juberti

about 2 months ago

We're looking for a creative iOS engineer to join our realtime AI team here at OpenAI Seattle to help build the future of human-AI interaction. If you know WebRTC, AVFoundation, and/or Core Audio and like open-ended challenges, apply at https://t.co/Yjvc5kuUPR or just DM!

8

273

23

123

42K

Justin Uberti

@juberti

27 days ago

Incredible. No notes.

Daniel Green @dgrreen

27 days ago

The Sam Altman and @miramurati texts from the day he got fired from @OpenAI in 2023 just became evidence in the @elonmusk v. @sama trial. It felt like a meaningful moment in AI history, so I turned it into a musical. The lyrics are the texts.

107

2K

199

897

382K

1

3

0

1

2K

Justin Uberti

@juberti

27 days ago

@tsahil and @ArinSime discuss our recent blog post on delivering low-latency voice AI at scale. Quote: "I don't recall up until this post [...] any company or vendor that took the time [...] to say technically why they made the decisions that they made." https://t.co/SceG8bjTUu

1

11

0

1

482

Justin Uberti

@juberti

27 days ago

@yi_ding there's no playout audio so there shouldn't be any echo cancellation. Your fix is forcing input audio to be committed, but that should already be happening, so I'm confused about what's going on here.

1

0

41