uRun

4 days ago

"Real-time" in marketing materials usually means "we batched it and it returns in 8 seconds." Real-time means under 300ms. That's the threshold UI designers use to decide when to show a loading spinner. There's a big difference. We build for the second one at @urunml.

0

1

0

40

5 days ago

A few seats left for our happy hour @ CVPR on Thursday. Come hang out and say hi. Jun 4 @ 6:00 PM in Denver. RSVP → https://t.co/AkELseZsOp

0

68

urunml retweeted

11 days ago

Most providers route every request to a different accelerator. Mostly fine. Until you need a stateful loop. Drift across long workflows is real. At @urunml you're pinned to the same GPU on the same machine for the whole session. Stateful by design.

0

4

1

0

204

22 days ago

heading to @MLSysConf 2026 next week in Bellevue. if you're working on inference, real-time systems, or ML infra, come say hi. always up to trade notes on what "production" actually looks like for stateful, interactive AI workloads.

0

2

0

52

urunml retweeted

23 days ago

Every modality in AI follows the same arc: single-shot expensive generations to multi-turn cheap interactive loops. Text and image already went through it. Video is next.

1

4

1

0

81

24 days ago

Thank you to the early ones. 🙏 More to come. #WhatCanuRun

1

4

1

0

74

urunml retweeted

25 days ago

Two years ago, the open problem was getting an AI video model to produce a coherent 5-second clip. Recent techniques like Long Live and self-forcing solved that piece. The new bottleneck is serving it interactively. Labs are chasing the next model. The infra layer underneath is wide open.

1

7

1

0

116

urunml retweeted

30 days ago

Real-time interactive video is the hardest workload there is. Every frame has to land inside the 300ms human-perception bar. That's why we're starting there with @urunml. The rest is downhill.

1

7

2

1

569

about 1 month ago

Who we most want building on uRun: creative tooling companies and the studios behind tomorrow's video games. They'll go places we can't imagine → https://t.co/kbL1maA4T8 #AIvideo #GameDev #VFX

13

17

0

280

urunml retweeted

about 1 month ago

@OpenAI and @Anthropic both charge ~2.5x for "fast mode." The most underrated pricing signal in AI right now.

1

3

1

0

88

about 1 month ago

some snapshots of our launch party @ Joey the Cat in SF last week. skee-ball, open bar, and real-time AI video on every screen. thank you to everyone who came out and pushed the demos somewhere great and weird. #WhatCanuRun → https://t.co/kbL1maA4T8

4

7

0

339

urunml retweeted

about 1 month ago

Introducing the founding team with three unique angles on the same problem. Keegan ran inference at Luma during the Dream Machine launch. Sean wrote the O'Reilly book on Docker and has our GPU orchestration dialed in. Matt was running low-latency edge inference at AWS in 2017 (back when "real-time AI" meant the cameras at Amazon Go). We built uRun for the infrastructure bottleneck no one else is solving. https://t.co/LHSjGRD8Un #AIvideo #FounderStory #realtimeAI #VideoInfra #GenerativeAI

13

17

1

0

308

about 1 month ago

https://t.co/kbL1maA4T8 launch party - Wednesday, April 29 · 6PM: 🕹️ Arcade games 🍹 Open bar 💻 Live demos 🥽 Meta Quest Giveaway Spots are limited - click the link to grab your invite. 👉 https://t.co/Hx5owyyoMX

3

6

0

183

urunml retweeted

about 2 months ago

The model moat is shrinking fast. Kimi K2.6 just beat GPT-5.4 and Claude Opus 4.6 on SWE-Bench Pro. But the story isn't the benchmarks - it's the execution layer: → 300 parallel agents → 13 hours autonomous coding → 4,000+ tool calls in one run It's no longer intelligence per token. It's tokens per second. Source: https://t.co/DbrvNIHSMy #claude #moonshot #OpenSource

0

4

1

0

237

urunml retweeted

about 2 months ago

Reminder 🚨 We're going live on Twitch TODAY at 2pm PT. Come hang and bring your questions 👇 https://t.co/JYTShUg45P #Inference #Infrastructure #twitch

0

3

1

0

242

about 2 months ago

@faizan10114 @keeganmccallum3 Keep an eye out in the coming weeks. Very exciting things on the horizon!

0

1

0

7