Junru Shao @junrushao - Twitter Profile

1 day ago

Higgs Audio v3 TTS is here. Built for voice AI that speaks, not just reads:
• 100 languages with single-digit WER/CER
• inline control over emotion, style, prosody, and sound effects
• API, Workspace, and open weights
• Blog 👉 https://t.co/C8frDlfO5D
Watch the demo 👇

13

342

55

327

35K

Junru Shao

@junrushao

4 days ago

gitlab is way worse than github :((

0

2

0

549

junrushao retweeted

tender

@tenderizzation

8 days ago

we have become weaker as a species

12

823

52

175

68K

junrushao retweeted

Yixin Dong @yi_xin_dong

14 days ago

🚀 The wait is over! Today at #MLSys, we'll give a talk to reveal the final results and present the awards for the FlashInfer AI GPU Competition! 🏆
I'll also introduce FlashInfer-Bench: an agent-oriented Benchmark Engine designed for production kernels.
Join us from 11:00 AM - 1:00 PM PT to see who takes the crown and learn more. Everyone is welcome to attend. See you there! ✨
🌐 Competition & Results: https://t.co/GS21eemEZv
💻 FlashInfer-Bench Benchmark Engine: https://t.co/rlzNUXJq5e
#FlashInfer #MLSys26 #AI #GPU

1

95

15

31

26K

Junru Shao

@junrushao

19 days ago

@tenderizzation I'd say Wow Tea in South Bay is very much like 不要对我尖叫 (Don't Yell At Me)

0

1

0

174

junrushao retweeted

Edward Z. Yang @ezyang

24 days ago

A thread about the history and internal implementation details of activation checkpointing APIs in PyTorch. 🧵

6

250

29

221

19K

Junru Shao

@junrushao

29 days ago

@tenderizzation where does this name come from 😂

0

1

0

223

Junru Shao

@junrushao

29 days ago

good stuff 🦀

Jared Roesch

@roeschinc

29 days ago

We open-sourced some amazing work on an experimental Rust compiler for GPU from my colleagues at @nvidia. It takes a slightly different approach to expose GPU programming concepts natively in Rust. Check it out https://t.co/xR4Ho2LUMR.

12

569

94

285

36K

0

2

0

659

Junru Shao

@junrushao

29 days ago

once-in-a-lifetime opportunity to join Yuchen’s team!

Yuchen Jin

@Yuchenj_UW

29 days ago

An OpenAI friend told me he burns 300M GPT-5.5 tokens/day. The top one in his team burns billions of tokens/day. Codex coding for them every night. Databricks also gives engineers unlimited tokens.
We're looking for cracked inference engineers to join us at Databricks AI to produce trillions of tokens, insanely fast. DM me if you have:
- Contributed to open-source ML systems like SGLang/vLLM/PyTorch
- Experience serving LLMs at large scale
Databricks AI runs like a startup. Lots of exciting things to build!

98

1K

52

326

214K

1

18

0

2

5K

junrushao retweeted

Lequn Chen @abcdabcd987

about 1 month ago

#MLSys2026 is happening in two weeks! Our AI Infra team at @perplexity_ai is throwing a happy hour event at Bellevue on May 19. Come chat with us about inference, post-training, RL, kernels, GPUs, RDMA, agents, anything... https://t.co/mXf7laYt1s

1

13

3

1K

junrushao retweeted

Yixin Dong @yi_xin_dong

about 1 month ago

Introducing XGrammar-2: structured generation for complex agent harnesses.
Strict tool-calling formats. Built-in DeepSeek-V4 and Qwen-3.6 support. Up to 80x speedup over XGrammar. Ready-to-use integrations with vLLM, SGLang, TensorRT-LLM, and more! ⚡
From Claude Code to OpenClaw, agents are defining more complex harnesses. XGrammar-2 ensures LLMs always interact with them in the right way.
Built in collaboration with DeepSeek, Databricks, and leading frontier AI labs to bring XGrammar-2 into latest models and products.
🧩 Structural Tag: one unified abstraction to describe any format your agent needs
🚀 Scales to 500+ strictly typed tools for complex agent harnesses
🌐 Native APIs in Python, C++, Rust, and JS, running everywhere from cloud to edge
🛠️ Integrated with vLLM, SGLang, TensorRT-LLM, and more
Excited to see what agent builders create with it!
Blog: https://t.co/N0Tbl588BH
GitHub: https://t.co/lo4yScuI2f

8

149

53

73

42K

Junru Shao

@junrushao

about 1 month ago

@tenderizzation famous last loving message

0

1

0

115

junrushao retweeted

Tian Xia @tian_xia_

about 1 month ago

Excited to share that I’ll be presenting SkyWalker at #EuroSys26 in Edinburgh tomorrow!🚀 We ask: Can we reduce the cost of multi-region LLM serving by cross-region offloading, without losing the benefits of KV-cache locality? Talk: April 29, afternoon track A, ~16:20-16:40📍

1

2

1

0

386

Junru Shao

@junrushao

about 1 month ago

TIL: Python’s set doesn’t maintain insertion order, unlike dict, which does 😅

0

3

0

267
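
A minimal sketch of the set-versus-dict note above, assuming Python 3.7 or later, where dict insertion order is a language guarantee. The `words` list and variable names are illustrative only. It shows `dict.fromkeys` as one common way to de-duplicate while keeping first-seen order, and sorts the set before printing because set iteration order should not be relied on.

```python
# Illustrative data; the names here are hypothetical, not from the original post.
words = ["pear", "apple", "fig", "apple", "pear"]

# dict keeps keys in insertion order, so this de-duplicates
# while preserving the order in which items first appeared.
ordered_unique = list(dict.fromkeys(words))
print(ordered_unique)

# set makes no ordering promise; for strings the iteration order
# can even change between interpreter runs (hash randomization).
unordered_unique = set(words)

# Sort explicitly whenever a deterministic order is needed.
print(sorted(unordered_unique))
```

If order matters and duplicates must go, the dict-based approach is usually the simpler choice; a set is still the right tool for fast membership tests.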

Junru Shao

@junrushao

about 1 month ago

@shibainu_vodka my eyes 🙃

0

1

0

39

junrushao retweeted

difficultyang @difficultyang

about 1 month ago

The DeepSeek proposals section is always fun lol

1

96

5

19

7K

Junru Shao

@junrushao

about 1 month ago

TileLang is DeepSeek’s kernel maxxxing DSL on both Nvidia and Huawei’s Ascend cards

0

61

2

22

3K

Junru Shao

@junrushao

about 2 months ago

@Yuchenj_UW @alighodsi @pwendell @matei_zaharia so excited to see what you guys are gonna build!

1

0

2K

junrushao retweeted

Underfox @Underfox3

about 2 months ago

This paper presents Event Tensor, an abstraction designed to simplify the compilation and execution of dynamic megakernels, providing first-class support for both shape and data-dependent dynamism. https://t.co/x399Y9onle

1

114

18

107

11K

Junru Shao

@junrushao

about 2 months ago

@m0d8ye Would a Costco membership card work? 🥺

0

2

0

101
