William Chen @solidwillity - Twitter Profile

Pinned Tweet

William Chen

@solidwillity

about 2 months ago

I love inference! Tap in🧑🏻‍💻

Scale at GMI

@scale_at_gmi

about 2 months ago

Welcoming @solidwillity to SCALE! 🚀 His startup, Touchdown Labs, makes AI inference optimization simple and accessible for resource-constrained organizations. Can't wait to see what you build! #SCALEatGMI #GMICloud #Startup

scale_at_gmi's tweet photo. Welcoming @solidwillity to SCALE! 🚀

His startup, Touchdown Labs, makes AI inference optimization simple and accessible for resource-constrained organizations.

Can't wait to see what you build!
#SCALEatGMI #GMICloud #Startup https://t.co/nusNZcLOmR

0

2

0

1K

1

8

0

1K

William Chen

@solidwillity

2 days ago

Jensen needs to start a food review TikTok I swear

0

68

solidwillity retweeted

Lex Fridman

@lexfridman

3 days ago

I got to spend all day today with Jensen in Taiwan: talking with thousands of engineers and eating street food at a night market. Jensen is received as a rockstar in Taiwan, like it's Beatles in the 60's. It's mind-blowing and fun to watch. But most importantly, through all the interactions and all my conversations with him, he remained the same humble, kind, thoughtful, funny guy he always was, even as a kid who went to these same night markets many years ago. Btw, we tried a crazy amount of different street food. It's legit some of the most delicious food I've ever had. I can't wait to share video of it, including a ton of our conversations and hangout. When I can pause for a moment from all the travel to edit the video, I'll post it. Can't wait to continue talking to Jensen and engineers at Computex this week, and exploring more of Taiwan, and of course roaming the night markets for some more delicious street food. Days like these, even more than usual, I feel like the luckiest kid in the world. Love you all! ❤️

lexfridman's tweet photo. I got to spend all day today with Jensen in Taiwan: talking with thousands of engineers and eating street food at a night market. Jensen is received as a rockstar in Taiwan, like it's Beatles in the 60's. It's mind-blowing and fun to watch. But most importantly, through all the interactions and all my conversations with him, he remained the same humble, kind, thoughtful, funny guy he always was, even as a kid who went to these same night markets many years ago.

Btw, we tried a crazy amount of different street food. It's legit some of the most delicious food I've ever had. I can't wait to share video of it, including a ton of our conversations and hangout. When I can pause for a moment from all the travel to edit the video, I'll post it.

Can't wait to continue talking to Jensen and engineers at Computex this week, and exploring more of Taiwan, and of course roaming the night markets for some more delicious street food.

Days like these, even more than usual, I feel like the luckiest kid in the world.

Love you all! ❤️

1K

27K

1K

2K

1M

William Chen

@solidwillity

3 days ago

First thing you see when you land in Taipei

0

29

Who to follow

Mohammad Farid Gulabzai

MR. GIWA (YOU’VE GOT MONEY💰 NOT AN EXTRA LIFE)

@mrdelegiwa

🔥I OVERCAME ATYCHIPHOBIA🔥

William Chen

@solidwillity

3 days ago

Everything is falling into place

Perplexity

@perplexity_ai

3 days ago

Today we're announcing that hybrid agentic inference is coming to Perplexity Computer. Computer can split tasks between a local model running on your machine and frontier models in the cloud. This keeps private data on your device and maximizes token efficiency. Coming soon.

145

2K

200

735

328K

0

82

William Chen

@solidwillity

4 days ago

Insane

Nous Research

@NousResearch

4 days ago

We have worked with @nvidia to integrate their official Agent Skills catalog into the Hermes Skills Hub. These skills teach your agent how to use CUDA-X libraries, Omniverse and Physical AI workflows, NeMo training and inference tools, and other platform components.

NousResearch's tweet photo. We have worked with @nvidia to integrate their official Agent Skills catalog into the Hermes Skills Hub.

These skills teach your agent how to use CUDA-X libraries, Omniverse and Physical AI workflows, NeMo training and inference tools, and other platform components. https://t.co/Ryp50dHqOL

93

2K

157

421

817K

0

73

William Chen

@solidwillity

4 days ago

@nicoleegong @nvidia @gmi_cloud @scale_at_gmi @lillian_ma_

0

1

0

33

William Chen

@solidwillity

4 days ago

@yuqih @lillian_ma_ @NVIDIAGTC @nicoleegong reporting for duty 🫡

0

8

William Chen

@solidwillity

8 days ago

@MilksandMatcha @tyler_fong_ congrats!!

0

1

0

69

William Chen

@solidwillity

9 days ago

@HaochengXiUCB Let's gooo

0

42

William Chen

@solidwillity

10 days ago

@lqiao @ivanleomk @FireworksAI_HQ Congrats!

0

1

0

172

William Chen

@solidwillity

10 days ago

@Shaughnessy119 Bet

0

58

solidwillity retweeted

Hedgie

@HedgieMarkets

14 days ago

🦔Fortune published a piece this afternoon connecting Microsoft and Uber's AI cost overruns to token economics, with a headline that lands hard: "Microsoft reports are exposing AI's real cost problem: Using the tech is more expensive than paying human employees." Underneath those headlines, the unit economics tell the story. OpenAI is projected to lose $14 billion in 2026, spending roughly $2 for every dollar of revenue it brings in. Anthropic is in a similar position with break-even not projected until 2028. GPU rental prices for Nvidia's newest Blackwell chips jumped 48% in just two months. OpenAI's response was to close a $122 billion private funding round at an $852 billion valuation, the largest in history. My Take The token pricing story is really an IPO timing story. OpenAI, Anthropic, and xAI all need to go public in the next 18 to 24 months because the private market cannot keep absorbing burn rates like these indefinitely. Public markets do not accept "we will figure it out" as a line item on an S-1, they require disclosed unit economics with a credible path to profitability and a date attached. That deadline is why the price increases are happening now rather than next year. The labs need to show declining loss curves before the filings hit, and that means enterprise customers have to start covering more of the actual cost regardless of whether the productivity math holds on their end. Every token bought over the last two years was effectively subsidized below cost by venture capital and hyperscaler cross-subsidies, and that subsidy has a hard deadline. Uber publicly admitted burning through its entire 2026 AI budget in four months, and CFOs at major enterprises are starting to flag the same pressure. The labs cannot keep losing $2 per dollar of revenue once they file public statements, so the cost transfer to customers accelerates from here. For investors, the question is not whether these companies are valuable. They clearly are. The question is who absorbs the difference between what enterprises can budget and what the models actually consume between now and 2028, and right now the answer is the hyperscalers funding the buildout. That is why I have been watching Microsoft and Amazon capex commentary more closely than the lab announcements themselves. Hedgie🤗 Link: https://t.co/S2oIgUSijV

HedgieMarkets's tweet photo. 🦔Fortune published a piece this afternoon connecting Microsoft and Uber's AI cost overruns to token economics, with a headline that lands hard: "Microsoft reports are exposing AI's real cost problem: Using the tech is more expensive than paying human employees." Underneath those headlines, the unit economics tell the story. OpenAI is projected to lose $14 billion in 2026, spending roughly $2 for every dollar of revenue it brings in. Anthropic is in a similar position with break-even not projected until 2028. GPU rental prices for Nvidia's newest Blackwell chips jumped 48% in just two months. OpenAI's response was to close a $122 billion private funding round at an $852 billion valuation, the largest in history.

My Take
The token pricing story is really an IPO timing story. OpenAI, Anthropic, and xAI all need to go public in the next 18 to 24 months because the private market cannot keep absorbing burn rates like these indefinitely. Public markets do not accept "we will figure it out" as a line item on an S-1, they require disclosed unit economics with a credible path to profitability and a date attached. That deadline is why the price increases are happening now rather than next year. The labs need to show declining loss curves before the filings hit, and that means enterprise customers have to start covering more of the actual cost regardless of whether the productivity math holds on their end.

Every token bought over the last two years was effectively subsidized below cost by venture capital and hyperscaler cross-subsidies, and that subsidy has a hard deadline. Uber publicly admitted burning through its entire 2026 AI budget in four months, and CFOs at major enterprises are starting to flag the same pressure. The labs cannot keep losing $2 per dollar of revenue once they file public statements, so the cost transfer to customers accelerates from here. For investors, the question is not whether these companies are valuable. They clearly are. The question is who absorbs the difference between what enterprises can budget and what the models actually consume between now and 2028, and right now the answer is the hyperscalers funding the buildout. That is why I have been watching Microsoft and Amazon capex commentary more closely than the lab announcements themselves.

Hedgie🤗

Link: https://t.co/S2oIgUSijV

75

1K

315

579

110K

William Chen

@solidwillity

13 days ago

@_doubleAI_ @nvidia Fire

0

197

William Chen

@solidwillity

13 days ago

@GT_HaoKang Love to chat!

0

1

0

76

William Chen

@solidwillity

13 days ago

@zhzHNN @BanghuaZ 好东西

1

0

124

William Chen

@solidwillity

13 days ago

@LijieyYang Jealous!

1

0

25

William Chen

@solidwillity

14 days ago

@VivianCaiIAm @namdao2000 @engelsbergc @quanmhuynh @Jeffjinlin @NickelReady 是

1

0

129

William Chen

@solidwillity

14 days ago

@hamzaelshafie Diving in this weekend

0

1

0

99

William Chen

@solidwillity

15 days ago

Who's going computex?

0

1

0

84

solidwillity retweeted

Maksym Andriushchenko

@maksym_andr

16 days ago

💥Today we release InferenceBench, our next benchmark after PostTrainBench that measures progress on AI R&D automation. AI R&D automation will very likely unfold gradually, starting from “boring” tasks like inference speed optimization that are very easily verifiable (accuracy + inference time). We show a rather negative result for current frontier agents. They are not good at system-level engineering and managing complex dependencies. They do show non-trivial performance, but they fail compared to a simple baseline: hyperparameter tuning of vLLM/SGLang hyperparameters. Importantly, InferenceBench tests *open-ended* inference optimization capabilities. This is different from more narrow benchmarks like KernelBench that only let agents optimize kernels (which is a very valuable task, too!). The benchmark is intentionally open-ended, so the poor performance of the agents is not an underelicitation issue. The agents have everything needed to succeed, but they still fail because they are not yet reliable enough for this task. Our results suggest an inverse scaling phenomenon: Claude Sonnet 4.6 and GLM-5 rank highly because they more often preserve simple, valid, high-performing final servers, while several larger models show stronger peak runs but lose utility through brittle final-state choices. This contrasts with benchmarks where rankings track raw capability (e.g., SWE-Bench, Terminal-Bench, PostTrainBench, FrontierSWE). One of the primary bottlenecks we have clearly observed is the lack of diversity of strategies: nearly all agents just use vLLM, without exploring alternatives. Overall, proper exploration is lacking: the current agents are not ready to tackle broad enough goals and get stuck after the first found solution (such as vLLM). I’m sure future agents will do much better, but here is where we are now. This benchmark is our 2nd one in a suite of benchmarks that will track the progress on AI R&D automation. We will develop many more benchmarks that will cover different aspects of AI R&D automation, culminating in recursive self-improvement. Stay tuned!

maksym_andr's tweet photo. 💥Today we release InferenceBench, our next benchmark after PostTrainBench that measures progress on AI R&D automation.

AI R&D automation will very likely unfold gradually, starting from “boring” tasks like inference speed optimization that are very easily verifiable (accuracy + inference time). We show a rather negative result for current frontier agents. They are not good at system-level engineering and managing complex dependencies. They do show non-trivial performance, but they fail compared to a simple baseline: hyperparameter tuning of vLLM/SGLang hyperparameters.

Importantly, InferenceBench tests *open-ended* inference optimization capabilities. This is different from more narrow benchmarks like KernelBench that only let agents optimize kernels (which is a very valuable task, too!). The benchmark is intentionally open-ended, so the poor performance of the agents is not an underelicitation issue. The agents have everything needed to succeed, but they still fail because they are not yet reliable enough for this task.

Our results suggest an inverse scaling phenomenon: Claude Sonnet 4.6 and GLM-5 rank highly because they more often preserve simple, valid, high-performing final servers, while several larger models show stronger peak runs but lose utility through brittle final-state choices. This contrasts with benchmarks where rankings track raw capability (e.g., SWE-Bench, Terminal-Bench, PostTrainBench, FrontierSWE).

One of the primary bottlenecks we have clearly observed is the lack of diversity of strategies: nearly all agents just use vLLM, without exploring alternatives. Overall, proper exploration is lacking: the current agents are not ready to tackle broad enough goals and get stuck after the first found solution (such as vLLM). I’m sure future agents will do much better, but here is where we are now.

This benchmark is our 2nd one in a suite of benchmarks that will track the progress on AI R&D automation. We will develop many more benchmarks that will cover different aspects of AI R&D automation, culminating in recursive self-improvement. Stay tuned!

12

347

48

212

42K

William Chen

@solidwillity

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users