Patrick Doyle

@Pat_D_Doyle

Finance for NeoClouds and Semiconductors Opinions expressed are solely my own and do not express the views or opinions of my employer

New York

Joined March 2025

37 Following

32 Followers

51 Posts

Patrick Doyle

@Pat_D_Doyle

about 20 hours ago

@jukan05 i don’t think there’s any precedent of a leading edge foundry packaging another leading edge foundry’s silicon e.g. typically TSMC only packaged their own silicon

Pat_D_Doyle retweeted

Shanu Mathew

@ShanuMathew93

2 days ago

More bullish read. Dylan Patel dropped some aggressive AI compute numbers on the Sequoia pod: +20 GW deployed in 2026, net of delays +30 GW forecast for 2027, also net of delays -OpenAI + Anthropic alone at 100+ GW combined by 2030 -Compute moving into the terawatt range by 2040 He never said whether the 20/30 GW figures are US or global, or whether they're IT load, facility load, AI-only, or broader data center capacity. Sounded more like global additions to me. I wouldn't treat it as a clean US number but it's in the range of possibilities assuming more aggressive conversions/timelines. Same problem with the 100 GW forecast. I don't read that as OpenAI and Anthropic owning 100 GW of self-built campuses. Neither company owns gigawatt-scale generation today. Most of their committed capacity sits inside hyperscaler and neocloud contracts, GPUs, TPUs, Trainium, third-party deals. Stack "100 GW for OpenAI + Anthropic" on top of hyperscaler/neocloud buildout numbers and you're double counting. Better read is compute available to the labs, not a separate owned supply stack. Still, the ramp is nuts if it lands! From Dylan's prior comments on pods, OpenAI + Anthropic were sitting around 3.5-4 GW combined exiting 2025, heading toward 10 GW exiting 2026 and 20 GW exiting 2027. Get to 100 GW by 2030 and that's: >~96 GW added from YE25 to YE30, ~19 GW/year >~80 GW added from YE27 to YE30, ~27 GW/year For scale: -100 GW × $50B/GW = ~$5T -Global GDP 2025: ~$117T. $5T is ~4.3% of it -US GDP 2025: ~$30.8T. $5T is ~16.2% of it -100 GW is roughly Texas-scale power capacity Given I don't have specifics, the more interesting part is token economics to me vs the GW headline. If frontier labs can monetize scarce tokens at very high gross margins, they can rationally overbid for compute. Dylan cited Anthropic as net-income profitable ex-stock comp in Q2, possibly profitable including stock comp by Q3, with frontier API token gross margins above 80% before blended-channel effects kick in. That changes the clearing price for compute. At those margins, doubling the compute cost still leaves attractive gross margins and higher absolute gross profit if capacity is the bottleneck. Renting more GPUs/TPUs/Trainium doesn't scale headcount 1:1 either. Also explains the rental dispersion he cited: -Trainium under $10B/GW/year -GPUs historically ~$12-13B/GW/year -SpaceX-Google GPU deal ~$25B/GW/year annualized (ST spot deal given compute constraints) Same story at the data center layer --> Power-only colo has moved from ~$60/kW-month historically to $120-160/kW-month, with a wider spread depending on grid reliability, counterparty credit, location, time-to-power. Comes back down to we can all guess how many GW get built but the real question is whether model-driven demand and token monetization keep outrunning physical supply, capital, and grid execution.

369

525

106K

Patrick Doyle

@Pat_D_Doyle

1 day ago

@bubbleboi @bubbleboi go meet your wife at the Corgi Cafe 🐕

298

Pat_D_Doyle retweeted

Ivan Zhao

@ivanhzhao

6 months ago

https://t.co/fxHYR6S29D

376

13K

22K

Pat_D_Doyle retweeted

arjun

@ArjChi

1 day ago

we are hiring for many engineering roles electrical, mechanical, SWE, MTS, TPM, and more apply below or dm me. if you don't see a role that fits you exactly, still dm me. we make roles for exceptional talent. https://t.co/UVGMRXtfak

Patrick Doyle

@Pat_D_Doyle

1 day ago

in finance but wanting to build something beyond pitch decks and management presentations? Come build the future of AI at Fluidstack! If you've been wanting to make the jump but not sure if you have the skill set, we are looking for the following backgrounds: Capital markets, project finance, real estate, power and utilities, FP&A, and accounting Apply below or dm me. If you don't see a role that fits you exactly, still dm me. We make roles for exceptional talent. We are hiring people that want to get hands on and create the scale AI needs to revolutionize work https://t.co/d6Ya6qZhAo

273

Pat_D_Doyle retweeted

Rohan Paul

@rohanpaul_ai

2 days ago

Mira Murati's Thinking Machines made Bridgewater’s private expert judgment trainable, beating frontier models with 29.8% fewer errors. With naive prompts, all tested models sit around coin-flip accuracy, roughly 46% to 50%. Expert prompts lift them sharply, reaching about 74% to 78% average accuracy. The workflow was filtering finance articles, reports, central-bank documents, and emails to decide what investors should read. This is a serious signal for enterprise AI, that bringing private judgment in the loop beats general intelligence. The problem was not reading finance documents, because frontier LLMs can already read them. The harder task was deciding which facts deserve attention inside an investor’s workflow. A tariff headline can move markets, while another geopolitical headline may add no signal. The breakthrough came from replacing written rules with high-quality labels from expert investors. Non-expert labels failed because the task depends on taste, not surface financial language. Bridgewater cleaned those labels by sending model-disputed cases back to experts for review. The model then learned patterns that experts could recognize, but could not fully verbalize. Training used interleaved batches, CISPO loss, and on-policy distillation from stronger teacher checkpoints. Interleaving helped the model share judgment across tasks without blending them into noise. CISPO controlled policy updates, so learning stayed aggressive without drifting into brittle shortcuts. (CISPO is a new reinforcement-learning loss that caps how strongly each generated token can update the model, improving training stability while keeping useful rare tokens active. It was initially proposed by MiniMax team in 2025) On-policy distillation penalized moves away from better teachers, then promoted stronger checkpoints. The result beat the best frontier model, with 29.8% fewer mistakes and 13.8x lower inference cost.

rohanpaul_ai's tweet photo. Mira Murati's Thinking Machines made Bridgewater’s private expert judgment trainable, beating frontier models with 29.8% fewer errors.

With naive prompts, all tested models sit around coin-flip accuracy, roughly 46% to 50%.
Expert prompts lift them sharply, reaching about 74% to 78% average accuracy.

The workflow was filtering finance articles, reports, central-bank documents, and emails to decide what investors should read.

This is a serious signal for enterprise AI, that bringing private judgment in the loop beats general intelligence.

The problem was not reading finance documents, because frontier LLMs can already read them.

The harder task was deciding which facts deserve attention inside an investor’s workflow.

A tariff headline can move markets, while another geopolitical headline may add no signal.

The breakthrough came from replacing written rules with high-quality labels from expert investors.

Non-expert labels failed because the task depends on taste, not surface financial language.

Bridgewater cleaned those labels by sending model-disputed cases back to experts for review.

The model then learned patterns that experts could recognize, but could not fully verbalize.

Training used interleaved batches, CISPO loss, and on-policy distillation from stronger teacher checkpoints.

Interleaving helped the model share judgment across tasks without blending them into noise.

CISPO controlled policy updates, so learning stayed aggressive without drifting into brittle shortcuts.

(CISPO is a new reinforcement-learning loss that caps how strongly each generated token can update the model, improving training stability while keeping useful rare tokens active. It was initially proposed by MiniMax team in 2025)

On-policy distillation penalized moves away from better teachers, then promoted stronger checkpoints.

The result beat the best frontier model, with 29.8% fewer mistakes and 13.8x lower inference cost.

292

221

48K

Pat_D_Doyle retweeted

tae kim

@firstadopter

2 days ago

An Interview with Arm CEO Rene Haas: AI CPU Demand is 'Off the Charts' https://t.co/Rp86RN0YJ1

159

78K

Patrick Doyle

@Pat_D_Doyle

2 days ago

@bubbleboi @bubbleboi Also any of the pure-play companies outside of the supply chain play are 1) typically private, 2) likely to be acquired by AMD/NVDA/INTC 3) hard to diligence due to different tech approaches

Patrick Doyle

@Pat_D_Doyle

2 days ago

I've seen people on X talk about how this is a signal that demand is not as unconstrained as analysts make the market out to be - but I believe this is a backwards read Building a NeoCloud, especially a GPUaaS one, is a lot like baking a cake where you need all of the ingredients to come together at once: 1. A viable site with power 2. A customer to offtake the capacity or a backstop agreement to provide contractual support 3. Financing for said equipment 4. Equipment delivery dates that meets customer and site timelines Financing is the ingredient people underrate. Lenders want a collateral base, and they treat these offtake contracts as something they can securitize (think SaaS ARR-based lending). No contracted demand, no financing Arrangements like this enable NeoClouds to meet insatiable demand by accessing more financing - this article speaks to that supply-side challenge, not demand weakness https://t.co/EBiG2MpFOT

360

Pat_D_Doyle retweeted

Serenity

@aleabitoreddit

3 days ago

Wells Fargo: $META intent to sell excess compute is a positive signal around underlying demand and unit economics of AI. “Despite this shift, we don’t expect a pullback in Meta’s capex or that overall compute needs are lower” Regarding Neoclouds: WF thinks it validated the massive AI infra opportunity as well as acquisition opportunities. Despite any potential competition for Neoclouds. I’m inclined to agree with Wells Fargo here and say markets completely misunderstood Meta’s excess compute comment.

aleabitoreddit's tweet photo. Wells Fargo: $META intent to sell excess compute is a positive signal around underlying demand and unit economics of AI.

“Despite this shift, we don’t expect a pullback in Meta’s capex or that overall compute needs are lower”

Regarding Neoclouds: WF thinks it validated the massive AI infra opportunity as well as acquisition opportunities. Despite any potential competition for Neoclouds.

I’m inclined to agree with Wells Fargo here and say markets completely misunderstood Meta’s excess compute comment.

193

147

260

419K

Pat_D_Doyle retweeted

shoshin

@zen_tropy

3 days ago

$nbis, $meta, $crwv, $orcl Mstanley out tonight w/ clarifying comments re: bb report. Tldr: If meta does sell compute, it will be as a bare-metal offering of spare internal 1P capacity (not 3rd party-leased) to serve as an "eps bridge" while they develop their core products. What's key is ms doesn't believe meta can or wants to compete as a full-service stack, since their models are limited & they don't have the expertise, software, or salespeople to service specialized, high-touch enterprise inference markets (precisely the markets $nbis, for instance, seeks to serve). MS suggests meta is not contractually-allowed to resell any of their 3P leased raw silicon from nbis, crwv, orcl, etc., Believes they will save their cutting-edge contracted capacity (e.g., the nebius 12B Vera Rubin order), for internal use. Net/net: If meta does enter the compute market it will be as a stopgap, and in the bare-metal, older chip market. A minor supply addition, at best; least threatening, arguably, to nebius of all the neo's; and all this only assuming a backdrop where the compute supply were to materially loosen.

zen_tropy's tweet photo. $nbis, $meta, $crwv, $orcl Mstanley out tonight w/ clarifying comments re: bb report. Tldr: If meta does sell compute, it will be as a bare-metal offering of spare internal 1P capacity (not 3rd party-leased) to serve as an "eps bridge" while they develop their core products.

What's key is ms doesn't believe meta can or wants to compete as a full-service stack, since their models are limited & they don't have the expertise, software, or salespeople to service specialized, high-touch enterprise inference markets (precisely the markets $nbis, for instance, seeks to serve). MS suggests meta is not contractually-allowed to resell any of their 3P leased raw silicon from nbis, crwv, orcl, etc., Believes they will save their cutting-edge contracted capacity (e.g., the nebius 12B Vera Rubin order), for internal use.

Net/net: If meta does enter the compute market it will be as a stopgap, and in the bare-metal, older chip market. A minor supply addition, at best; least threatening, arguably, to nebius of all the neo's; and all this only assuming a backdrop where the compute supply were to materially loosen.

455

289

146K

Patrick Doyle

@Pat_D_Doyle

3 days ago

@TheAlexCoppess @doooyle @TheAlexCoppess This town ain't big enough for the two of us dirt dan

Patrick Doyle

@Pat_D_Doyle

3 days ago

A data point on the supply-demand balance for AI, users across percentiles continue to see their usage scale over time - demand follows the scalability of SaaS, while supply is constrained to hardware/infrastructure limits https://t.co/wejaNpiN1Q

Pat_D_Doyle's tweet photo. A data point on the supply-demand balance for AI, users across percentiles continue to see their usage scale over time - demand follows the scalability of SaaS, while supply is constrained to hardware/infrastructure limits https://t.co/wejaNpiN1Q https://t.co/cwy0kR9F9z

Patrick Doyle

@Pat_D_Doyle

3 days ago

@dee_bosa this speaks exactly to the scarcity of compute and highlights 2 things: 1. this is an allocation issue. Firms like SpaceX and Meta have frontier models that are underutilized, while anthropic continues to see revenue ramp with enterprise adoption (SemiAnalysis). Reallocating capacity used to support these in-house models is key to enable those that need capacity the most 2. selling in-house silicon capacity to outside parties allows you to get better economics when using your own silicon. The costs paid to DSPs (Broadcom, Mediatek, etc.) get scaled across more silicon, making the economics of running an in-house silicon program even more appearling

275

Pat_D_Doyle retweeted

Ben Pouladian

@benitoz

4 days ago

Etched came out of stealth at $800M and by lunch X had NVIDIA in the ground We do this every few months. A chip launches, the deck says killer, the timeline holds a funeral, and NVIDIA closes green anyway Etched hardwires the transformer into silicon. That is where the speed comes from, nearly the whole die on one job instead of the ~30% a GPU uses. It is also the trap. The day that chip tapes out is the best it will ever be. You cannot patch it. You burned progress into a wafer and now pray the field stops moving NVIDIA made the opposite bet. Same board, faster every quarter in software. Dynamo is pulling more tokens per watt out of the same rack, on version 1.0 The depreciation risk the bears aimed at NVIDIA for two years does not live at NVIDIA. It lives here, on the chip built to bury it Etched is not a fraud. It is a niche tool priced like a general one, and $800M is not enough to run a frontier supply chain. The rest get bought on the next down cycle Bury the lead, not the leader. Full case with @JackFarley96 and @maxwiethe on @MTSlive And special thanks for @baseten for the cool T Shirt! Chapters 00:00 Switching from bonds to semis 00:33 What Etched actually is 01:31 Faster and cheaper, but how much HBM 02:56 Maturing market, not an NVIDIA killer 05:01 $1B in contracts and a Taiwan factory 05:20 Why these startups all get absorbed 06:48 Tiered inference and the obsolescence trap 09:39 Etched vs TPUs and Trainium 12:17 Is the CUDA moat weakening 13:21 Co-design, squeezing every token per watt 14:37 NVIDIA is a software company that sells a chip 14:58 Who is NVIDIA's most dangerous competitor 16:23 The NVIDIA killers, ranked 18:19 A rich man's game 18:43 AMD's MI500 vs Rubin Ultra 20:19 The neocloud business decision 22:46 Lightning round, Rambus the toll on HBM 25:11 The CXL run-up on Astera, Marvell, Credo 26:22 Use AI less, go to the booth 28:10 EDA is not dead

149

135

30K

Pat_D_Doyle retweeted

arjun

@ArjChi

4 days ago

We are hiring: - Talent Acquisition - Electrical Engineers - Mechanical Engineers - Prod Engineers - MTS - SWE - People Ops Dm me or apply https://t.co/c8der2CHJQ

137

20K

Patrick Doyle

@Pat_D_Doyle

4 days ago

If modular construction is waiting on a scalable application to take off, data centers are an interesting angle to see how this process can really take off. Compared to traditional housing, data centers tend to leverage a defined basis of design and then drop it at various sites. If players like Crusoe, GIGA, etc., can learn the difficult lessons of modular manufacturing and then deploy at scale, we may see the benefits heralded for housing in the data center space, where time to market and cost are at even more of a premium https://t.co/5rkvN2OhoS

106

Pat_D_Doyle retweeted

Ricky Ho

@rickyho_1989

5 days ago

Hey everyone, I have received permission from my employer to publish this research. It is a lengthy investment thesis on what I believe will be the next chapter of AI. After months of research, we are turning bullish on the hyperscalers and explain why we believe the market is underestimating where the economics of AI are ultimately heading. The rest is covered in the X article below. I hope you enjoy reading it, and I look forward to hearing your thoughts and challenges.

170

401K

Pat_D_Doyle retweeted

Pythia Cap: Partially Conductive

@PythiaR

6 days ago

You are asking when Google will cut capex and Google is limiting their biggest competitors use of GCP because they don’t have enough compute.

387

66K

Patrick Doyle

@Pat_D_Doyle

Last Seen Users on Sotwe

Trends for you

Most Popular Users