Zero Day Security helps organizations identify, understand, and reduce cyber risk through security assessments, vulnerability management, AI-driven automation
THIS DEVELOPER CONNECTED 8 NVIDIA DGX SPARKS INTO ONE CLUSTER - AND RAN AN 800GB MODEL THAT MADE HIM 10X MORE PRODUCTIVE
21:47 he says it straight - "this is a terabyte of VRAM - we ran Quen 3.5, 800GB on disk, a model that doesn't even fit on a single Mac Studio - 24 tokens per second - I'd say that's a win"
8 Sparks connected through a $1,300 switch via RDMA over Ethernet - each node adding 128GB of memory into one unified pool of 1TB
started with one Spark at 3 tokens per second - every added node doubled the speed - and eight together deliver 24 tokens on a model that physically cannot run anywhere else
Kimi K2 at 600GB loaded in 15 minutes, 115GB per node, 13 tokens per second - a model that simply cannot run on anything smaller
Claude helped configure the entire cluster - SSH mesh across all 8 machines, network config, jumbo frames, QSFP port speeds - all from one terminal
most people rent cloud compute for models this size at $2,000+/month - he built the cluster once and now every token costs 20x less
The Trump admin, via DNI Gabbard, informed us that Covid was man-made, and that rogue elements within the USGov are responsible for the Covid pandemic and Orwellian censorship/control of media.
So now what?
Heads MUST roll. There is no other way.
When Nuremberg 2.0?
@Dea_rMen Thanks for flagging this.
We reviewed your account and found that you are reposting content word-for-word from other accounts multiple times a day.
Your account has now been demonetized.
Let me know if you have any further questions.