NVIDIA charges you $19.99 a month to stream games you already own.
And starting January 2026, they cap you at 100 hours.
One engineer from New Zealand built the free version with no cap.
It is called Steam Headless. 3,177 stars on GitHub. GPL-2.0.
Built by Josh Sunnex. 225 commits. The next contributor has 16. He has done more work than everyone else combined.
It is a Docker container that turns any spare PC, server, or NAS into your own personal cloud gaming machine.
Install Steam inside it. Mount your games folder. Open a browser on your phone, your laptop, your tablet, your TV.
Your games are right there. Streaming. From your own hardware. To anywhere in the world.
It supports NVIDIA, AMD, and Intel GPUs.
It streams over Moonlight, Steam Link, or straight to a web browser.
It runs Proton so Windows games work on Linux.
It installs Heroic, Lutris, and EmuDeck with one click for your non-Steam games.
It runs on Debian Trixie, Unraid, Ubuntu Server, or Docker Compose.
Last update: April 20, 2026. Still maintained. Still by one man from New Zealand.
Now compare the math.
GeForce NOW Ultimate: $19.99 a month. $239.88 a year. Forever. Capped at 100 hours per month. Run out? Pay $5.99 for another 15 hours.
Xbox Game Pass Ultimate: $22.99 a month. $275.88 a year. Forever. You stream Microsoft's games on Microsoft's hardware on Microsoft's terms.
Steam Headless: $0. Forever. Your hardware. Your games. Your network. No hour cap. No queue. No throttle.
Buy a used GPU once. Run this container. Stream your entire Steam library to any device on the planet.
That is the entire pitch.
But DO NOT install it. We should all keep paying NVIDIA and Microsoft to play the games we already bought.
100% Open Source. (Link in the comments)
Former pro wrestler and X-Men/Deadpool & Wolverine star Tyler Mane has revealed he has breast cancer
"I start chemo today. One in 750 men will be diagnosed with breast cancer in their lifetime, and I'm one of them."
(via Tyler Mane FB | https://t.co/c23Cc0CDIX)
I'm looking for a pocket-sized multimeter.
Pokit Pro needs a phone. I need something standalone.
Considering Sanwa pm7a (or Hioki 3244)
Any other recommendations?
Tom Kenny will officially return to voice Spyro the Dragon in Spyro A Realm Beyond
Releasing Spring 2027 on Playstation 5, Xbox Series S, Nintendo Switch 2 and Steam
Breaking News:
For the first time ever, Mister Rogers’ Neighborhood has officially launched a YouTube channel.
Full episodes. Classic Clips. 100% wholesome.
💗
🐾 Pet2Dial — Turn your #M5Stack#Dial into a physical Codex pet + task remote 🔄
Made by zzliu-coder, Pet2Dial brings the Codex desktop pet onto #M5StackDial v1.1 and turns the rotary knob into a tactile browser for waiting, failed, review, and running tasks — with tap-to-open conversation control over BLE 📡
🔹 Converts the selected Codex custom pet atlas into compact animated firmware assets
🔹 Uses the Dial’s round touch screen + rotary encoder for pet view and task card browsing
🔹 Syncs Codex task states over Bluetooth Low Energy
🔹 Tap a task card to jump straight to the matching Codex conversation on Mac 💻
🔹 Part desk companion, part physical AI workflow controller 🤖
👉 Project link: https://t.co/viK02k12GC
#M5Stack #ESP32 #BLE #Codex #DeskPet #IoT #M5StackInnovationContest
@Google created an ecosystem within my life that I didn't realize because it was almost second nature. I would share youtube videos with myself or look up gift ideas. Can't do the last one anymore. Because its just going to keep only showing me a show I watched in college.
YouTuber Kane Parsons' movie Backrooms has become A24's highest-grossing movie ever
It has reached a domestic total of $97.7 million, surpassing Marty Supreme, which previously held the title with $96 million
This Chinese developer linked two $2,999 NVIDIA DGX Sparks into one box and runs the full Qwen3-235B at home, after dropping his $1,999-a-month cloud bill to zero.
He wired 2 small boxes into a single computer, split a giant 235-billion-parameter model in half between them, and serves it across his own network at about 10 tokens a second, with no internet, no cloud, right there on the desk.
No data center, no thousand-dollar graphics cards, no monthly cloud bill.
Just him, 2 gold boxes the size of a sandwich, one cable between them, and 1 power strip.
And here is the whole payoff.
He used to pay the cloud $1,999 a month for the same model, and the meter ticked on every request. Now he paid $5,998 once for 2 boxes, they covered their cost in 3 months, and after that he sends as many requests as he wants for free, only electricity.
The two Sparks talk over one fast cable, each holds 128GB of memory, and together they carry the whole model, about 73GB loaded per box, with the chip inside pinned near the limit at 96%.
Both boxes work as one and keep trading data over the cable, with no cloud in the loop and no single word leaking out. The ready model sits on one local address, and any app on his network calls it as easily as ChatGPT.
And here is how he described, in plain words, what this pair of boxes does:
"this is a pair of boxes that holds the huge Qwen3-235B model and serves it to one network. the model is split in half, and each box owns its half.
parts:
// Box 1 (holds the first half of the model and starts the answer fast, the first word appears in under a second)
// Box 2 (holds the second half and writes out the rest, about 10 tokens a second)
// Cable (connects the 2 boxes and moves data between them on every step, with no lag)
// Address (one local address where any app sends its request, like to a cloud model)
// Test (a script that runs big prompts through and measures speed and delays)
// Monitor (checks temperature, power draw, and load on both boxes every 2 seconds).
the model never goes to the cloud. he only steps in when a box runs hotter than 80 degrees or the cable between them starts dropping data."
So the system knows exactly what it is, what it is for, and where its limits are.
It knows it has to hold the whole huge model across 2 boxes on its own.
It knows it has to answer every request locally, with no meter, no limits, and no internet.
It knows the human is only needed when a box overheats or the link between them stalls.
→ The setup runs around the clock on 2 boxes, each pulling under 60 watts
→ However many requests he sends, the monthly bill is $0, only electricity
→ The first box starts the answer in under a second
→ The second writes text at about 10 tokens a second
→ One request at a time: 838 tokens in 85 seconds, first word in 0.8s
→ Two requests at once: 697 tokens in 108 seconds, first word in 0.7s
→ Both boxes sit at 96% load and warm up to 76-78 degrees
And only when a chip in a box runs hotter than 80 degrees or the cable between the 2 Sparks drops data does the system call the owner.
And when he himself is out on a run or in a coffee shop, he still reaches his own model at home from his phone: sends a big prompt to the local Qwen3-235B, gets the full answer back in under a minute and a half, with no token meter ticking and no limit to hit.
Here is what the test shows on his screen during one of the night runs:
"one request at a time: 838 tokens in 84.9 seconds, first word in 0.8s, then 0.1s per token."
"two requests at once: 697 tokens in 107.6 seconds, first word in 0.7s, then 0.15s per token."
"Box 1: chip at 96% load, 76 degrees, 56 watts, 73GB used in memory."
"Box 2: chip at 96% load, 78 degrees, 56 watts, the Qwen3-235B model fully loaded."
And while everyone around is paying for AI by the month and bumping into limits, his top-tier model just sits on the desk and works as much as he wants: his own little power plant instead of a forever meter.
He has no server rack of his own and no cloud account behind it.
Just 2 DGX Spark boxes on a desk, one model split in half between them, one local address, and a folder of prompts next to it.
Out of everything I have seen this year, this is the cleanest way to stop paying for AI: $5,998 of hardware on the desk once, $0 a month to the cloud, unlimited forever, and between them 2 gold boxes, 1 cable, and the full Qwen3-235B answering at home with no internet.