@_space_punk_ Your page says bay area? Park at safeway and use the bathroom there. Try to get a job at a ski resort. Lots of them have housing and will work with you. They tend to have summer jobs too
1/3 The message
2/3 The numbers
3/3 The names
1/3;
A small cluster of companies is quietly inserting itself into the most important layer of modern civilisation: the layer between citizens and knowledge, between institutions and decisions, between governments and operational reality.
Control no longer needs to look like censorship.
It can look like classification, ranking, routing, access management, model policy, safety review, compliance architecture and enterprise risk mitigation.
Information does not need to disappear when it can be delayed, filtered, deprioritised, reframed, priced, monitored or made technically inaccessible to everyone outside the approved institutional class.
The more plausible danger is cold, clean and far more bureaucratic: a privately owned control layer through which research, intelligence, public administration, defence, finance, healthcare, media and political decision making increasingly pass.
Once that layer becomes infrastructure, power changes form.
It no longer needs to announce itself. It becomes embedded in dashboards, APIs, audit logs, permission systems, safety policies and subscription tiers.
It becomes the invisible architecture through which reality is processed before ordinary people are even allowed to see it.
The new ruling class will not need crowns, uniforms or speeches.
It will sell access, call restriction protection, call dependency innovation and tell the public that every locked door was built for their safety.
Holy shit π€―
Microsoft just open-sourced a framework that runs a 100B parameter LLM on a single CPU.
No GPU.
No cloud.
No expensive setup.
Just your laptop.
Itβs called BitNet.
And it breaks one of the biggest assumptions in AI.
Hereβs the trick:
Most LLMs use 16-bit or 32-bit floats.
BitNet uses:
1.58 bits.
Yes⦠bits.
Weights are just:
-1, 0, +1
Thatβs it.
No heavy matrix math.
Just simple integer operations your CPU already handles efficiently.
The result is insane:
β’ 100B model runs on CPU at 5β7 tokens/sec
β’ 2β6Γ faster than llama.cpp on x86
β’ 82% less energy usage
β’ 1β5Γ faster on ARM (MacBooks)
β’ 16β32Γ lower memory
The craziest part?
Accuracy barely drops.
Their flagship model (trained on 4 trillion tokens) performs competitively with full-precision models.
They didnβt break the model.
They removed the waste.
What this unlocks:
β Run LLMs fully offline
β AI on phones, edge devices, IoT
β No API costs for inference
β Works even without reliable internet
MacBook.
Linux.
Windows.
It just runs.
27K+ GitHub stars.
Built by Microsoft Research.
100% open source.
This might be the moment AI stops being cloud-firstβ¦
and becomes device-first.
@MastrXYZ Wow. Dead Internet Theory sounded plausible to me when I first heard it. Looks like the theory may be coming true as we speak. @SumOneHueMan check this out
Turns out the dead internet theory is starting to look less like a conspiracy...
According to Cloudflare, bots are now responsible for around 57% of HTML traffic on the web.
Humans are down to 43%. Across all HTTP requests, humans are still ahead with 66%, but the direction is obvious.
The web is no longer mainly a place where people visit websites. It is becoming a machine layer where crawlers, scrapers, scripts and AI agents collect, rank, rewrite and distribute information before most humans ever see it.
That should worry everyone, especially in Web3.
Once bots dominate the information layer, wealthy actors, exchanges, funds, corporations and political groups do not need to convince people directly anymore.
They only need to influence what the machines see, repeat, rank and amplify.
With enough money, accounts, APIs, fake engagement, paid narratives and automated distribution, truth becomes easier to bury and opinion becomes easier to manufacture.
And this space is already full of people who will sell a narrative for a bag.
You can use an app on your phone called Off-Grid to connect to local ollama servers allowing you to use your desktop's GPU for inference while chatting from your phone. It supports RAG, inference and voice. This completely bypasses the cloud and datacenters π