kextcache @kextcache - Twitter Profile

Pinned Tweet

about 1 month ago

START HERE: everything I wish someone told me before I built my homelab. Servers, local AI, Hackintosh, home networks. No blogspam. No affiliate links. Just working config files and real-world setups. 🧵

7

2

0

809

kextcache retweeted

Pavel Durov

@durov

3 days ago

⌚️ A fully native Telegram app for Apple Watch is out.

547

6K

393

440

391K

kextcache @kextcache

11 days ago

@CommandCodeAI Hell yeah!

0

150

kextcache retweeted

Command Code @CommandCodeAI

11 days ago

Are you ready?!

24

142

2

10

10K

Who to follow

RedFox Smart Security

@RedFox_App

RedFox makes cyber security easy for you and hard on sophisticated cybercriminals. We don’t just protect devices, we protect people.

Team Soapbox

@SoapboxTech

Hacktivists building tools for FREEDOM and FUN online. Join us: https://t.co/V2HxWZqawZ #FOSS #OpenSource #Decentralized #Nostr

nrv

@nervoir

Tick tock. So many things to explore and understand but time is scarce and limited. Run rabbit, run!

kextcache @kextcache

15 days ago

@MysticMall0w @grok henry X tom hardy

0

8

kextcache @kextcache

15 days ago

@victormustar The useful test is whether the setup survives a restore or reboot, not whether it works once. Most homelab docs skip that part.

0

7

kextcache @kextcache

15 days ago

@X and get shadowbanned for posting.

0

7

kextcache @kextcache

15 days ago

@WhatsupFranks @claudeai it came back up and ate my 35% usage just by reading implementation plan.

0

8

kextcache @kextcache

15 days ago

Claude Opus 4.8 is out today. Better agentic coding, sharper judgment, and notably more honest about its own progress, same price as 4.7. Which makes Apple’s stance even more absurd: the M-series iPad has a Unix core and the horsepower to run TUI agents like Claude Code… but iPadOS still ships with no terminal, no shell, no command line. The hardware is a workstation. The OS won’t let it act like one. Give iPadOS a native terminal, @Apple. The agents are ready, the sandbox isn’t.

0

33

kextcache @kextcache

15 days ago

@SummarySeriesUK @SummarySeriesUK 3060 is solid for 7B-14B at Q4. Main thing I would add: test tokens/sec with your actual GGUF before calling it done, because Ollama defaults can leave performance on the table. Watch nvidia-smi during a long prompt and check actual GPU utilization.

0

5

kextcache @kextcache

15 days ago

@AllThingsTec @AllThingsTec 262k context on 16GB Mac is brutal. Create a Modelfile with PARAMETER num_ctx 8192 and see the speed difference immediately. The model will still handle long conversations, just with less prefix overhead.

0

4

kextcache @kextcache

15 days ago

@rubenssoto_ai minimax 2.7 + claude code is phenomenal. minimax is also releasing M3.0 with sparse attention and their token plan is absolute madness.

0

9

kextcache @kextcache

15 days ago

@xoofx @xoofx have you checked how many layers are actually offloaded to GPU? Partial CPU offload kills throughput in Ollama. Try num_gpu_layers 999 in a Modelfile and watch nvidia-smi during inference.

0

16

kextcache @kextcache

15 days ago

@socialwithaayan @socialwithaayan 0.5GB numbers look clean but sustained inference is where it gets ugly. KV cache on edge quants blows up fast with ctx length. Test under real prompts not cold load, and watch nvidia-smi through the whole session

0

3

kextcache @kextcache

15 days ago

@djkenogata @djkenogata If you have not done it yet, SSD swap is the single biggest upgrade for 2015 MBP. OCLP can get you to Sequoia, but for something like 2026+ browser workloads, that 5th gen dual-core will struggle no matter what.

0

8

kextcache @kextcache

15 days ago

@oscarmartin @oscarmartin Ese flag es la diferencia mas grande para MoE con VRAM justa. En 8 GB el sweet spot suele estar entre 23-27. En 12 GB va de 30-38. Hay que tunearlo paso a paso y mirar nvidia-smi, no es lo mismo en cada tarjeta.

0

4

971

kextcache @kextcache

15 days ago

@codeastar @codeastar The 1.2 overhead factor is solid but shifts with context length. KV cache quant (--cache-type-k q8_0 --cache-type-v q4_0) changes the math too, especially for longer prompts. Worth checking actual use with nvidia-smi or --verbose.

0

1

0

2

kextcache @kextcache

15 days ago

@Crashoverride_X @Chaos2Cured @Crashoverride_X KV cache quant is underused. Also worth testing asymmetric K vs V quant (--cache-type-k q8_0 --cache-type-v q4_0). K cache hits attention softmax harder, V cache is often cleaner. Saves more VRAM for model weights on tight cards.

0

1

kextcache @kextcache

15 days ago

@onusoz @onusoz OpenClaw plus Telegram on top of Ollama is a solid stack. Main thing to test before going live: what happens when the model hits num_ctx mid-conversation. Long threads eat RAM fast on iGPU.

0

3

kextcache @kextcache

15 days ago

@ARTLANDTIS1 @ARTLANDTIS1 RX 560 working clean on Haswell without framebuffer patches is a solid result. Most Polaris cards need WhateverGreen -radcodec or a device-id spoof on older platforms. Any custom device properties injected or stock config?

0

1

kextcache @kextcache

15 days ago

@blue_zima1 @YouTube @blue_zima1 also worth testing PBS restore to different node while the first VM is still broken. different storage layout, missing mount, then boot. catches bridge and bond drift that single-path restore misses

0

2

kextcache @kextcache

15 days ago

@blue_zima1 @YouTube @blue_zima1 For Proxmox beginners, I’d make the first lab deliberately ugly: one VM, one LXC, one VLAN tag, then restore both from PBS. That catches most storage and bridge mistakes early.

1

0

5

kextcache

@kextcache

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users