Built an Uncensored / Abliterated version of Gemma 4 12B that actually scored HIGHER on OpenAI Human Eval CODING over the base official 12B model.
Currently only in BF16 but NVFP4 Quantization coming in hot. Will evaluate the quantized model to see if we get similar results.
https://t.co/6TjStamWvu
@microcenter Thanks for the reply! I took them back the next day and swapped out for new ones that did in fact have the cover in them! Very easy and handled well by staff!
@nvidiacc Two of the four sparks I purchased were missing the magnetic covers. This seems like a QA issue at the factory. Online chat said to return them to @microcenter.
Can't you just mail me a couple $5 covers? I'm trying to expand this cluster!
@aijoey Im currently running alot of benchmarks using this llama.cpp forked for the spark:
https://t.co/yZD9T6nJaD
(i created two issues in the repo to get it working)
That plus "Nemotron 120B Q4_K_M" is looking real interesting atm...Not done yet, but here's a random chart!
@SpaceTimeViking I have two Sparks atm, may buy more if I can justify them. I'm working on putting together a benchmark suite cobbled together from various people's repos... I'll be sure to post some results once I have them.
@SpaceTimeViking
As a new Spark owner, I appreciate all your hard work!
You seem to be using NVFP4 on your github releases for Spark. Have you tried any other variants lately?
@aijoey Have you tested fp4 and non-nvfp4 variants?
I’m running a highly optimized 122B model, and the most shocking thing I learned on the forums is the sm121 arch sucks at nvfp4. YMMV.
Check out the Qwen Intel autoround models, shocking performance increases.
For my friends who are still using UV and might be a little weary about recent compromises to PyPi packages, stick this in your pyproject.toml.
You can let all of those pip users find and report the compromises...
Turns out with claude code, my decades long strategy of NOT deeply learning:
- regexs
- sql
- nginx confs
- elaborate shell commands
- advanced shell scripting
- any javascript framework
- perf optimization
- webpack, cdns, bundlers
- 1000 other things
...was entirely correct.