@MiaAI_lab I reviewed your Ds-v4 recipe against my own. You’re actually doing better than me! I falsely assumed my config was the most I could get for 2 DGX setup (200k, 5 seqs, 12k batch prefil)
But turns out your recipe does way better (1M, 6 seqs, 8kbatch) for the same 40-50 t/s !!
@0xSero i've been trying to search around for the best DGX spark sg alng recipes..
your reap looked interesting and I seen promising results on dev forum getting this model running native on 2 DGX sparks..
benchmarks would be a huge help to determine recipe and best use case!
@willreil real question: how would you keep the yard green? is it all astro turf? else make the roof entirely electrochromatic glass to control sun light in the neighbourhoods. consistent weather all year round lol
@0xSero I’ve been trying to build an MCP with it.
After long enough chat it can hallucinate bad
It’ll say ready to implement the plan but then never take action. Even at 30% context window usage
Some ppl saying it might be the system prompt that breaks model over time
@r0b0t_sp1der@alexinexxx we live in a world where elon can hire devs on the spot when its go time to lock in talent. pandering for corporate just has no upside when you taste and see what high agency can be
@Daniel_Farinax@StudioZamudio i think Opencode is around the same size for thier system prompt.
intitial prompt or fist chat is the system prompt getting consumed.
sharing it is typically apart of the closed model aspect of frontier models.
grok cli is interesting. i love and hate it so far
@tueks3 the chinese do this with Fio BTR product with many audio codecs support. this module is powerful because it can be adapted for anything! In college i worked on spatial audio, I wonder how easy it could be to add gyroscope and processing unit to enable this device!
@0xSero Docker containers defined alloc of resources but sometimes go over or under the RAM durring runtime.
Would be nice to have for debug and more consistent recipe deployment
@0xSero I like vllm studio.
I tried running it on my Spark recently but it didn’t support container deployment of vllm.
Have you added support since then?
I vibe coded my own patch but still haven’t worked out the bugs with live resource observability.
This man is a legend and this project is truly amazing. He went from zero to selling an assembled pcb on his website and shipping it internationally in a single project. Shipping the thing is as impressive as making it, congrats will.
You guys should buy the thing NOW, it's $1