Yeah, good call: the Dockerfile alone wasn't enough. Just pushed a single-file
bootstrap script that goes from zero to serving on dual Spark (TP=2):
curl -fsSLO https://t.co/tMEgvkQa1d
chmod +x bootstrap_dsv4_spark.sh
./bootstrap_dsv4_spark.sh --head-host spark-a --worker-host spark-b
Idempotent. It handles the SSH check, model download, QSFP /30 setup, image build
(eugr/spark-vllm-docker scaffold + our DSV4 Dockerfile + patch), scp distribution
to the worker, and launch on both nodes, then waits for /health to return 200.
~30-50 min on the first run, ~7 min on re-runs with --skip-build. Verified the URL
works (HTTP 200, syntax clean).
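For anyone curious what the "waits for /health=200" step amounts to, here is a rough sketch of that kind of polling loop. It is not lifted from the bootstrap script: the function name wait_for_health, the attempt count, and the interval are all made up for illustration, and port 8000 in the usage comment is an assumption (vLLM's usual default), not something confirmed for this setup.

```shell
# Hypothetical helper, not taken from bootstrap_dsv4_spark.sh.
# Polls a URL until it answers HTTP 200 or the attempts run out.
# Usage: wait_for_health URL [ATTEMPTS] [INTERVAL_SECONDS]
wait_for_health() {
  url="$1"
  attempts="${2:-120}"
  interval="${3:-5}"
  i=0
  while [ "$i" -lt "$attempts" ]; do
    # -s: silent, -o /dev/null: discard the body,
    # -w '%{http_code}': print only the status code,
    # --max-time 5: do not let one hung request stall the loop.
    code="$(curl -s -o /dev/null -w '%{http_code}' --max-time 5 "$url" || true)"
    if [ "$code" = "200" ]; then
      echo "healthy: $url"
      return 0
    fi
    i=$((i + 1))
    sleep "$interval"
  done
  echo "gave up on $url after $attempts attempts (last status: ${code:-none})" >&2
  return 1
}

# Example call (host name from the command above, port assumed):
# wait_for_health http://spark-a:8000/health 120 5
```

Counting attempts instead of wall-clock time keeps the loop portable across plain sh and bash; a real script may well track elapsed seconds instead.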
Script: https://t.co/Wn5HGJ7syc
Quickstart: https://t.co/iv03vUmV6L
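If you'd rather look before you run, a quick parse-only check on the downloaded file is cheap. This is just a sketch: check_script is a hypothetical helper name, and it only relies on bash -n, which parses a script and reports syntax errors without executing anything.

```shell
# Hypothetical helper: syntax-check a downloaded shell script without running it.
# Usage: check_script FILE
check_script() {
  script="$1"
  if [ ! -f "$script" ]; then
    echo "missing: $script" >&2
    return 2
  fi
  # bash -n reads and parses the file but executes none of its commands.
  if bash -n "$script"; then
    echo "syntax ok: $script"
  else
    echo "syntax errors in: $script" >&2
    return 1
  fi
}

# Example call on the file fetched above:
# check_script bootstrap_dsv4_spark.sh
```

A clean parse says nothing about what the script does, so it is still worth reading through before running it against both nodes.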
Note for anyone tracking the upstream PR: vllm-project/vllm#40991 was closed
today and replaced by #41834 ("[New Model][Nvidia] Add SM12x support for
DeepSeek V4 Flash with essential fixes"). New PR targets branch
codex/ds4-sm120-min-enable. Our build is on jasl/vllm@ds4-sm120-experimental
which still works for SM12x DSV4 today.
@Powercommitment @Sumanth_077 @firecrawl sudo crawl Bulbapedia for all Japanese sets and cards by number and create me an app to analyze my cards from an image
We're not free until we're all free.
We're not The Sunshine State until that light shines on all.
We don't stop till that happens.
Let's get to work, Florida.
#BansOffOurBodies #RoeVWade #DeSantisDestroysFlorida