Parallax @tryParallax - Twitter Profile

4 days ago

nvidia going all in on local ai. here's our take: it shouldn't depend on which chip you bought. sparks, macs, the 5090 already on your desk, we cluster across all of it and split your favorite model pipeline-parallel so it runs fully private and local.

NVIDIA

@nvidia

4 days ago

NVIDIA RTX Spark: a 1-petaflop superchip, the full CUDA and RTX ecosystem, and Windows-native agents. A new beginning for personal computers.

nvidia's tweet photo. NVIDIA RTX Spark: a 1-petaflop superchip, the full CUDA and RTX ecosystem, and Windows-native agents. A new beginning for personal computers. https://t.co/3OPOCNJBz5

326

5K

466

323

726K

12

87

22

5

10K

Parallax

@tryParallax

about 1 month ago

mine looks better

Gradient @Gradient_HQ

about 1 month ago

Gradient_HQ's tweet photo. https://t.co/bDtBJaSdQr

151

522

47

9

55K

21

86

4

0

3K

tryParallax retweeted

Yuan ./

@yuangao

about 1 month ago

Thrilled to see @tryParallax live in production on @Theta_Network. This is exactly why @Gradient_HQ built Parallax: turning the world’s GPU mesh into a sovereign, distributed token factory. Congrats on the milestone! 🫡

25

356

65

13

41K

Parallax

@tryParallax

about 1 month ago

glad we could help! with the agentic adoption soaring, privacy and token cost are already the top concerns for both agent and human users. that's what parallax's built for.

Theta Network

@Theta_Network

about 2 months ago

To make this work, we adapted Parallax, @Gradient_HQ's distributed inference framework, to run across EdgeCloud's global node network. One API endpoint, model split across many machines, no centralized cluster required.

11

265

33

10

88K

17

206

33

7

24K

Parallax

@tryParallax

about 1 month ago

@Theta_Network @Gradient_HQ 🫡

0

2

0

206

Parallax

@tryParallax

2 months ago

@VitalikButerin buy a GPU, get together a group of friends. don’t carry the world on your own shoulders. we’ve been building this for a while. try parallax for local ai.

1

13

0

2K

Parallax

@tryParallax

2 months ago

@RoundtableSpace 35b model on a macbook with compressed cache is a solid result. local inference keeps getting more accessible and it's fun to watch people push the limits of what consumer hardware can do!

0

4

0

230

Parallax

@tryParallax

2 months ago

@adrgrondin @PrismML 1-bit model running at 40 tok/s on an iphone. mlx is making on-device inference surprisingly usable now.

0

6

0

556

Parallax

@tryParallax

2 months ago

@ollama local llm + mlx is a great combo! apple silicon keeps getting better for local inference and it's nice to see more players in the ecosystem lean into it properly.

1

6

0

189

Parallax

@tryParallax

2 months ago

@tom_doerr single binary, self-hosted, no dependencies. this is the way local ai should ship. less config, more building.

1

6

0

59

Parallax

@tryParallax

2 months ago

@karaage0703 9bから27bへのローカル性能の差がすごい。qwen3.5は今セルフホストするなら最高のモデルの一つ。特に異なるデバイス間でシャーディングするなら。

1

6

0

142

Parallax

@tryParallax

2 months ago

TurboQuant tackles one bottleneck: KV cache memory. there's another one that matters just as much in distributed setups: communication latency between nodes. we built Decentralized Speculative Decoding (DSD) to turn that idle network wait time into useful computation, 2.56x speedup on HumanEval, no retraining needed. combine cache compression with latency compression and local inference starts looking very different. https://t.co/AF0wpqXIhd

6

20

3

1

565

Parallax

@tryParallax

2 months ago

hf-mount solves the storage side: any model, mounted locally like a drive. the next piece is actually running those models across whatever hardware you have. that's what parallax does: schedule inference across a pool of heterogeneous GPUs so the model doesn't just live on your machine, it runs there too. mount + serve, fully local.

0

1

0

135

Parallax

@tryParallax

2 months ago

@oprydai you don't need to go into debt though. a couple of mac minis or an nvidia card can already run serious models locally. parallax lets you connect whatever hardware you have into one cluster. start small, add devices as you go. the whole point is using what's already on your desk.

0

4

0

59

Parallax

@tryParallax

2 months ago

@openclaw solid release. deepseek provider plugin + qwen pay-as-you-go opens up a lot of new local setups. parallax users running openclaw stacks should have a smoother time with this one.

0

6

0

2K

Parallax

@tryParallax

2 months ago

@wolfejosh the ceiling for on-device keeps moving. a year ago people argued you couldn't run anything useful locally. now it's 400B on a phone. parallax already supports mixed hardware clusters — apple silicon, nvidia, whatever you've got. the trend is clear.

0

4

0

182

Parallax

@tryParallax

2 months ago

the $3,469 single-night burn is a good reminder of what you're actually signing up for with cloud inference. when the meter's always running, one stuck agent is a bill. parallax runs models on your own machines. no token meter, no overnight surprises.

Ziwen

@ziwenxu_

2 months ago

https://t.co/mL6G7N23C9

14

298

27

1K

240K

5

40

8

2

2K

Parallax

@tryParallax

3 months ago

@JustinLin610 🙋‍♂️and we are here to help

0

2

0

485

Parallax

@tryParallax

3 months ago

local ai has picked up fast since openclaw dropped. with the latest wave of small capable models, more people are running serious workloads on their own hardware. if you missed this good local ai tutorial from @yacinelearning or want a refresher on how distributed scheduling actually works under the hood, it's worth the rewatch over the weekend!

Yacine Mahdid

@yacinelearning

4 months ago

I am continuing my adventure into distributed AI system with the parallax scheduling strat from @Gradient_HQ in this 37min tutorial I go through: - heuristic used to make scheduling tractable - dynamic programming formulation - filling GPU with water - shoving them into shelves

yacinelearning's tweet photo. I am continuing my adventure into distributed AI system with the parallax scheduling strat from @Gradient_HQ

in this 37min tutorial I go through:
- heuristic used to make scheduling tractable
- dynamic programming formulation
- filling GPU with water
- shoving them into shelves https://t.co/UkzMN6Ci7x

11

265

19

188

24K

14

121

15

8

16K

Parallax

@tryParallax

Last Seen Users on Sotwe

Trends for you

Most Popular Users