DusterBloom @dusterbloom - Twitter Profile

10 days ago

The founder of Ethereum is using Lucebox on his RTX 5090 at x2 speed of llama.cpp with Qwen dense. Many more speed ups on the way! 🏎️

13

143

14

52

37K

dusterbloom retweeted

Cycles

@cyclesmoney

11 days ago

This is a great write-up from @HighStakes_CH. We’re bringing that clearing on-chain so that businesses everywhere can reduce settlement costs and liquidity requirements.

2

31

5

2

4K

DusterBloom @dusterbloom

11 days ago

Lots of pain in delivering but it was worth it. More certain than ever that Local AI is the way to go. Big shoutout to @davideciffa @pupposandro for the work on @lucebox LFG

Lucebox

@luceboxai

11 days ago

Great work from @dusterbloom 🔥

4

11

2

1

5K

0

3

0

114

dusterbloom retweeted

mrciffa

@davideciffa

24 days ago

Luce PFlash now run @poolsideai’s Laguna-XS.2 (33B-A3B MoE) on a single RTX 3090. - 111 tok/s decode @ short ctx - 128K TTFT in 15.91s, 5.4x faster prefill vs llama.cpp - NIAH passes every (ctx, keep) point up to 131K - first MoE target supported by PFlash - hand-rolled CUDA, ggml only, no libllama Great collab w/ @eisokant, @erc, and the rest of the team. looking forward to working more on their great coding models.🏎️ repo + GGUF in first comment

davideciffa's tweet photo. Luce PFlash now run @poolsideai’s Laguna-XS.2 (33B-A3B MoE) on a single RTX 3090.

- 111 tok/s decode @ short ctx
- 128K TTFT in 15.91s, 5.4x faster prefill vs llama.cpp
- NIAH passes every (ctx, keep) point up to 131K
- first MoE target supported by PFlash
- hand-rolled CUDA, ggml only, no libllama

Great collab w/ @eisokant, @erc, and the rest of the team. looking forward to working more on their great coding models.🏎️

repo + GGUF in first comment

4

40

3

28

2K

Who to follow

Michaelpeel🇳🇬

@Michael01961106

CRYPTO ENTHUSIAST || COMMUNITY MOD || surfing 🌊

OogaBooga

@8OogaBooga8

What a wild wonderful wicked world.

2084🛡️

@George0rwhal3

If the doors of perception were cleansed every thing would appear to man as it is, Infinite.- William Blake

dusterbloom retweeted

Joel - coffee/acc

@JoelDeTeves

25 days ago

Update on @luceboxai OOMing with Hermes Agent on RTX 3090: @davideciffa gave me a great suggestion this morning to try with Lucebox and I am happy to report that it works! Here are the settings to make it work with Hermes Agent on RTX 3090: DFLASH27B_KV_TQ3=1 DFLASH27B_PREFILL_UBATCH=128 python3 scripts/server.py --tokenizer Qwen/Qwen3.6-27B --port 8000 --max-ctx 65536 --fa-window 1024 --prefix-cache-slots 1 --budget 8 --daemon This *also* works with @DJLougen Ornstein model! Really looking forward to testing this out! Thank you David! This is one of the most exciting projects in local AI right now!

7

36

5

46

4K

dusterbloom retweeted

Sandro

@pupposandro

26 days ago

https://t.co/qdj5Re0SmU

13

110

13

91

39K

dusterbloom retweeted

Teknium 🪽

@Teknium

29 days ago

We just hit number one globally across all AI apps on OpenRouter. Super grateful to the nearly 1000 contributors who've helped make Hermes Agent great, thank you! What do you want to see next?

221

2K

129

351

2M

DusterBloom @dusterbloom

29 days ago

@exolabs 1st time when Mistral came out, I was disappointed. Everything changed in April with Qwen 3.6...as for @exolabs if I had a wish -> get CUDA and MLX working in tandem that would turn my rtx3090 and mac into something greater and each by itself. Thx

0

2

0

199

DusterBloom @dusterbloom

about 1 month ago

@malikwas1f @davideciffa @easel @malikwas1f https://t.co/dmSf9etAel

0

1

0

38

DusterBloom @dusterbloom

about 1 month ago

@chrisbward @Anthropic It does not make sense indeed but it wont be for much longer. I own compute. Their edge is vanishing. It is only a matter of time for me to be off of any cloud model for exactly the reasons you mention + reliability. A local model doesn't degrade over time.

0

1

0

15

DusterBloom @dusterbloom

about 1 month ago

@anthropic WTF !!! GET YOUR SH!7 TOGETHER LIKE...YESTERDAY

2

0

45

DusterBloom @dusterbloom

about 1 month ago

@chrisbward @Anthropic I wish it was that simple bro. It is way more nuanced than that ... anyway it won't be for long :-)

1

0

20

DusterBloom @dusterbloom

about 1 month ago

@malikwas1f @davideciffa @easel I always stay on one session, I have been using this thing since like version 2.0 that is what hurts....seeing 4.5 and 4.6 shine like two months ago and 4.7 going from really God Tier when it works to total disaster when it doesn't plus the sweet talk....unbearable

0

1

0

27

DusterBloom @dusterbloom

about 1 month ago

@malikwas1f @davideciffa @easel I know but damn

0

16

DusterBloom @dusterbloom

about 1 month ago

@easel @malikwas1f @davideciffa Totally, can't wait to be off of this shit! LocalAI is the only way out of progressive enshittification. It just drives me nuts that it acts way worse on WSL than on MacOS....god knows what they did to the thing but it is a disgrace.

0

2

0

31

DusterBloom @dusterbloom

about 1 month ago

@malikwas1f @davideciffa @easel Key findings 1. pFlash compression: clean. No quality regression vs baseline. 2. BSA: clean. No quality regression. 3. all_on: clean. Production stack matches baseline within noise.

1

2

1

0

79

DusterBloom @dusterbloom

about 1 month ago

@stuff383864 @davideciffa @easel Yes it can, I am working on it on my fork of `panbanda/higgs` but not finished yet. Just a hint, using ANE does magic on prefill but it is definetely non trivial

1

0

1

30

DusterBloom @dusterbloom

about 1 month ago

@malikwas1f @davideciffa @easel @malikwas1f working on a bench just to track quality

0

1

0

30

dusterbloom retweeted

Sandro

@pupposandro

about 1 month ago

@davideciffa @ivanfioravanti @dusterbloom @easel Thank you guys 🙏 Lucebox community is just incredible 🔥

0

7

1

0

482

DusterBloom

@dusterbloom

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users