Thought Provoking Vile VR Man @tompettyflacko - Twitter Profile

Thought Provoking Vile VR Man @tompettyflacko

about 2 hours ago

@jun_song show me any model running on an 128gb m5 and I can show you it running it faster on 128gb strix halo that's half the cost

0

3

0

67

Thought Provoking Vile VR Man @tompettyflacko

about 24 hours ago

@MiaAI_lab 🙋‍♂️

0

585

Thought Provoking Vile VR Man @tompettyflacko

1 day ago

@AnushElangovan could you also please fully port oga to linux? would love to run onnx hybrids on my linux os strix!!

0

12

Thought Provoking Vile VR Man @tompettyflacko

2 days ago

@Italianclownz @barackomaba @sudoingX @LarryAGuy1 with the work from you too and i have 35b at peaks of 137.9 t/s (~83 mean) and 27b at peaks of 51.4 t/s (~32 mean) Thats matching/beating 2x 3090 setups, but mine are co-loaded, and have access to a small army of helper, speech, img, and video models in parallel on the npu 🤘

1

2

1

0

112

Who to follow

Michael Cole

@BeastMcBeastly

Pro Wrestling Trainee. Just trying to live a dream.

Lester

@inaglasstunnel

International AI nonproliferation treaty now

very very slight

@gracewellin

dirtsy flirtsy angel bitch

Thought Provoking Vile VR Man @tompettyflacko

2 days ago

@LottoLabs hey lotto! my agent accidentally made some mistake (or was being overly literal) and lots of my 5060ti egpu runs on localmaxxing are also showing on the strix halo boards. despite patching my submissions they still show there. I don't want to slop ur site up, plz help

0

27

Thought Provoking Vile VR Man @tompettyflacko

4 days ago

@AMD please fully port oga to Linux, not being able to run onnx hybrid is a travesty

0

466

Thought Provoking Vile VR Man @tompettyflacko

6 days ago

@Italianclownz beast

0

1

0

43

Thought Provoking Vile VR Man @tompettyflacko

7 days ago

@bwaltens based alert

0

1

0

10

Thought Provoking Vile VR Man @tompettyflacko

9 days ago

@barackomaba @rocketman110us @sudoingX @Italianclownz yes and only p2, afaik mtp bugs at >2

1

0

27

Thought Provoking Vile VR Man @tompettyflacko

9 days ago

@populartourist @barackomaba @Italianclownz @rocketman110us @sudoingX @DJLougen WOW. now those are impressive numbers. and a killer idea. my latest frustration has been the bottleneck hermes seems to put on the model in the 10-15 t/s range, now going to try and claw some of that back. ty!

1

2

0

61

Thought Provoking Vile VR Man @tompettyflacko

9 days ago

@Italianclownz @barackomaba @rocketman110us @sudoingX @DJLougen @populartourist I dont see why not. maybe a multi task bench testing small runs on code, creative, extraction, facts, multi-turn convo, short gen, long gen, etc? score each section. select for configs w best robust performance bias towards stable gen across tasks > bursty peaks on select tasks

0

1

0

28

Thought Provoking Vile VR Man @tompettyflacko

9 days ago

@barackomaba @Italianclownz @rocketman110us @sudoingX @DJLougen :( spec draft type stacking i picked up from @populartourist . but relative success does depend on task type. I have a coherence baseline quick bench that gets run to keep things sane. I just give all the flags I want to stack/test and toss into an autoreseach hill climb loop.

2

0

39

Thought Provoking Vile VR Man @tompettyflacko

9 days ago

@Italianclownz @rocketman110us @sudoingX @barackomaba @DJLougen same config with 137.91 tg/s , just uploaded lots of my runs to localmaxxing . across the board with different models im beating or matching 2x 3090 setups

tompettyflacko's tweet photo. @Italianclownz @rocketman110us @sudoingX @barackomaba @DJLougen same config with 137.91 tg/s , just uploaded lots of my runs to localmaxxing . across the board with different models im beating or matching 2x 3090 setups https://t.co/5yovwPeoxl

1

2

0

46

Thought Provoking Vile VR Man @tompettyflacko

10 days ago

@Italianclownz @rocketman110us @sudoingX @barackomaba @DJLougen same idea! I had the saber 35b shards already on my box when you dropped your repo. Just ran them through your quantizer at the strix lean profile 🤘

1

3

1

0

183

Thought Provoking Vile VR Man @tompettyflacko

10 days ago

@rocketman110us @sudoingX @Italianclownz i went back to double check and 35b top was actually 🔥128.88 tg/s🔥

1

3

1

0

171

Thought Provoking Vile VR Man @tompettyflacko

10 days ago

@rocketman110us @sudoingX @Italianclownz the receipe:

3

0

2

71

Thought Provoking Vile VR Man @tompettyflacko

14 days ago

@barackomaba @Italianclownz @DJLougen from same tests had strix lean orstein 27b at mean 34.25 and max of 46 !!!

0

1

0

41

Thought Provoking Vile VR Man @tompettyflacko

14 days ago

@barackomaba @Italianclownz @DJLougen forgot I got long bench running rn but for stats on a previous run my strix lean 35 had a mean tps of 82.38 and max of 122.80. pp 772 @ 50k, 1101 @ 10k f16 kv b 32768 ub 2048 t 32 tb 16 draft-mtp , ngram-mod draft n max 6 draft p min .25 ngram-mod match 32 ngram-mod max 64

2

1

0

177

Thought Provoking Vile VR Man @tompettyflacko

14 days ago

@barackomaba @Italianclownz @DJLougen and yeah the nsc ace saber series is insane work. now if we could only get the 9b with mtp to round it out... @DJLougen 👀😁

0

2

0

24

Thought Provoking Vile VR Man

@tompettyflacko

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users