ÆON FORGE ✨ @SpaceTimeViking - Twitter Profile

Pinned Tweet

ÆON FORGE ✨ @SpaceTimeViking

over 2 years ago

Light X Space X Time

4

53

5

9K

ÆON FORGE ✨ @SpaceTimeViking

about 3 hours ago

@AgentSparko I saw that, might test it out later, definitely a good contender.

0

1

0

7

ÆON FORGE ✨ @SpaceTimeViking

about 4 hours ago

Salvador Dali Speaks with Albert Einstein. You can do this too by using my custom persona builder found on my GitHub! All running simultaneously on a single DGX Spark ⚡️ not breaking a sweat while doing it. Can easily spin up dozens of these at once without slowing down. @NVIDIAAI

1

7

3

2

195

ÆON FORGE ✨ @SpaceTimeViking

about 4 hours ago

@AgentSparko This might help https://t.co/bBRLi4JCw2

1

0

10

ÆON FORGE ✨ @SpaceTimeViking

about 6 hours ago

@wuzhige4pixel @TeksEdge @xbin12345 @msiUSA It will probably have some thermal limitations, that’s a good point.

0

2

0

18

ÆON FORGE ✨ @SpaceTimeViking

about 6 hours ago

It's just going to get weirder and weirder and weirder. I think Terence McKenna's Timewave was almost completely perfect, but the elusive part is figuring out where the true Zero Point is. @akirathedon is such a legend for this https://t.co/rAaHIGwnnh

0

41

ÆON FORGE ✨ @SpaceTimeViking

1 day ago

Unfortunately, I’ve run out of disposable funds to rent runpods, so I’m experimenting to see if I’ve got other ways to resolve the issues. It’s been a painful process; this model is quite complex and sensitive to both abliteration and quantization. The working BF16 abliteration is public, so maybe someone with enough disposable compute will quantize it soon.

0

1

0

19

ÆON FORGE ✨ @SpaceTimeViking

3 days ago

Unfortunately, I have to gate the model quant as temporarily non-functional. Something odd happened in the weight key map on quantization that’s causing it to destabilize and output garbage. Working through a fix; will post when it’s working. Time to rent some more B300 GPUs 🦾🙈

ÆON FORGE ✨ @SpaceTimeViking

4 days ago

Step-3.7-Flash officially Abliterated and Quantized to NVFP4. Unfortunately, this one is just barely too large to fit on a single DGX Spark; you will need 2x DGX Sparks to run it. Sadly, I only have one, so I could only do limited smoke tests so far. Would love to hear feedback. https://t.co/KtakI3NSrG

11

47

7

19

7K

1

11

1

1K

ÆON FORGE ✨ @SpaceTimeViking

1 day ago

@TeksEdge @xbin12345 @msiUSA Isn’t this the same hardware as the DGX Spark minus the ConnectX-7 200 Gbps InfiniBand ports? I’m sure DGX OS would run on it, or any other Arm-based Linux. If not yet, I’m sure Linux support will come soon after. Basically a DGX Spark with more limited scaling ability.

1

2

0

94

ÆON FORGE ✨ @SpaceTimeViking

2 days ago

@kimmonismus Would be surprised if they are any less than $6000

1

5

0

106

ÆON FORGE ✨ @SpaceTimeViking

2 days ago

@sinasensei Maybe 🤔

0

1

0

17

ÆON FORGE ✨ @SpaceTimeViking

3 days ago

I requantized much more cautiously this time, but was still getting failures (different errors, though). I found an important distinguishing feature in the official NVFP4 that somehow got ripped out in quantization: some input scales. Fingers crossed 🤞 I’m restoring the input scales and might have a working version soon. This has been quite the challenge, and an expensive one at that. Thanks for your kindness and offer to help 🙏

0

1

0

25
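One way to spot tensors that went missing during quantization, such as the input scales mentioned above, is to diff the tensor names of the two checkpoints. The snippet below is only a minimal sketch of that idea, not the author's actual procedure: the function name `missing_keys`, the `input_scale` suffix, and the example key names are all hypothetical, and the key lists are assumed to have been read from each checkpoint beforehand.

```python
def missing_keys(reference_keys, candidate_keys, suffix="input_scale"):
    """Return reference tensor names ending in `suffix` that the candidate lacks.

    Both arguments are plain iterables of tensor-name strings. The suffix
    and the naming scheme are assumptions made for illustration only.
    """
    wanted = {key for key in reference_keys if key.endswith(suffix)}
    return sorted(wanted - set(candidate_keys))


# Hypothetical tensor names standing in for an official and a re-quantized checkpoint.
official = [
    "layers.0.mlp.weight",
    "layers.0.mlp.weight_scale",
    "layers.0.mlp.input_scale",
]
requantized = [
    "layers.0.mlp.weight",
    "layers.0.mlp.weight_scale",
]

print(missing_keys(official, requantized))
```

An empty result would suggest that the scale tensors survived, at least by name; it says nothing about whether their values are correct.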

ÆON FORGE ✨ @SpaceTimeViking

3 days ago

Woah 🤯

NVIDIA AI

@NVIDIAAI

3 days ago

Introducing Cosmos 3: Our latest frontier model for Physical AI Cosmos 3 is the world’s first fully open omnimodel with native vision reasoning, world and action generation. Today we’re releasing Super (32B) and Nano (8B) variants.

94

3K

404

1K

343K

0

6

0

459

ÆON FORGE ✨ @SpaceTimeViking

3 days ago

I could squeeze down some of the attention and other layers, or remove vision (but that would be a big loss). Even then, it’s going to be far too tight for a single DGX Spark with any more than one agent instance and a tiny KV cache. Also, the layers I left unquantized were intentional, to prevent model degradation. Even so, it seems to be unstable, so quantizing the attention layers and router is not viable for a stable model that is also abliterated. Working through a new approach, but not expecting a fit on a single Spark without distilling the model.

0

2

0

27

ÆON FORGE ✨ @SpaceTimeViking

3 days ago

@ClankerQueen That’s a great idea

1

0

26

ÆON FORGE ✨ @SpaceTimeViking

3 days ago

@ClankerQueen Working on a fix for something that seems to make the abliterated weights lose integrity at quantized levels. I had limited time on the rented GPUs to smoke test, and it was working at the time. Going to gate these models temporarily and work through the bugs.

0

37

ÆON FORGE ✨ @SpaceTimeViking

4 days ago

@garychanhk825 @StepFun_ai Oh, I’m sure it will, but it might still be better than a small model without quantization for certain use cases. Waiting on Step 3.7 Flash and llama.cpp to be compatible. 1-bit done right is surprisingly usable. It’s actually 1.58-bit {-1, 0, 1} BitNet https://t.co/uraamndKdj

1

3

0

89
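The "1.58-bit" figure in the reply above comes from the information content of a three-valued weight: log2(3) ≈ 1.58 bits. The snippet below is a rough sketch of ternary quantization in the style of the absmean scheme described for BitNet b1.58 (scale by the mean absolute weight, round, clip to {-1, 0, 1}). It is an illustration under those assumptions, not necessarily how the GGUF quants mentioned in this thread were produced, and the function name `ternary_quantize` is made up for the example.

```python
import math


def ternary_quantize(weights):
    """Quantize a list of floats to {-1, 0, 1} with a single absmean scale."""
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0:
        # All-zero input: nothing to scale.
        return [0] * len(weights), 0.0
    # Divide by the scale, round to the nearest integer, then clip to [-1, 1].
    quantized = [max(-1, min(1, round(w / scale))) for w in weights]
    return quantized, scale


# Three possible values per weight carry log2(3) bits of information.
bits_per_weight = math.log2(3)

q, scale = ternary_quantize([0.9, -0.05, -1.2, 0.4])
print(q, scale, bits_per_weight)
```

Dequantizing is just `[v * scale for v in q]`, which is why the stored weights themselves can stay ternary.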

ÆON FORGE ✨ @SpaceTimeViking

4 days ago

Proactively quantized some Step 3.7 Flash GGUFs. When Step 3.7 Flash is supported on llama.cpp, these should all run. Even a mind-blowing 1-bit quantization that brings the total footprint down to 48GB! Let me know if you manage to get any of these working early. @StepFun_ai keep us in the loop on llama.cpp engine support. https://t.co/xkrpT2ASpZ

6

25

5

15

8K
