Krispychip

@krispywe

Joined March 2022

32 Following

95 Followers

1.4K Posts

Pinned Tweet

Krispychip @krispywe

about 1 month ago

ComicTL has finally reached a stable release. You can now choose to run it fully locally or use Gemini in cloud mode. I also made a website for it; you can read it at https://t.co/CRAXTFijfx.

krispywe retweeted

18 days ago

Introducing DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation https://t.co/c9AvsRKybj

What if we didn’t have to hold an entire neural network in memory to train it?

Standard neural net training optimizes all parameters jointly. As a result, the memory required during training grows linearly with the depth of the network.

In our #ICLR2026 paper, we propose DiffusionBlocks, a principled framework to train networks one block at a time, drastically reducing memory requirements while matching end-to-end performance.

With DiffusionBlocks, we split the network into blocks and train them one at a time, so you only need memory for a single block.

How? We explicitly assign each block a role: to move the representation a little closer to the target than the block before it did. That role turns out to be precisely what a diffusion model does, step by step. Each block only needs to optimize its own objective and can be trained independently.

We validated this across five different architectures:
• ViT
• DiT
• Masked diffusion
• Autoregressive transformers
• Recurrent-depth transformers

In each case, performance is competitive with end-to-end training while using a fraction of the memory.

This perspective also extends naturally to recurrent-depth (Looped) transformers, which apply the same network iteratively and normally require expensive backpropagation through time (BPTT). Viewed through DiffusionBlocks, we can replace those multiple iterations with a single forward pass during training.

Read our paper and code, to learn more.
Paper: https://t.co/CRj96VGYQn
GitHub: https://t.co/eNW0K9Xh8E 🐟

Krispychip @krispywe

about 1 month ago

@gavinrbrown1 Lmao that's a good one

Krispychip @krispywe

about 1 month ago

@iyoushetwt Hope you're not wasting your time doin Linux ricing

Krispychip @krispywe

about 1 month ago

@iRainbowsaur @Birchlabs I got u bro. Almost every time I read papers, they invent a new word for something that already exists

Krispychip @krispywe

about 1 month ago

@twtayaan No wonder Linux users are usually not in a relationship, especially Arch guys

Krispychip @krispywe

about 1 month ago

Ah, CVE-2026-31431. For the first time, I might be rethinking Linux and AI. Damn, 732 bytes of Python can take you from a regular user to root access. Fk insane finding.

Krispychip @krispywe

about 1 month ago

Not only on MangaDex, you can also use it on X https://t.co/g1RVt66Gb5

fanks @fanks_carol

about 1 month ago

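@Animex_Tweet That's a doctored translation. This is the real one: S: "Do you have some business around here?" A: "Just passing through. And you, why are you here?" S: "I help out now and then at the children's center here. There's an event today."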

Krispychip @krispywe

about 2 months ago

Finally, the long-running project that I almost forgot about has reached beta. You can try it by downloading the artifact from GitHub: https://t.co/w9IefZCXYe. Currently, it only supports Gemini translation. The stable release will try to include a local LLM as a translator.

Krispychip @krispywe

about 2 months ago

@pnkrtn_ @rayzhudev @catboosted wait, a month is a serious gap?

Krispychip @krispywe

about 2 months ago

@necrosamus I heard that quite a while ago, and @idwiki already posted about it

Krispychip @krispywe

about 2 months ago

Bruh, I just uploaded a PDF and it vomited its own thinking in an endless loop, like I'd done a prompt attack @GeminiApp

Krispychip @krispywe

2 months ago

@cneuralnetwork They're too innocent for this cruel world

Krispychip @krispywe

2 months ago

@julioleiva2121 @BrunsJulian1541 @ChatgptLunatics Bro got triggered immediately just because he talked about model type

Krispychip @krispywe

2 months ago

LinkedIn is a positively toxic echo chamber. Everyone licks each other's boots in the comments, saying "nice insight" because they are too afraid to correct wrong math. Politeness is just a mask for incompetence. Stop trusting vibes and start reading the docs. (12/12)

Krispychip @krispywe

2 months ago

This is why I hate LinkedIn. Too many people with glorified titles confidently post trash because they can’t even read properly. An "AI Engineer" promoting a custom training loop for imbalance that is actually a tutorial on how to sabotage your own model. (1/12)

Krispychip @krispywe

2 months ago

But because Focal Loss doesn't look as "complex" as a 40-line custom filtering pipeline, people think it is less professional. Complexity is not a feature; it is a bug. A senior engineer deletes code; they don't add unreadable dataset mutilation. (11/12)
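For readers unfamiliar with the Focal Loss mentioned above, here is a minimal sketch in plain Python, assuming the commonly cited per-example form FL(p_t) = -alpha * (1 - p_t)^gamma * log(p_t). The function names, the defaults gamma=2.0 and alpha=1.0, and the sample probabilities are illustrative assumptions, not taken from the thread or from any particular library.

```python
import math


def focal_loss(p_true, gamma=2.0, alpha=1.0):
    """Focal loss for one example.

    p_true is the model's predicted probability for the correct class.
    gamma and alpha are illustrative defaults (assumptions), not values
    from the thread.
    """
    # Clamp to avoid log(0) on a completely wrong prediction.
    p_true = min(max(p_true, 1e-12), 1.0)
    # (1 - p_true) ** gamma shrinks the loss of already-confident examples.
    return -alpha * (1.0 - p_true) ** gamma * math.log(p_true)


def cross_entropy(p_true):
    """Plain cross-entropy is the gamma = 0 special case."""
    return focal_loss(p_true, gamma=0.0)


# Hypothetical predictions: an easy example and a hard one.
easy, hard = 0.95, 0.10

# How much more the hard example weighs than the easy one under each loss.
ratio_ce = cross_entropy(hard) / cross_entropy(easy)
ratio_fl = focal_loss(hard) / focal_loss(easy)
print(f"hard/easy weight, cross-entropy: {ratio_ce:.1f}")
print(f"hard/easy weight, focal loss:    {ratio_fl:.1f}")
```

The point of the sketch is that the reweighting lives inside the loss itself: well-classified examples are down-weighted without filtering or mutating the dataset.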
