Genesis Ai @_genesis_ai_ - Twitter Profile

28 days ago

Yo @tilderesearch, great work on Wall Attention! However I have not got it stable without convs(?), for the FLA module most maybe. Also I would not rely on tl.where to protect exp2 in your kernel, simple fix to mask scores to -inf before exponentiation, dk is fragile <3

0

1

0

498

Genesis Ai

@_genesis_ai_

12 months ago

There are only 2 possible reasons to delay weights: 1. It sucks 2. You stole something and want to know if ppl can figure it out You can jailbreak any open source model with some simple trained kv-injections so "safety" is bs.

Sam Altman

@sama

12 months ago

we planned to launch our open-weight model next week. we are delaying it; we need time to run additional safety tests and review high-risk areas. we are not yet sure how long it will take us. while we trust the community will build great things with this model, once weights are out, they can’t be pulled back. this is new for us and we want to get it right. sorry to be the bearer of bad news; we are working super hard!

2K

19K

1K

2K

4M

1

5

1

0

3K

Genesis Ai

@_genesis_ai_

12 months ago

I prefer code, but this one is an exception.

2

1

0

2

2K

Genesis Ai

@_genesis_ai_

12 months ago

One of the most efficient ways of wasting compute is to use JSON in LLMs. Not to mention the degradation of perplexity... Don't take my word for it tho, just inspect the attention weights when using MCPs or force the output to follow a JSON schema.

2

0

2K

Who to follow

Olle Qvarnström

@Snaljapen

På korståg för avkastningens bevarande.

Colosseum Global Alpha - Twittrar i egenskap av privatperson, inga åsikter eller påståenden ska förknippas med min arbetsgivare ISEC Services AB

Genesis Ai

@_genesis_ai_

about 1 year ago

@komplexkonjugat Äntligen, dags att kasta ut egna taffliga cuda kernels för detta

1

0

322

Genesis Ai

@_genesis_ai_

over 1 year ago

Got a really stupid idea this morning BUT seems its possible to solve arithmetic in ML by just tokenizing smarter and do selective activations. It also solves strawberrry out of the box.

0

2

0

1K

Genesis Ai

@_genesis_ai_

over 1 year ago

@_carlhannes @komplexkonjugat @J_Landstroem Tar med nästa gång så testar vi

1

2

0

117

Genesis Ai

@_genesis_ai_

over 1 year ago

@_carlhannes @komplexkonjugat @J_Landstroem Bruh jag har ju risers, iofs 1x-16x men endån.

1

2

0

98

Genesis Ai

@_genesis_ai_

over 1 year ago

@danielhanchen @UnslothAI That was fun! Got a nf4 fused dequant kernel to x1.31 speedup at least with the given constrains.

0

235

Genesis Ai

@_genesis_ai_

over 1 year ago

@UnslothAI @danielhanchen @UnslothAI was asking for a x1.15 speedup, I give you x1.31 💃 aaaand works with torch.compile, triton autotune, T4 gpus or just like these benchmarks, out of the box. Still have some more tricks on optimizing it but that is for another night! Also should do the MM in there.

_genesis_ai_'s tweet photo. @UnslothAI @danielhanchen @UnslothAI was asking for a x1.15 speedup, I give you x1.31 💃 aaaand works with torch.compile, triton autotune, T4 gpus or just like these benchmarks, out of the box. Still have some more tricks on optimizing it but that is for another night! Also should do the MM in there. https://t.co/qdhIH8RlC9

1

5

1

0

1K

Genesis Ai

@_genesis_ai_

over 1 year ago

I think its time for a hacknight! @UnslothAI makes good kernels so lets try their challenge. Always start with the hard ones right? Lets start with a fused nf4 tensor kernel in Triton!

Daniel Han

@danielhanchen

over 1 year ago

We made 5 challenges and if you score 47 points we'll offer you $500K/year + equity to join us at 🦥@UnslothAI! No experience or PhD needed. $400K - $500K/yr: Founding Engineer (47 points) $250K - $300K/yr: ML Engineer (32 points) Challenges: 1. Convert nf4 / BnB 4bit to Triton 2. Make FSDP2 work with QLoRA 3. Remove graph breaks in torch.compile 4. Help solve Unsloth issues! 5. Memory Efficient Backprop If you have any questions about the challenges, please feel free to ask! We're looking for people to help push Unsloth forward - so come join us to democratize AI further! Our past work includes: 1. 1.58bit DeepSeek R1 GGUFs: https://t.co/gALGkUg5Cg 2. GRPO with Llama 3.1 8B in a Colab: https://t.co/LFdkNxwAYg 3. Gemma bug fixes: https://t.co/7kX94PyKQR 4. Gradient accumulation bug fixes: https://t.co/Tq4c5Qwqyw Details & submission guide: https://t.co/iXxRUTijWV

danielhanchen's tweet photo. We made 5 challenges and if you score 47 points we'll offer you $500K/year + equity to join us at 🦥@UnslothAI!

No experience or PhD needed.

$400K - $500K/yr: Founding Engineer (47 points)
$250K - $300K/yr: ML Engineer (32 points)

Challenges:
1. Convert nf4 / BnB 4bit to Triton
2. Make FSDP2 work with QLoRA
3. Remove graph breaks in torch.compile
4. Help solve Unsloth issues!
5. Memory Efficient Backprop

If you have any questions about the challenges, please feel free to ask! We're looking for people to help push Unsloth forward - so come join us to democratize AI further!

Our past work includes:
1. 1.58bit DeepSeek R1 GGUFs: https://t.co/gALGkUg5Cg
2. GRPO with Llama 3.1 8B in a Colab: https://t.co/LFdkNxwAYg
3. Gemma bug fixes: https://t.co/7kX94PyKQR
4. Gradient accumulation bug fixes: https://t.co/Tq4c5Qwqyw

Details & submission guide: https://t.co/iXxRUTijWV

183

6K

776

9K

1M

1

9

1

2

7K

Genesis Ai

@_genesis_ai_

over 1 year ago

@UnslothAI @danielhanchen 3 hours in, need to wrap it up now. Just some last optimizations and then showtime!

2

1

0

1K

Genesis Ai

@_genesis_ai_

almost 2 years ago

@_carlhannes @JoakimEwenson @0x4a45 @jhakansson_ Det är ditt samvete som pratar, du vet att du kan göra det där med typ hälften av kod och dubbelt så effektivt. Är det rimligt? Troligen inte, bra jobbat kompis ❤️

1

2

0

222

Genesis Ai

@_genesis_ai_

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users