Loïck BOURDOIS @bdsloick - Twitter Profile

Pinned Tweet

26 days ago

New blog post on @huggingface! An introdution to Trimming ✂️ a little-known but highly effective model reduction method. We achieved up to 87.24% size reduction while preserving performance 🧵

BdsLoick's tweet photo. New blog post on @huggingface!
An introdution to Trimming ✂️ a little-known but highly effective model reduction method. We achieved up to 87.24% size reduction while preserving performance 🧵 https://t.co/LyfHqN6lEj

1

9

3

367

Loïck BOURDOIS @BdsLoick

26 days ago

@IBM @Microsoft @GoogleDeepMind @baai @qwen @orionweller @vllm @Google @Meta @huggingface @Nils_Reimers @jaseweston Big thanks to my HF Fellows bros for multilingual evaluation @tomaarsen, Bram Vanroy, @christopher, @w00jun_ @mrm8488, @prithivMLmods and to @AI_AlphaEdge for the time dedicated to this project 🙏 Links 👇 Blogpost: https://t.co/hBvVKpk7So Models: https://t.co/FsycUcxyAK

0

4

2

370

Loïck BOURDOIS @BdsLoick

26 days ago

New blog post on @huggingface! An introdution to Trimming ✂️ a little-known but highly effective model reduction method. We achieved up to 87.24% size reduction while preserving performance 🧵

1

9

3

367

Loïck BOURDOIS @BdsLoick

26 days ago

@IBM @Microsoft @GoogleDeepMind @baai @qwen @orionweller @vllm @Google @Meta @huggingface @Nils_Reimers @jaseweston From these 16 families, we generated more than 5,500 monolingual models in 124 different languages.

BdsLoick's tweet photo. @IBM @Microsoft @GoogleDeepMind @baai @qwen @orionweller @vllm @Google @Meta @huggingface @Nils_Reimers @jaseweston From these 16 families, we generated more than 5,500 monolingual models in 124 different languages. https://t.co/Ett7fss0Vm

1

2

0

82

Who to follow

ChunTe Lee

@lee_chunte

UI/UX @ HuggingFace🤗

Bonaventure F. P. Dossou

@bonadossou

@UN Scholar | PhD Fellow @RBCBorealis | PhD @mcgillu |🥇MSc in CS🥇BSc in Maths | Research @Mila_Quebec @MasakhaneNLP @GoogleAI @GoogleDeepMind @lelapaai @Roche

Alexandre TL

@AlexandreTL2

Intern at @DragonLLM in Paris. (Pre|post)-training LLMs

BdsLoick retweeted

Niels Rogge @NielsRogge

about 1 month ago

Introducing a revival of PapersWithCode! As @ilyasut said, we're back to the "age of research". Hence, it's important to share research and build on each other's work. > find SOTA per domain, not just LLMs > leaderboards > methods > all parsed at scale using AI agents.

33

611

91

482

79K

BdsLoick retweeted

tomaarsen @tomaarsen

2 months ago

🌐 I've just released Sentence Transformers v5.4: we're going fully multimodal for embeddings & reranking! Also featuring a modular CrossEncoder, and automatic Flash Attention 2 input flattening. Highlights in 🧵

tomaarsen's tweet photo. 🌐 I've just released Sentence Transformers v5.4: we're going fully multimodal for embeddings & reranking!
Also featuring a modular CrossEncoder, and automatic Flash Attention 2 input flattening.

Highlights in 🧵 https://t.co/IDaRPVYc2g

19

175

31

58

29K

BdsLoick retweeted

みぃ🍵 @mithernet

3 months ago

著者です！ Attentionの「相対比較しかできない」という制約を外した、新しい機構を提案しました ①まずわかりやすい利点 ✅学習時より圧倒的に長い文でも性能維持＆正確な情報取得 ✅収束が非常に高速（LR=1でも学習可能） ✅モデルサイズ4割削減 ✅推論速度3倍超 (続く) https://t.co/75rZpnqieu

mithernet's tweet photo. 著者です！
Attentionの「相対比較しかできない」という制約を外した、新しい機構を提案しました

①まずわかりやすい利点

✅学習時より圧倒的に長い文でも性能維持＆正確な情報取得
✅収束が非常に高速（LR=1でも学習可能）
✅モデルサイズ4割削減
✅推論速度3倍超

(続く)

https://t.co/75rZpnqieu https://t.co/7enHZCXjDn

15

802

133

608

88K

Loïck BOURDOIS @BdsLoick

4 months ago

CuTeDSL is really nice For those wishing to get into writing kernels in this language, https://t.co/7fjj26yj0F can be useful Boris ALBAR reimplemented Flash Attention, RoPE, RMSnorm, etc. Everything compatible with HF Transformers (tests on llama3, GLM4.7, Qwen3), TRL, PEFT/LoRA

maharshi

@maharshii

4 months ago

CuTeDSL is my new favourite thing: I wrote a kernel for RMS norm after learning about layouts, tiling, copying tensors, reductions and so on, especially for inference and it is about 2.13x faster than a triton fused kernel for the given shape.

maharshii's tweet photo. CuTeDSL is my new favourite thing: I wrote a kernel for RMS norm after learning about layouts, tiling, copying tensors, reductions and so on, especially for inference and it is about 2.13x faster than a triton fused kernel for the given shape. https://t.co/7tvDNH6HBM

13

266

7

78

17K

1

0

2

168

BdsLoick retweeted

Basile Terver

@BasileTerv987

5 months ago

𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝗘𝗕-𝗝𝗘𝗣𝗔 ⚡ An open-source library making JEPAs accessible, trainable on a single GPU in hours! 🚀 🔗 Paper: https://t.co/7YDSt0AiiA 💻 Code: https://t.co/KWkhcoDidU

BasileTerv987's tweet photo. 𝗜𝗻𝘁𝗿𝗼𝗱𝘂𝗰𝗶𝗻𝗴 𝗘𝗕-𝗝𝗘𝗣𝗔 ⚡

An open-source library making JEPAs accessible, trainable on a single GPU in hours! 🚀

🔗 Paper: https://t.co/7YDSt0AiiA
💻 Code: https://t.co/KWkhcoDidU https://t.co/vw1Fp2Lxu6

13

657

96

516

92K

Loïck BOURDOIS @BdsLoick

5 months ago

@MaziyarPanahi @OpenMed_AI @huggingface huggingface_hub api is all you need to do it programmably 👀

1

0

15

Loïck BOURDOIS @BdsLoick

5 months ago

@gui_penedo I suppose all good things must come to an end. Thank you very much for the high-quality multilingual datasets

0

1

0

25

Loïck BOURDOIS @BdsLoick

6 months ago

@lhoestq @huggingface @mervenoyann @abhi1thakur If you rename the `datasets` library to `nlp` as in early 2020, I'll make sure it passes 700k before the end of the year 👀

0

1

0

24

Loïck BOURDOIS

@BdsLoick

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users