Luc Georges

Verified account

@LucSGeorges

Software & ML Engineer @huggingface 🦀

Paris, France

Joined December 2020

474 Following

1.8K Followers

382 Posts

Pinned Tweet

9 months ago

we've been pushing commits to transformers discretely, time to talk about we've been cooking the last few months: ⚡️ Continuous Batching is in transformers ⚡️ this will simplify, most notably, evaluation and your training loop: no need for extra dependencies or infra to get fast inference, and no need for convoluted code to update your weights note that speed is currently not on par with the best inference frameworks and servers out there and probably never will be the goal is *not* to become as fast: we want to complement the existing landscape with features like these, aiming for transformers to be the toolbox for tinkering with and building models

LucSGeorges's tweet photo. we've been pushing commits to transformers discretely, time to talk about we've been cooking the last few months:

⚡️ Continuous Batching is in transformers ⚡️

this will simplify, most notably, evaluation and your training loop: no need for extra dependencies or infra to get fast inference, and no need for convoluted code to update your weights

note that speed is currently not on par with the best inference frameworks and servers out there and probably never will be

the goal is *not* to become as fast: we want to complement the existing landscape with features like these, aiming for transformers to be the toolbox for tinkering with and building models

15

178

23

45

52K

LucSGeorges retweeted

26 days ago

Anyone interested in a CUDA deep dive that makes your workload 25% faster? 🧐 Just published a new blog post on asynchronous CPU / GPU inference: 100% insight, zero slop 😊 To learn how to remove all CPU overhead and use your GPU to the max, just read it 🔥

remi_or_'s tweet photo. Anyone interested in a CUDA deep dive that makes your workload 25% faster? 🧐

Just published a new blog post on asynchronous CPU / GPU inference: 100% insight, zero slop 😊
To learn how to remove all CPU overhead and use your GPU to the max, just read it 🔥 https://t.co/g1l4But2x2

1

25

11

14

4K

about 1 month ago

@remilouf @dottxtai

0

7

0

0

478

about 1 month ago

LucSGeorges's tweet photo. https://t.co/jYCU5tmsLd

about 1 month ago

Wait? Is this a win for the MoE doomers? We are so back!

6

28

0

9

12K

0

3

0

0

273

Who to follow

Quentin Lhoest 🤗

Datasets @huggingface | Open Source + HF Dataset Hub

Verified account

Research Engineer @GoogleDeepMind, Gemini Diffusion prev: huggingface 🤗 (transformers team), nPlan, PhD@IST 🇵🇹

about 1 month ago

@IlysMoutawwakil Woah hold up, the MoE police is gonna be after you for this one

1

2

0

0

142

about 1 month ago

this is the way

about 1 month ago

Reading @deepseek_ai 's v4 paper.... absolute hats off. Every problem has a mathematical solution, nothing is left to chance. I have so much respect for them, putting out months or years of efforts entirely for free, in the open for anyone to benefit. Real goats 🫡

74

5K

374

706

252K

0

3

0

2

308

about 1 month ago

@pcuenq LET'S GOOOOOOOO 🔥🔥

0

1

0

0

71

about 2 months ago

🐐 Feel free to drop some more gems in safetensors 🫡

about 2 months ago

This marks the end of my first week at @huggingface! I'm joining as a founding engineer on HF's PyTorch team. My first project: safetensors on Mac is up to 3x faster🚀 Parallel reads straight into MPS unified memory, no CPU staging. MB Pro M5 Pro - Cold 16 GB: **2.97 → 8.23 GB/s** (2.8×) - Warm 3 GB: **10.3 → 26.6 GB/s** (2.6×)

Is36E's tweet photo. This marks the end of my first week at @huggingface! I'm joining as a founding engineer on HF's PyTorch team.

My first project: safetensors on Mac is up to 3x faster🚀

Parallel reads straight into MPS unified memory, no CPU staging.

MB Pro M5 Pro
- Cold 16 GB: **2.97 → 8.23 GB/s** (2.8×)
- Warm 3 GB: **10.3 → 26.6 GB/s** (2.6×)

6

154

7

43

12K

0

3

0

1

414

LucSGeorges retweeted

about 2 months ago

We're opening a Hugging Face office in Tokyo! Our goal: help open-source AI develop in Japan and grow the local community. Let's meet! ハギングフェイスの東京オフィスがオープンしました！私たちの目標は、日本におけるオープンソースAIの発展を支援し、ローカルコミュニティを育てることです。ぜひお会いしましょう！

LysandreJik's tweet photo. We're opening a Hugging Face office in Tokyo!

Our goal: help open-source AI develop in Japan and grow the local community. Let's meet!

ハギングフェイスの東京オフィスがオープンしました！

私たちの目標は、日本におけるオープンソースAIの発展を支援し、ローカルコミュニティを育てることです。ぜひお会いしましょう！

131

3K

476

433

309K

about 2 months ago

@LysandreJik @art_zucker @remi_or_ スゴイ !! 🥹🥹

0

6

0

0

577

about 2 months ago

Arise, Sir Ilyas, Knight of the Upper Dims

Ilyas @IlysMoutawwakil

about 2 months ago

Tri Dao praised my PR 😳

IlysMoutawwakil's tweet photo. Tri Dao praised my PR 😳 https://t.co/nvK1Ql7aoe

4

413

6

59

20K

0

5

0

0

413

about 2 months ago

👁️👄👁️

LucSGeorges's tweet photo. 👁️👄👁️ https://t.co/2IsLckP07e

0

5

0

0

386

about 2 months ago

First release of safetensors under the PyTorch Foundation umbrella! 0.8.0-rc.0 is out: - GIL-free serialization - Windows ARM64 wheels - AMD FP8 FNUZ support - little perf improvements here and there ✨ Would appreciate feedback if you feel inclined 🥹

LucSGeorges's tweet photo. First release of safetensors under the PyTorch Foundation umbrella! 0.8.0-rc.0 is out:
- GIL-free serialization
- Windows ARM64 wheels
- AMD FP8 FNUZ support
- little perf improvements here and there ✨

Would appreciate feedback if you feel inclined 🥹 https://t.co/0CKHRb2LoO

1

9

2

1

626

2 months ago

https://t.co/Uai95oH6aR

0

0

0

0

93

2 months ago

Big moment for open source ML: safetensors is joining the PyTorch foundation! This means first class citizen support for safetensors in PyTorch’s core library, amongst other things 🥹 Super proud of being a maintainer in such an essential tool for ML 🫡

LucSGeorges's tweet photo. Big moment for open source ML: safetensors is joining the PyTorch foundation!

This means first class citizen support for safetensors in PyTorch’s core library, amongst other things 🥹

Super proud of being a maintainer in such an essential tool for ML 🫡 https://t.co/ymTquEJ6P3

1

17

7

0

1K

2 months ago

@tunguz @calebfahlgren @huggingface Any specific library or product in mind? Would appreciate actionable feedback!

0

2

0

0

109

3 months ago

@eliebakouch 🥺🫶🔊🔊🐐

0

1

0

0

118

3 months ago

I seem to have found somewhat of a sweet spot. Talk into Claude for the ideation phase, write down the plan, and do everything by hand myself, apart from tests maybe, who likes writing test amiright I question / rework / ignore everything written in plan as it often misses the target, but it does help me think through the problem in great detail. I go from one big plan to smaller in depth plans for each substep which works quite nicely. Co-ideating with Claude keeps the fun alive imo, so long you ask it to tweak / give feedback on your original ideas and have vision for what you want to do. It kind of feels like pair programming!

3 months ago

Programming was deeply satisfying work to me. Work for hours/days before getting the payoff of the code working well on your machine. I’m feeling so much friction now to open the editor and do this kind of task by hand, but also increasingly depressed with the nature of work in an AI assisted dev workflow. Back and forth prompting seems to eat at my soul. Need to find a balance that brings back some of the toil.

252

4K

221

571

503K

0

1

0

0

354

4 months ago

@ggerganov Incredible news, let's go 🔥🔥

0

2

0

0

481

4 months ago

@XciD_ @huggingface Claude climbing the echelons of HF leadership was not on my bingo card

0

4

0

0

427

LucSGeorges retweeted

4 months ago

Transformers v5's FINAL, stable release is out 🔥 Transformers' biggest release. The big Ws of this release: - Performance, especially for MoE (6x-11x speedups) - No more slow/fast tokenizers -> way simpler API, explicit backends, better performance - dynamic weight loading: way faster, and enabling: MoE now working w/ {quants, tp, peft, ...} We have a migration guide on the main branch; please take a look at it in case you run into issues. Come in our GH issues if you still do after reading it 😀

LysandreJik's tweet photo. Transformers v5's FINAL, stable release is out 🔥 Transformers' biggest release.

The big Ws of this release:
- Performance, especially for MoE (6x-11x speedups)
- No more slow/fast tokenizers -> way simpler API, explicit backends, better performance
- dynamic weight loading: way faster, and enabling: MoE now working w/ {quants, tp, peft, ...}

We have a migration guide on the main branch; please take a look at it in case you run into issues. Come in our GH issues if you still do after reading it 😀

9

430

81

132

76K

Last Seen Users on Sotwe

Trends for you

Most Popular Users