Charles @charlesbben - Twitter Profile

9 months ago

✨ bitsandbytes now has native ZeroGPU support since the new multi-backend refactor. This shows how central @PyTorch has become in the AI landscape: 🔗 Extend torch the right way (custom ops, modes, etc.) and your software plugs seamlessly into a thriving ecosystem

0

6

3

0

378

Charles @charlesbben

9 months ago

Recently finished writing a new blogpost about @PyTorch compilation in ZeroGPU Spaces. Worth reading if you're interested in learning about : - PyTorch ahead-of-time compilation - ZeroGPU internals https://t.co/Jk3IwBlqEE

1

35

7

14

23K

charlesbben retweeted

Sayak Paul

@RisingSayak

10 months ago

Wrote an FA3 attention processor for @Alibaba_Qwen Image using the 🤗 Kernels library. The process is so enjoyable! Stuff cooking stuff coming 🥠 https://t.co/qVtQkwoB6o

RisingSayak's tweet photo. Wrote an FA3 attention processor for @Alibaba_Qwen Image using the 🤗 Kernels library. The process is so enjoyable!

Stuff cooking stuff coming 🥠

https://t.co/qVtQkwoB6o https://t.co/nkkvP67xyz

7

130

15

35

23K

Charles @charlesbben

11 months ago

@RisingSayak Amazing

0

8

Who to follow

Michelle Habonneau

@michelleyhbn

❤️ PM at @Huggingface 🤗

Nicolas Patry

@narsilou

ML Engineer @huggingface. Anything ML inference related. Maintaining `pipeline` in `transformers`. Loving Rust.

Jeff Boudier 🤗

@jeffboudier

Product + Growth @HuggingFace 🤗, the #1 open platform for AI builders. Co-founder Stupeflix (acquired by @GoPro).

charlesbben retweeted

Sayak Paul

@RisingSayak

11 months ago

Felt frustrated when using `torch.compile` as it takes forever? 🤬 You SHOULD switch to regional compilation & see if it is just as beneficial as using full compilation. Let the numbers (Flux.1-Dev) convince you 🫡

RisingSayak's tweet photo. Felt frustrated when using `torch.compile` as it takes forever? 🤬

You SHOULD switch to regional compilation & see if it is just as beneficial as using full compilation.

Let the numbers (Flux.1-Dev) convince you 🫡 https://t.co/ZYWWjmaxQq

3

21

2

5

1K

Charles @charlesbben

about 1 year ago

🚀 ZeroGPU v2 update We just switch to Nvidia H200 last week It means that @huggingface Spaces are now equipped with: - 🧠 70GB vram - ⚡ 2.5x more flops 🔓This will hopefully unlock unseen use cases 💰 It also makes Pro plan a seriously cheap CUDA compute option

charlesbben's tweet photo. 🚀 ZeroGPU v2 update

We just switch to Nvidia H200 last week

It means that @huggingface Spaces are now equipped with:
- 🧠 70GB vram
- ⚡ 2.5x more flops

🔓This will hopefully unlock unseen use cases

💰 It also makes Pro plan a seriously cheap CUDA compute option https://t.co/wssI6qy2x6

14

197

45

54

81K

charlesbben retweeted

apolinario (poli)

@multimodalart

over 1 year ago

ComfyUI → @huggingface Spaces → serverless ZeroGPU ✨😌 We wrote a tutorial on how to turn any ComfyUI workflow into an easy to use Gradio app and (optionally) host it for free with ZeroGPU 💥 https://t.co/1Ij5nqA5rS

3

217

35

164

26K

Charles @charlesbben

almost 2 years ago

⚡We've just rolled out a major update on ZeroGPU! Major improvements: - 2x faster GPU coldstarts - More efficient CPU memory usage (meaning more slots for the community) - ZeroGPU initialization now displays a progress bar - Greatly improved PyTorch compatibility

charlesbben's tweet photo. ⚡We've just rolled out a major update on ZeroGPU!
Major improvements:

- 2x faster GPU coldstarts
- More efficient CPU memory usage (meaning more slots for the community)
- ZeroGPU initialization now displays a progress bar
- Greatly improved PyTorch compatibility https://t.co/STt1UslOd4

2

40

12

15

13K

charlesbben retweeted

clem 🤗

@ClementDelangue

almost 2 years ago

Most frustrating message on HF? Would you be ok with a paid option there?

35

123

4

31K

charlesbben retweeted

clem 🤗

@ClementDelangue

about 2 years ago

GPU-Poor no more: super excited to officially release ZeroGPU in beta today. Congrats @victormustar & team for the release! In the past few months, the open-source AI community has been thriving. Not only Meta but also Apple, NVIDIA, Bytedance, Snowflake, Databricks, Microsoft, Google, and more have released open models and datasets on Hugging Face, which now hosts over 1M models on the Hub which have been downloaded over a billion times. More than that, many are starting to be better than proprietary APIs. This movement has been supported not only by big tech but also by a thriving open-source AI community that includes academic labs, startups, and independent hobbyists. For example, more than 35,000 variation models of Llama have been shared on Hugging Face since Meta’s first version a year ago—including more than 7,000 based on Llama-3—ranging from quantized and merged models to specialized models in biology and Mandarin, to name a few. More than 4 million AI builders are now using Hugging Face. However, the open-source community doesn’t have the same resources available to train and demo these models that big tech have at their disposal, which is why ChatGPT remains the most used AI application today. @huggingface is fighting this by launching ZeroGPU, a shared infrastructure for indie and academic AI builders to run AI demos on Spaces, giving them the freedom to pursue their work without the financial burden of compute costs. Spaces have been the most popular way to build AI demos, with over 300,000 AI demos created so far on CPU or paid GPU (and a thousand more every day). To foster the continued development of the AI ecosystem, Hugging Face is committing $10M of free GPUs with the launch today of ZeroGPU. Technically speaking, ZeroGPU leverages Hugging Face's experience in hosting and serving more than 100 Petabytes monthly from the Hugging Face Hub. ZeroGPU allows Spaces to run on multiple GPUs by making Spaces efficiently hold and release GPUs as needed (as opposed to a classical GPU Space that holds exactly one GPU at any time). This architecture is also more energy-efficient since GPUs are shared rather than duplicated. ZeroGPU uses @nvidia A100 GPU devices under the hood. You can learn more about ZeroGPU here: https://t.co/1mxUxXmElv More than 1,300 ZeroGPU spaces have been built since we started giving early access to AI builders on May 1, 2024: https://t.co/XvJ2MkcK7R You can explore some examples from @victormustar: https://t.co/b8SUcRelJf You can find the article from @kyliebytes: https://t.co/87uN1vnMu8 🤗🤗🤗

ClementDelangue's tweet photo. GPU-Poor no more: super excited to officially release ZeroGPU in beta today. Congrats @victormustar & team for the release!

In the past few months, the open-source AI community has been thriving. Not only Meta but also Apple, NVIDIA, Bytedance, Snowflake, Databricks, Microsoft, Google, and more have released open models and datasets on Hugging Face, which now hosts over 1M models on the Hub which have been downloaded over a billion times. More than that, many are starting to be better than proprietary APIs.

This movement has been supported not only by big tech but also by a thriving open-source AI community that includes academic labs, startups, and independent hobbyists. For example, more than 35,000 variation models of Llama have been shared on Hugging Face since Meta’s first version a year ago—including more than 7,000 based on Llama-3—ranging from quantized and merged models to specialized models in biology and Mandarin, to name a few. More than 4 million AI builders are now using Hugging Face.

However, the open-source community doesn’t have the same resources available to train and demo these models that big tech have at their disposal, which is why ChatGPT remains the most used AI application today.

@huggingface is fighting this by launching ZeroGPU, a shared infrastructure for indie and academic AI builders to run AI demos on Spaces, giving them the freedom to pursue their work without the financial burden of compute costs. Spaces have been the most popular way to build AI demos, with over 300,000 AI demos created so far on CPU or paid GPU (and a thousand more every day). To foster the continued development of the AI ecosystem, Hugging Face is committing $10M of free GPUs with the launch today of ZeroGPU.

Technically speaking, ZeroGPU leverages Hugging Face's experience in hosting and serving more than 100 Petabytes monthly from the Hugging Face Hub. ZeroGPU allows Spaces to run on multiple GPUs by making Spaces efficiently hold and release GPUs as needed (as opposed to a classical GPU Space that holds exactly one GPU at any time). This architecture is also more energy-efficient since GPUs are shared rather than duplicated. ZeroGPU uses @nvidia A100 GPU devices under the hood.

You can learn more about ZeroGPU here: https://t.co/1mxUxXmElv

More than 1,300 ZeroGPU spaces have been built since we started giving early access to AI builders on May 1, 2024: https://t.co/XvJ2MkcK7R

You can explore some examples from @victormustar: https://t.co/b8SUcRelJf

You can find the article from @kyliebytes: https://t.co/87uN1vnMu8

🤗🤗🤗

67

1K

220

368

276K

Charles @charlesbben

over 2 years ago

@yikesawjeez @Xianbao_QIAN @huggingface You don't need to ask for grants with ZeroGPU, that makes a big difference (I just approved your HF profile on https://t.co/oKuw0pBG61, you should now have access)

1

2

0

20

Charles @charlesbben

over 2 years ago

@realmrfakename @Xianbao_QIAN @huggingface We’ll update the name, it’s an error and it should display “Zero Nvidia A100” (just try checking the output of nvidia-smi inside at_spaces.GPU!). Have fun with ZeroGPU

1

2

1

0

74

charlesbben retweeted

Tiezhen WANG

@Xianbao_QIAN

over 2 years ago

@huggingface GPU zero are now running on A100! https://t.co/zqK9h8sNgT By adding a simple annotation, your Spaces with grants are able to run - on multiple GPUs - on demand GPUs, release as needed Come join the org and start making awesome demos on many A100!

8

157

32

69

110K

charlesbben retweeted

merve

@mervenoyann

over 2 years ago

migrated all my GPU using Spaces to ZERO, was like a walk on a beach 🏖️

3

28

1

8

8K

charlesbben retweeted

clem 🤗

@ClementDelangue

over 2 years ago

@karpathy we need you on the open/decentralized side!

15

571

11

9

50K

charlesbben retweeted

Omar Sanseviero

@osanseviero

over 2 years ago

ChatGPT vs HuggingChat

27

391

45

37

69K

Charles @charlesbben

over 2 years ago

@marksaroufim @huggingface @TheZachMueller https://t.co/72a8avzbUq is responsible for the speedup

0

1

0

32

Charles @charlesbben

over 2 years ago

Model downloads from @huggingface Spaces should be way faster overall This https://t.co/JiWbn2EuOB Llama-13B Space boots in 30s instead of 15min previously. That's a 30x speedup It is running on a new Space hardware 🤫 but you can expect decent speed-ups on regular Spaces too

2

21

3

2

4K

Charles @charlesbben

over 2 years ago

@huggingface

0

175

Charles @charlesbben

over 2 years ago

@karpathy Thank you so much for this 2015 RNN blogpost @karpathy. It has been the most impactful thing in my career so far

0

1

0

158

Charles

@charlesbben

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users