Serving @OpenAI Whisper, powered by @vllm_project, deployed on @huggingface Inference Endpoint, with astonishing @Gradio demo ... What else? 😎
AI community is all you need to create blazing fast whisper transcriptions 💥🚀
More in our blog 📰 https://t.co/zE1G4VAvHb
Last but not least, vLLM and llama.cpp are currently being worked on! 🚀
Initial artifacts for testing will be made available for the community around end of Q1'25 🛠️ and follow up blog posts will pop as we start rolling them out.
Anyone looking to deploy LLMs in production,���got something for you👀
@huggingface TGI is expanding its capabilities to bring support for different inference backends (vLLM, TensorRT-LLM, llama.cpp, etc. etc.) 🏋️
🗞️ More info in our last blogpost https://t.co/0vQnSXQZaJ
TensorRT-LLM, from our partner @nvidia, is the first backend we will be shipping. We are currently wrapping up the last parts.
It'll bring cutting edge float8, quantization & sparsity support🤏
We'll soon publish a part 2 with more in-depth about this integration, stay tuned!
Want to leverage @AMDInstinct MI300x GPUs for LLMs? Check out how easy it is and the performance in our blog post! 🚀
👉https://t.co/dyDXrQ6HYU
Still hungry?🧐Register for the next HuggingCast on June 6th 🎙️https://t.co/zUYJp41QRW
PS: Deploy these GPUs on @Azure right now! 😱
Hello @Free_1337 👋🏻, plus de fibre depuis le 19/12 sur Lessard le National (71530) est-ce que vous avez de la visibilité sur une résolution future ? 🤗🙏🏻
In the coming release we will focus on:
- Making optimum-nvidia pip installable
- Enabling GPTQ/AWQ models from the @huggingface hub
- Officially supporting Mistral models
Stay tuned! 🤗
We just released🤗optimum-nvidia 0.1.0b2 which brings many quality-of-life improvements, should ease the developer experience & extend NVIDIA GPU architectures coverage to Volta and Turing!
Check it out! 🚀
Release notes 📜: https://t.co/1ayU7KUZaJ
As part of @huggingface growth, we have been working on extending our data center capacities to meet the new challenges ahead 🚀😎.
Proud to partner with @VerneGlobal on sustainable AI powered by 100% renewable energy 🌎.