🤗 Tokenizers v0.8.0 is out with many new features and improvements:
- Up to 10x faster to train
- Ability to encode pre-tokenized inputs for NER datasets and such
- Saving/Loading/Pickling tokenizers now takes a single line of code
- Compatibility with Python multiprocessing
🇫🇷 Je suis disponible pour une mission iOS en freelance; +10ans d'xp dans le mobile, j'ai récemment travaillé sur les superbes apps de @mojo_video_app, @GoPro et @zenly. Remote 🌍
Envoyez moi un petit message. Retweet apprécié 🤗
Jobs, jobs, jobs 📢
We continue to grow and are looking for the next hugging faces!
We are looking for a System Administrator to manage our internal infrastructure.
More details here: https://t.co/OVHvk3YfXX
🎉🎉Personal Update: Today is my first day @huggingface. I will be joining the research team in Palo Alto, CA. I am excited to do amazing open-science multimodal research here so look out for new things from me. 🤗
Static Spaces (ie. raw html/js web apps) are now promoted to a first class option on https://t.co/jv3Lwmcj95 ⤵️
Anything you could host on a GitHub Pages, you can host on @huggingface Spaces!
Where is the @huggingface team residing? Out of 100 team members, we have 20 different countries, #1 the USA, #2 France, and then Canada, Switzerland, Germany, Belgium, Brazil, China, Croatia, UK, India, Ireland, Netherlands, Nigeria, Norway, Russia, Spain, Turkey, Uruguay!
TODAY'S A BIG DAY
Spaces are now publicly available
Build, host, and share your ML apps on @huggingface in just a few minutes.
There's no limit to what you can build. Be creative, and share what you make with the community.
🙏 @streamlit and @gradio
https://t.co/KyehQt3Z8u
Our usage & revenue is 📈 so we're hiring in customer success, infrastructure, backend, frontend, product, full-stack, hardware optimization, and more in NYC, Paris & remote. Here's how you can apply: https://t.co/7Cm8NVuHyC!
You can now fine-tune (almost) ANY Transformer model on the HuggingFace Hub with AutoNLP! 🤯
🧠 No code
💁♀️ Nothing to install
💥 Choose from THOUSANDS of models
🚀 Instantly ready for deployment
⚡ In a matter of minutes
➡ https://t.co/iGJ0802guD
📢 Introducing 🤗 Optimum
A new open source library to optimize 🤗Transformers for production performance. 🏎
Quantize, Prune, Optimize models easily, targeting hardware from our partners @intel@graphcoreai@Qualcomm! 🤩
https://t.co/oemVDWlnxI
We just added support for **Tensorboard for private models** 🔥
Just push your TensorBoard traces to any model repo (public or private) and we automatically spawn a TensorBoard server to visualize them.
See latest models that include TensorBoard traces➡️ https://t.co/weJdTmjroI
@ovh_support_fr Après un peu plus de 24h, la mise à jour est finalement terminée ! Je suis surpris par le temps que ça aura pris, mais tout est bon maintenant
@ovh_support_fr Bonjour, j'ai changé les serveurs DNS d'un domaine hier matin et la modification est toujours "en cours". Aucune réponse sur mon ticket 2824335 créé aussi hier matin. Peut-être pourrez vous m'aider ? Merci !
Thanks to the 914 @huggingface contributors that have made this possible!
What happened over the last 10k stars?
- 🔊: Wav2Vec2, XLSR, Hubert, S2T
- 🖼️: ViT, DeiT, CLIP, DETR, VisualBERT
- Added support for Jax/Flax with dozens with architectures supported!
New in 🤗 Datasets v1.11:
🇷🇺 RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark
❓ Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering
🗣️ TimeDial: Temporal Commonsense Reasoning in Dialog
For the summer, we are applying the official recommendations for the machines in our @huggingface office in Paris 🇫🇷 ... Staying in a fresh spot, correctly hydrated ❄️🤗
The easiest way to train Transformers on your data:
🌐 15 languages & 5 tasks supported
🚀 Ready to serve instantly
🔒 Secure with private models and datasets
🤯 NO code needed !!
That's #AutoNLP on the web.
That's democratization of #AI.
Try it 👉https://t.co/NwlmXUaqsz
🚨New 🤗Hub feature alert: you can now easily rename your models/datasets/spaces, and even transfer them to one of your orgs! 😎
Try it now from any of your repos' settings page ⬇️
LOW-code 👉 NO-code 🤯
A few clicks are all you'll need to train & deploy state-of-the-art NLP models on your own datasets.
Take a peek into the future with this teaser of the 🤗 AutoNLP experience on https://t.co/HlHI7TXWep 😍 🚀
Join the beta! https://t.co/ItJPOSYK8K
We made it easier to train a new tokenizer on a given corpus with 🤗Transformers and 🤗Tokenizers — check out this new example done with @LucileSaulnier!
https://t.co/cKhFRWO2bf
Similarly to @Github copilot, you can now do question-answering in Google Sheet thanks to the TAPAS model from @GoogleAI & the @huggingface inference API.
Machine Learning making its way in each and every product! Great job @osanseviero!