🗂️ Multi-label text classification weak labeling
Get started with this brand new feature with this tutorial by @vid_algo
https://t.co/NwaSvGykJF
#python#opensource#datacentricai
🎉Today we crossed 900 stars on GitHub 🌟
If you don't know Rubrix yet 👉🏽 https://t.co/ReXgN3Chsu
Thanks everyone for your support!
We're just getting started.
Follow us @rubrixml#Python#opensource#NLProc
Training a Hugging Face text classifier directly from search queries?
User Weasel & Rubrix 👉
https://t.co/7jP7RuhQgy
Weasel: End-to-End weak supervision by @CachaySalva & @BenBoecking:
https://t.co/PoJofSy0Ns
Rubrix:
https://t.co/ReXgN3Chsu
#nlproc#python#opensource
What is a text2text model?
In simple words: a model which given a text returns another text
Text summarization is one of such models
This is how to use the scitldr, summarization dataset by @allen_ai 👇
More task examples:
https://t.co/Kzp9BZIMfB
#nlproc#python#opensource
Epic new show out with @ylecun and @randall_balestr where we discuss their recent everything is extrapolation paper, interpolation and the curse of dimensionality, and also dig deep into Randall's work on the spline theory of deep learning. @DoctorDuggar@ecsquendor@ykilcher
The recent push_to_hub feature of the @huggingface datasets library is 🔥🔥🔥
This is how easy is to share a custom @rubrixml dataset with your own annotations
The result:
https://t.co/h6mLwN4oOq
Congrats @qlhoest, @avillanovamoral and team!
A simple active learning loop with the amazing ModAL library
Active learning for YT spam classification inside a Jupyter notebook, a tutorial by @vid_algo
https://t.co/GAbzY8ezh3
Rubrix: https://t.co/ReXgN3Chsu
ModAL: https://t.co/sVmygmRVa6
#nlproc#python#opensource
Fine-tune a Hugging Face transformers for your own domain
Iteratively build a training set and fine-tune a sentiment classifier for the banking domain
Tutorial by @dvilasuero
https://t.co/UbCHBM7EOo
#NLProc#python#opensource
Finding and 𝗰𝗼𝗿𝗿𝗲𝗰𝘁𝗶𝗻𝗴 label errors
1⃣ Train a text classifier, predict over the test set
2⃣ Find label errors with the built-in cleanlab integration
3⃣ Correct errors with the UI
Practical tutorial by @vid_algo
https://t.co/q54nCKBwhq
#nlproc#opensource#ml
Building a text classifier with Flyingsquid it's never been easier
Flyingsquid is a label model for fast and accurate weak supervision
Weak supervision guide:
https://t.co/NH3uCOZs2e
Flyingsquid: https://t.co/pfSemZWcy5
Rubrix:
https://t.co/ReXgN3Chsu
#nlproc#opensource#ml
Find label errors using your model's loss
Practical example finding 100s of label errors in the AGNews benchmark
Uploaded the dataset with losses to the @huggingface Hub
https://t.co/nxv5hvasQw
More at https://t.co/ReXgN3Chsu
#nlproc#datacentricAI#python#opensource
Rubrix auto-monitor now supports FlairNLP
Easily register NER predictions for pre-annotation, error analysis, fine-grained metrics and production monitoring
Available at https://t.co/ReXgN3Chsu
Find more details and guides bellow 👇
#nlproc#mlops#python#opensource
Build a news classifier from scratch with weak supervision
1. Programatically label 38.000 examples with rules and Snorkel.
2. Train a downstream classifier with scikit-learn to achieve 0.81 macro avg. f1-score.
Tutorial link below 👇
#nlproc#ml#datascience#opensource
El lunes @dvilasuero nos enseñó @rubrixml una herramienta opensource que ayuda en la creación de conjuntos de datos. Si te perdiste la charla puedes verla aquí:
https://t.co/CjLCpnbznn
Puedes contribuir al proyecto por cualquiera de los diferentes canales que tienen disponible
Rubrix: Python framework for data-centric NLP
Monitoring pipelines & predictions just got easier
Supports Hugging Face text/zero-shot classification pipelines & spaCy NER
https://t.co/DOrtloiBfH
Guide: https://t.co/KFT7AA1rnP
#MLOps#python#opensource#nlproc#datascience
📢 Científic@ de datos, lingüista, expert@ de producto y desarrollador@:
¡Esta charla gratuita de @NlpSpain es para tí!
3 días para que @dvilasuero nos presente @rubrixml y cómo crear, gestionar y cuidar tus datos de entrenamiento para PLN
#python#pln#ia#opensource