In this blog, I have listed various noises/distortions, and pre-processing steps to improve the document quality for different document sources. https://t.co/R3CUZwx2sy
#ocr#tesseract#ComputerVision
OCR Protip:
Understanding your document source and designing pre-processing pipeline accordingly boosts the OCR performance and makes your platform more robust.
#ocr#documentintelligence#TESSERACT#AbbyFineReader
If you are looking to get started in #MLOps and don't want to miss out on the best practices that surround it definitely check out this @Plural course by @MeAbhishekkumar -
Building End-to-end Machine Learning Workflows with Kubeflow:
https://t.co/taeFUTRzdL
Checkout my new blog on
“Multi-Label Text Classification”
A brief survey on Multi-label text classification and valuable comparison between various approaches used for multi-label classification.
#NLP#AI#DataScience#MachineLearning https://t.co/qwDwzBFv8t
Interested to discover hidden topics from your text data? Here is my blog post on *brief survey of Topic Modeling* and also a practical guide to implement Latent Dirichlet Allocation with Python
https://t.co/0VSeBcRzuL
#LDA#NLP#topicmodeling#Datascience#python
Stephen Hawking once said, "I'm not afraid of death, but I'm in no hurry to die. I have so much I want to do first." RIP Professor Hawking, and may we all strive to live as fully as he did.