Uso #python#rstats & #linux para datear, colecciono viniles, amo: las mates; escuchar wave y pop oscuro; la ciencia ficción; jugar wh4k y leer a Kierkegaard
Did you know that people tried to prove central limit theorem for over two centuries, first starting with de Moivre (1733), then almost a century after by Laplace who both used binomial distribution.
Then it was Poisson who worked on this theorem, and Chebyshev (1890–1891) who gave a rigorous demonstration of it in the middle of the nineteenth century.
At the beginning of the twentieth century, the Russian mathematician Liapounov, Aleksandr Mikhailovich (1901) created the generally recognized form of the central limit theorem by introducing its characteristic functions.
Markov, Andrei Andreevich(1908) also worked on it and was the first to generalize the theorem to the case of independent variables.
In 1924 Kolmogorov started to become interested in research in Probability Theory and in 1928 he was able for the first time to formulate necessary and sufficient conditions of the Law of Large Numbers that escaped other best mathematicians of the time for many decades.
It has taken the best mathematicians almost two centuries to prove conditions for LLN and prove CLT.
In fact there is almost 500 (!) pages book describing the history of CLT.
https://t.co/UcjmVxs9f2
#statistics #machinelearning #gaussian
ML models can only be as good as the data they are trained on.
Key aspects of data-centric AI:
- Data cleaning
- Handle outliers
- Account for annotator-quality
- Active learning
Folks at MIT have open-sourced a python library that does this all for you 🚀
Read more 🧵👇
Scikit-learn is one of the most useful and important Python 🐍 libraries for machine learning.
A great cheat sheet that will help you decide which algorithm to use:
How to write clean code:
"Always code as if the guy who ends up maintaining your code will be a violent psychopath who knows where you live."
― John Woods
Learn MLOps in a free online course from @DataTalksClub!
- Processes
- Training models
- Serving models
- Monitoring
- Best practices
More info here: https://t.co/4BsQpAgABb
Developing an ML model is the easy part...
Keeping it healthy on production is the challenge.
Wanna get biweekly articles and tips on real-world ML?
👉🏽 https://t.co/97E4zKMjQM
#ml#machinelearning#MLOps