Amin Sadou @aminsaadou - Twitter Profile

Amin Sadou @aminsaadou

7 months ago

HR Analytics report steps https://t.co/wYnocR7jmi

0

3

aminsaadou retweeted

elvis

@omarsar0

10 months ago

Everyone is talking about this new OpenAI paper. It's about why LLMs hallucinate. You might want to bookmark this one. Let's break down the technical details:

omarsar0's tweet photo. Everyone is talking about this new OpenAI paper.

It's about why LLMs hallucinate.

You might want to bookmark this one.

Let's break down the technical details: https://t.co/BQ3wSLtiHm

106

3K

456

7K

455K

aminsaadou retweeted

Lior Alexander

@LiorOnAI

10 months ago

AWS released an open-source framework that lets you orchestrate multiple AI agents and handle complex conversations. Can be deployed locally on your computer.

LiorOnAI's tweet photo. AWS released an open-source framework that lets you orchestrate multiple AI agents and handle complex conversations.

Can be deployed locally on your computer. https://t.co/J0Jp6AI7Kg

25

1K

123

2K

103K

aminsaadou retweeted

Poonam Soni

@CodeByPoonam

10 months ago

Google just changed note taking forever. NotebookLM got massive upgrades. 7 Powerful features you can't afford to miss:

CodeByPoonam's tweet photo. Google just changed note taking forever.

NotebookLM got massive upgrades.

7 Powerful features you can't afford to miss: https://t.co/PcaKFftfmX

31

1K

116

1K

176K

Who to follow

José Ruiz-Meza, Ph.D

@JosRuizMeza

Ph.D in Logistics and Supply Chain Management @unisabana. MSc Integral Logistics. Industrial Eng. Researcher

David H

@david01hunter

I still believe in IOT

aminsaadou retweeted

Lior Alexander

@LiorOnAI

about 2 years ago

NVIDIA just made Pandas 150x faster with zero code changes. It is now directly integrated in Google Colab. All you have to do is: %load_ext cudf.pandas import pandas as pd Their RAPIDS library will automatically know if you're running on GPU or CPU and speed up your processing. You can try it here: https://t.co/oq7qkVWztY

22

1K

240

997

150K

aminsaadou retweeted

Alex Reibman 🖇️

@AlexReibman

about 2 years ago

Meta launched Llama 3 to show the world what’s possible with open source LLMs. 500+ AI engineers just spent 24 hours straight putting it to the test. Here’s what we saw at the @AIatMeta x @cerebral_valley #Llama3Hackathon (🧵):

AlexReibman's tweet photo. Meta launched Llama 3 to show the world what’s possible with open source LLMs.

500+ AI engineers just spent 24 hours straight putting it to the test.

Here’s what we saw at the @AIatMeta x @cerebral_valley #Llama3Hackathon (🧵): https://t.co/D5yaLUjUJl

40

4K

436

3K

1M

aminsaadou retweeted

Valeriy M., PhD, MBA, CQF

@predict_addict

almost 3 years ago

Conformal prediction and SHAP are two developments that data scientists in 2023 should know. One might also be interested in Conformal SHAP that was presented in COPA conformal conference couple of years ago. https://t.co/kHDpOmysxb https://t.co/IKFwm74Q5b And in case you are curious about how conformal prediction is changing the landscape there is now Conformal Quantile Regression, Conformal Monte Carlo, Conformal VaR and more. #conformalprediction

predict_addict's tweet photo. Conformal prediction and SHAP are two developments that data scientists in 2023 should know.

One might also be interested in Conformal SHAP that was presented in COPA conformal conference couple of years ago.

https://t.co/kHDpOmysxb

https://t.co/IKFwm74Q5b

And in case you are curious about how conformal prediction is changing the landscape there is now Conformal Quantile Regression, Conformal Monte Carlo, Conformal VaR and more.

#conformalprediction

2

75

12

40

7K

aminsaadou retweeted

Jeremy Howard

@jeremyphoward

almost 3 years ago

I've spent the last few months interviewing >60 experts in law, economics, AI, alignment, etc, on the impacts of AI, and safety interventions. Today I'm publishing my first article, showing regulation designed to increase AI safety may backfire badly! https://t.co/NsZZ5H2BVH

73

2K

528

1K

761K

aminsaadou retweeted

Valeriy M., PhD, MBA, CQF

@predict_addict

about 3 years ago

Personally I quite like scikit-learn but, I am not convinced that the probability calibration section in scikit learn is well-written. TL;DR when it comes to classifier calibration scikit learn page is not a good source of information on the topic as it is neither up to date or factual. The issue with unfortunate naming of predict_proba is well known, Samuele Mazzanti has written a great article about it 'Python’s «predict_proba» Doesn’t Actually Predict Probabilities (and How to Fix It)' https://t.co/bquHDO2GX1 The part that is confusing users of scikit-learn users is that 'predict_proba'does not produce probabilities of the classes (it produces class scores). The estimates of class scores are not class probabilities by definition because they don't meet the basic definition and properties of probabilities as anyone who has studied Probability 1.01 knows. The other confusing part is that the authors mention that models producing poor estimates of the 'class probabilities' (should be 'class scores' instead) are not good, but don't mention explicitly anywhere that most if not all machine learning models produce poor estimates of class probabilities (deep learning is notorious but neither boosted trees nor other types of popular models produce calibrated provabilities either). The authors then suggest isotonic regression as a 'solution' but isotonic regression is an hoc tool that overfits on small datasets and assumes ROC AUC of 1 on the test set (correct ranking) which of course never happens in real life. Scikit learn then cites papers that are either 20 years old (Platt's scaler, isotonic regression), or papers that were laters debunked in their conclusion (e.g. 2005 classical paper by A. Niculescu-Mizil claimed that shallow neural networks were well calibrated by they aren't). TL;DR The scikit-learn page on Probability calibration is not an informative or correct source of information in 2023 and requires major revisions. #scikitlearn # #machinelearning #deeplearning #python #calibration

predict_addict's tweet photo. Personally I quite like scikit-learn but, I am not convinced that the probability calibration section in scikit learn is well-written.

TL;DR when it comes to classifier calibration scikit learn page is not a good source of information on the topic as it is neither up to date or factual.

The issue with unfortunate naming of predict_proba is well known, Samuele Mazzanti has written a great article about it 'Python’s «predict_proba» Doesn’t Actually Predict Probabilities (and How to Fix It)' https://t.co/bquHDO2GX1

The part that is confusing users of scikit-learn users is that 'predict_proba'does not produce probabilities of the classes (it produces class scores). The estimates of class scores are not class probabilities by definition because they don't meet the basic definition and properties of probabilities as anyone who has studied Probability 1.01 knows.

The other confusing part is that the authors mention that models producing poor estimates of the 'class probabilities' (should be 'class scores' instead) are not good, but don't mention explicitly anywhere that most if not all machine learning models produce poor estimates of class probabilities (deep learning is notorious but neither boosted trees nor other types of popular models produce calibrated provabilities either).

The authors then suggest isotonic regression as a 'solution' but isotonic regression is an hoc tool that overfits on small datasets and assumes ROC AUC of 1 on the test set (correct ranking) which of course never happens in real life.

Scikit learn then cites papers that are either 20 years old (Platt's scaler, isotonic regression), or papers that were laters debunked in their conclusion (e.g. 2005 classical paper by A. Niculescu-Mizil claimed that shallow neural networks were well calibrated by they aren't).

TL;DR The scikit-learn page on Probability calibration is not an informative or correct source of information in 2023 and requires major revisions.

#scikitlearn # #machinelearning #deeplearning #python #calibration

5

89

15

49

12K

aminsaadou retweeted

Valeriy M., PhD, MBA, CQF

@predict_addict

over 3 years ago

What is cooking in @nvidia R&D lab that has 300 PhDs? At GTC23 two weeks ago Nvidia chief scientist and SVP of research Bill Dally presented a behind-the-scenes peek at Nvidia’s R&D. One of the key 🔑 areas Better Uncertainty Prediction for Safer Driving using Conformal Prediction! “AI technology, of course, is a basic staple of autonomous driving. It’s what lets autonomous vehicles – using their cameras, radars and Lidar – interpret the environment around them and make predictions. Dally noted traditional machine learning will have a network and run a model against images and produce a single classification say a pedestrian or cyclist or car, and then make determinations about how it’s moving. What we’d really like to do is be able to output a [data] set where we can say it could be a pedestrian with some probability, could be a cyclist with some probability. If we’re judging the distance, rather than saying it’s a pedestrian at 20 meters, we could say it’s a pedestrian or possibly or a cyclist, and it’s 20 meters plus or minus one meter, because we need to make sure that we’re not going to miss this object. If our estimation is off by a meter that could wind up being a lot in difficult circumstances. So rather than getting a single output from our neural network, we’d like to have a set output, a set of possible classes (objects) and a set of possible positions usually centered around a mean estimate position. Given that output set, we can run our planner, so we can guarantee a Safe Action for any element of that set,” said Dally. A technique called conformal prediction is used to accomplish the task (see slide below). Broadly, this approach includes a non-conformance function to inject a range probability or confidence around many possible solutions.” https://t.co/djhHz6Wc2I #NVIDIA #conformalprediction #machinelearning #technology

predict_addict's tweet photo. What is cooking in @nvidia R&D lab that has 300 PhDs?

At GTC23 two weeks ago Nvidia chief scientist and SVP of research Bill Dally presented a behind-the-scenes peek at Nvidia’s R&D.

One of the key 🔑 areas Better Uncertainty Prediction for Safer Driving using Conformal Prediction!

“AI technology, of course, is a basic staple of autonomous driving. It’s what lets autonomous vehicles – using their cameras, radars and Lidar – interpret the environment around them and make predictions. Dally noted traditional machine learning will have a network and run a model against images and produce a single classification say a pedestrian or cyclist or car, and then make determinations about how it’s moving.

What we’d really like to do is be able to output a [data] set where we can say it could be a pedestrian with some probability, could be a cyclist with some probability. If we’re judging the distance, rather than saying it’s a pedestrian at 20 meters, we could say it’s a pedestrian or possibly or a cyclist, and it’s 20 meters plus or minus one meter, because we need to make sure that we’re not going to miss this object. If our estimation is off by a meter that could wind up being a lot in difficult circumstances. So rather than getting a single output from our neural network, we’d like to have a set output, a set of possible classes (objects) and a set of possible positions usually centered around a mean estimate position. Given that output set, we can run our planner, so we can guarantee a Safe Action for any element of that set,” said Dally.

A technique called conformal prediction is used to accomplish the task (see slide below). Broadly, this approach includes a non-conformance function to inject a range probability or confidence around many possible solutions.”

https://t.co/djhHz6Wc2I

#NVIDIA #conformalprediction #machinelearning #technology

3

152

20

67

23K

aminsaadou retweeted

Michael C. Frank

@mcxfrank

over 3 years ago

People are testing large language models (LLMs) on their "cognitive" abilities - theory of mind, causality, syllogistic reasoning, etc. Many (most?) of these evaluations are deeply flawed. To evaluate LLMs effectively, we need some principles from experimental psychology.🧵

mcxfrank's tweet photo. People are testing large language models (LLMs) on their "cognitive" abilities - theory of mind, causality, syllogistic reasoning, etc. Many (most?) of these evaluations are deeply flawed. To evaluate LLMs effectively, we need some principles from experimental psychology.🧵 https://t.co/Xlywu3gTVz

36

2K

466

1K

549K

aminsaadou retweeted

Allen Downey @AllenDowney

over 3 years ago

Exploring the effect of researcher choices on statistical results: 73 teams estimate the same effect size with the same data, and generate 1,253 different results. As always, statistical results depend on modeling decisions. https://t.co/xLMK2qLl15

5

103

29

56

18K

aminsaadou retweeted

François Chollet

@fchollet

over 3 years ago

The gap between generating words and generating meaning will prove hard to close. Generating meaning requires *a model of the things being talked about* (language as communication), while generating words only requires a model of the structure of text (language as statistics)

17

598

98

45

53K

aminsaadou retweeted

Miguel de Carvalho @mbcarvalho

over 5 years ago

Data Science is a misnomer; we should be talking about Data Sciences. Stats, ML, AI, Psychometrics, Econometrics, are examples of Data Sciences. The plural form unites the fields.

mbcarvalho's tweet photo. Data Science is a misnomer; we should be talking about Data Sciences. Stats, ML, AI, Psychometrics, Econometrics, are examples of Data Sciences. The plural form unites the fields. https://t.co/y3mZMMtwW7

0

31

11

1

0

aminsaadou retweeted

Stacey Hendriks, PhD @SHendriks_PhD

almost 4 years ago

Top 5 tools that I used during my #PhD and continue to use throughout my early #research career. A thread 🧵: @OpenAcademics @PhDVoice @PostdocVoice #AcademicTwitter #AcademicChatter If you have tools you find helpful I'm always looking for new ways to improve my workflow 👌

24

631

155

619

0

aminsaadou retweeted

Posit PBC @posit_pbc

almost 4 years ago

📢 Announcing keynote and talk recordings from rstudio::conf(2022)! ⬡ Keynotes: https://t.co/4cvPNAokhn ⬡ Talks: https://t.co/HTLwKREkEF Keep an eye out for a blog post that summarizes conf materials 👀

1

271

69

80

0

aminsaadou retweeted

Cédric Scherer @CedScherer

almost 4 years ago

"Graphic Design with #ggplot2" 👨‍💼👩‍💻🧑‍💻 Do you want to recap the 2-day workshop at #rstudioconf? Or do you feel sad you've missed it? 🔥 All course material incl. latest updates can be found on the workshop webpage—9 sessions, 760 slides, 314 ggplots! 👉 https://t.co/TyBRWQOW7R

CedScherer's tweet photo. "Graphic Design with #ggplot2" 👨‍💼👩‍💻🧑‍💻

Do you want to recap the 2-day workshop at #rstudioconf? Or do you feel sad you've missed it?

🔥 All course material incl. latest updates can be found on the workshop webpage—9 sessions, 760 slides, 314 ggplots!

👉 https://t.co/TyBRWQOW7R https://t.co/gx9iXmkFSw

23

2K

466

1K

0

aminsaadou retweeted

Mine Çetinkaya-Rundel @minebocek

almost 4 years ago

Abstinence-only education doesn't work, not for anything and not for spreadsheets. Here are some tips for better spreadsheets from myself and a few others, instead of the not-so-helpful advice of "avoid spreadsheets".

3

84

16

17

0

aminsaadou retweeted

Davis Blalock

@davisblalock

almost 4 years ago

"Language Models Can Teach Themselves to Program Better" This paper changed my thinking about what future langauge models will be good at, mostly in a really concerning way. Let's start with some context: [1/11]