Erwan Marechal

over 2 years ago

Open source AI models are on a path to overtake proprietary models.

275

267

276K

erwan_marechal retweeted

Former CTO IBM France / Administrateur indépendant certifié IFA

almost 3 years ago

Also: a model with more parameters is not necessarily better. It's generally more expensive to run and requires more RAM than a single GPU card can have. GPT-4 is rumored to be a "mixture of experts", i.e. a neural net consisting of multiple specialized modules, only one of which is run on any particular prompt. So the effective number of parameters used at any one time is smaller than the total number.

784

172K

Who to follow

teyssedre

@MHTEYSSEDRE

erwan_marechal retweeted

Jay Gambetta @jaygambetta

almost 3 years ago

Dear journalists, it makes absolutely no sense to write: "PaLM 2 is trained on about 340 billion parameters. By comparison, GPT-4 is rumored to be trained on a massive dataset of 1.8 trillion parameters." It would make more sense to write: "PaLM 2 possesses about 340 billion parameters and is trained on a dataset of 2 billion tokens (or words). By comparison, GPT-4 is rumored to possess a massive 1.8 trillion parameters trained on untold trillions of tokens." Parameters are coefficients inside the model that are adjusted by the training procedure. The dataset is what you train the model on. Language models are trained with tokens that are subword units (e.g. prefix, root, suffix). Saying "trained a dataset of X billion parameters" reveals that you have absolutely no understanding of what you're talking about.

132

511

757

erwan_marechal retweeted

almost 3 years ago

Happy to see an IBM Quantum System One operational in Quebec.

133

12K

erwan_marechal retweeted

Paul Krugman

@paulkrugman

almost 3 years ago

An inflation update: in the past I've focused on a measure that excludes lagging shelter and used cars as well as food and energy. Just to note that it adds to the evidence that inflation has been largely defeated

paulkrugman's tweet photo. An inflation update: in the past I've focused on a measure that excludes lagging shelter and used cars as well as food and energy. Just to note that it adds to the evidence that inflation has been largely defeated https://t.co/nDw76i1jD5

462

331

almost 3 years ago

Voici une annonce qui m'intéresse https://t.co/MwUxDCtCV3 Je suis vraiment curieux et enthousiaste de pouvoir rendre consommable à mes clients LLAMA-2. Je suis admiratif du travail de Meta sur l'IA et plus spécifiquement sur LLAMA-2

erwan_marechal retweeted

almost 3 years ago

This is huge: Llama-v2 is open source, with a license that authorizes commercial use! This is going to change the landscape of the LLM market. Llama-v2 is available on Microsoft Azure and will be available on AWS, Hugging Face and other providers Pretrained and fine-tuned models are available with 7B, 13B and 70B parameters. Llama-2 website: https://t.co/PKrrXgHdem Llama-2 paper: https://t.co/aINNrXNhMb A number of personalities from industry and academia have endorsed our open source approach: https://t.co/N7HwgW9Suh

384

15K

Jay Gambetta @jaygambetta

almost 3 years ago

Voici un article intéressant sur les banques américaines. Il clarifie les spécifictés lié aux nombres de banques régionales (plus de 4K). Ces spécificités vont probablement impacter la finance mondiale, avec un flux de nouvelles p…https://t.co/EM5sq7u8Lt https://t.co/SbNhi9GAgw

erwan_marechal retweeted

almost 3 years ago

More exciting progress on our path on our path to quantum utility https://t.co/gwbXzNMm86

35K

about 3 years ago

Le risque d'une IA dans un domaine réglementé, comme la banque est bien expliqué ici selon moi . Si l'explicabilité est nécessaire pour l'obtention d'un prêt, je suis persuadé que ce type de décision ne pourra s'obtenir que par des…https://t.co/CFNwCO0cYZ https://t.co/ALSU0ZCAHB

erwan_marechal retweeted

Sanjeev Sharma

@sd_architect

about 3 years ago

Vaccines are not Rocket Science. They are a proven biological method to improve immunity against bacterial and viral infections that otherwise killed millions. Why are people suddenly against them?

167

about 3 years ago

This will change the practice of testing for the mainframe ; developers will be able to test their changes on an ephemerous workspace, set-up for the test and destroyed afterward. https://t.co/aORQcmQylm

about 3 years ago

Interesting to see combination of AI techniques to solve problems https://t.co/m8ZCZdXAXV

erwan_marechal retweeted

about 3 years ago

Super-human AI is nowhere near the top of the list of existential risks. In large part because it doesn't exist yet. Until we have a basic design for even dog-level AI (let alone human level), discussing how to make it safe is premature.

258

294

136

533K

about 3 years ago

Je suis persuadé que les ingénieurs utilisant l'IA remplaceront ceux qui ne le feront pas. Je pense que le besoin et le nombre d'ingénieurs augmentera...

about 3 years ago

This must be said and repeated. Yes, Geoff was totally wrong to predict a drop in radiologist positions. We knew that it was wrong when he said it. We have data now.

101

997

148

219

erwan_marechal retweeted

about 3 years ago

LIMA : LLaMA 65B + 1000 supervised samples = {GPT4, Bard} level performance. From @MetaAI https://t.co/FIuIo6agXa

429

917

629K

erwan_marechal retweeted

Sumit Gupta @SumitGup

about 3 years ago

This is exactly how I view the Generative AI stack starting with infrastructure, cloud, models, tools, applications, and services. https://t.co/aD0zrFK0pM

SumitGup's tweet photo. This is exactly how I view the Generative AI stack starting with infrastructure, cloud, models, tools, applications, and services. https://t.co/aD0zrFK0pM https://t.co/2qqtFZ1rJQ

721