Adrien Pacifico

@psyfico

#Python #Economics #DataScience #OpenData #DoStuffWithData

Marseille, France

Joined January 2016

999 Following

233 Followers

1.6K Posts

psyfico retweeted

Chroma

@trychroma

2 months ago

Introducing Chroma Context-1, a 20B parameter search agent. > pushes the pareto frontier of agentic search > order of magnitude faster > order of magnitude cheaper > Apache 2.0, open-source

141

403

psyfico retweeted

Maitre Eolas🇫🇷 @Maitre_Eolas

4 months ago

The National Assembly has just decided that police officers may kill citizens without having to explain why.

459

10K

478

426K

psyfico retweeted

Andrew Ng

@AndrewYNg

9 months ago

Parallel agents are emerging as an important new direction for scaling up AI. AI capabilities have scaled with more training data, training-time compute, and test-time compute. Having multiple agents run in parallel is growing as a technique to further scale and improve performance.

We know from work at Baidu by my former team, and later OpenAI, that AI models’ performance scales predictably with the amount of data and training computation. Performance rises further with test-time compute such as in agentic workflows and in reasoning models that think, reflect, and iterate on an answer. But these methods take longer to produce output. Agents working in parallel offer another path to improve results, without making users wait.

Reasoning models generate tokens sequentially and can take a long time to run. Similarly, most agentic workflows are initially implemented in a sequential way. But as LLM prices per token continue to fall (thus making these techniques practical) and product teams want to deliver results to users faster, more and more agentic workflows are being parallelized.

Some examples:

- Many research agents now fetch multiple web pages and examine their texts in parallel to try to synthesize deeply thoughtful research reports more quickly.
- Some agentic coding frameworks allow users to orchestrate many agents working simultaneously on different parts of a code base. Our short course on Claude Code shows how to do this using git worktrees.
- A rapidly growing design pattern for agentic workflows is to have a compute-heavy agent work for minutes or longer to accomplish a task, while another agent monitors the first and gives brief updates to the user to keep them informed. From here, it’s a short hop to parallel agents that work in the background while the UI agent keeps users informed and perhaps also routes asynchronous user feedback to the other agents.

It is difficult for a human manager to take a complex task (like building a complex software application) and break it down into smaller tasks for human engineers to work on in parallel; scaling to huge numbers of engineers is especially challenging. Similarly, it is also challenging to decompose tasks for parallel agents to carry out. But the falling cost of LLM inference makes it worthwhile to use a lot more tokens, and using them in parallel allows this to be done without significantly increasing the user’s waiting time.

I am also encouraged by the growing body of research on parallel agents. For example, I enjoyed reading “CodeMonkeys: Scaling Test-Time Compute for Software Engineering” by Ryan Ehrlich and others, which shows how parallel code generation helps you to explore the solution space. The mixture-of-agents architecture by Junlin Wang is a surprisingly simple way to organize parallel agents: Have multiple LLMs come up with different answers, then have an aggregator LLM combine them into the final output.

There remains a lot of research as well as engineering to explore how best to leverage parallel agents, and I believe the number of agents that can work productively in parallel, like the humans who can work productively in parallel, will be very high.

[Original text, with links: https://t.co/ElcJZyzcfw ]

119

280

324K
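The mixture-of-agents idea described in the post above (several models answer in parallel, then an aggregator combines the answers) can be sketched in a few lines. This is only an illustration under loud assumptions: the three `proposer_*` functions are hypothetical stand-ins that return fixed strings, not real LLM calls, and the aggregator is a simple majority vote, whereas the post describes an aggregator that is itself an LLM.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for LLM calls: each "proposer" is a plain function.
def proposer_a(question: str) -> str:
    return "Paris"

def proposer_b(question: str) -> str:
    return "Paris"

def proposer_c(question: str) -> str:
    return "Lyon"

def aggregate(answers):
    """Toy aggregator: majority vote (a real one would be another model call)."""
    return Counter(answers).most_common(1)[0][0]

def mixture_of_agents(question, proposers):
    # Run every proposer concurrently; pool.map preserves the input order.
    with ThreadPoolExecutor(max_workers=len(proposers)) as pool:
        drafts = list(pool.map(lambda propose: propose(question), proposers))
    return aggregate(drafts), drafts

final, drafts = mixture_of_agents(
    "What is the capital of France?",
    [proposer_a, proposer_b, proposer_c],
)
print(drafts, final)
```

Because the proposers run concurrently, the wall-clock time is roughly that of the slowest proposer plus the aggregation step, which is the "more tokens without more waiting" trade-off the post describes.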

psyfico retweeted

GitHub Projects Community

@GithubProjects

11 months ago

Run a full virtual desktop inside a Docker container, accessible via WebRTC, right from your browser.

485

415K

psyfico retweeted

Tivadar Danka

@TivadarDanka

11 months ago

A question we never ask: "How large is that number in the Law of Large Numbers?" Sometimes, a thousand samples are large enough. Sometimes, even ten million samples fall short. How do we know? I'll explain. https://t.co/km0lBa4lUY

395

357

36K
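One way to put numbers on the Law of Large Numbers question above is Chebyshev's inequality, which guarantees P(|sample mean - true mean| >= eps) <= variance / (n * eps^2). The sketch below is not the thread's own explanation (the linked thread is not reproduced here); it only computes the sample size that this bound says is sufficient. The two example distributions, the tolerance `eps`, and the failure probability `delta` are assumptions chosen for illustration, and Chebyshev is conservative, so the true requirement can be smaller.

```python
from fractions import Fraction
from math import ceil

def chebyshev_sample_size(variance, eps, delta):
    """Smallest n such that variance / (n * eps**2) <= delta."""
    return ceil(variance / (eps**2 * delta))

def payoff_variance(p, payoff):
    """Variance of a variable that pays `payoff` with probability p, else 0."""
    return payoff**2 * p * (1 - p)

# Assumed targets: mean within 0.05 of the truth, failing at most 5% of the time.
eps = Fraction(1, 20)
delta = Fraction(1, 20)

# Fair coin (pays 1 with probability 1/2): variance 1/4.
coin_variance = payoff_variance(Fraction(1, 2), 1)

# Hypothetical lottery (pays 1,000,000 with probability 1/1,000,000): mean 1.
lottery_variance = payoff_variance(Fraction(1, 10**6), 10**6)

n_coin = chebyshev_sample_size(coin_variance, eps, delta)
n_lottery = chebyshev_sample_size(lottery_variance, eps, delta)
print(n_coin, n_lottery)
```

Both variables have a mean of order one, yet the heavy-tailed one has a variance about four million times larger, and the sufficient sample size grows in proportion to the variance.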

psyfico retweeted

stefano palombarini @StefPalomba

about 1 year ago

I don't know whether we were listened to. We'll see. But we publicly voiced our disagreements, and absolutely no one made us pay for it, even in the slightest. [5/x]

343

14K

psyfico retweeted

Senior PowerPoint Engineer

@ryxcommar

over 1 year ago

setting up a Python environment

886

29K

psyfico retweeted

Nicolas Hervieu @N_Hervieu

over 1 year ago

A read in the public interest. This is why foreigners with legal status (who work and live peacefully in our country) often end up undocumented. In disregard of their rights and to the detriment of everyone's interest (except those responsible for this chaos...)

382

221

125

33K

psyfico retweeted

Sumanth

@Sumanth_077

over 1 year ago

This repository is absolute gold for all Data Science and Machine Learning practitioners! Best ideas and solutions shared by top performers in the Kaggle competitions: https://t.co/YXEVDNnmJ3 https://t.co/tl66qdjEeB

346

393

20K

psyfico retweeted

Santiago

@svpino

over 1 year ago

Another step closer to having AI write code better than humans!

The new release of AlphaCodium, an open-source state-of-the-art code generation tool, outperforms directly prompting OpenAI when generating code.

This is a huge deal. The research team @QodoAI tested this on the Codeforces Code Contest benchmark, and the leap is huge:

Using o1-preview

• Direct prompting: 55%
• AlphaCodium: 78%

Using o1-mini

• Direct prompting: 53%
• AlphaCodium: 74%

These results make AlphaCodium the best approach to generate code we've seen so far.

I'm linking to a blog post with more information, the paper, and the GitHub repository below, but here is a 30-second summary of how AlphaCodium works:

AlphaCodium relies on an iterative process that repeatedly runs and fixes the generated code using the testing data.

1. The first step is to have the model reason about the problem. They describe it using bullet points and focus on the goal, inputs, outputs, rules, constraints, and any other relevant details.

2. Then, they make the model reason about the public tests and come up with an explanation of why the input leads to that particular output.

3. The model generates two to three potential solutions in text and ranks them in terms of correctness, simplicity, and robustness.

4. Then, it generates more diverse tests for the problem, covering cases not part of the original public tests.

5. Iteratively, pick a solution, generate the code, and run it on a few test cases. If the tests fail, improve the code and repeat the process until the code passes every test.

There's a lot more information in the paper and the blog post. Here are the links:

• Blog: https://t.co/6VZYWMiBAj
• Paper: https://t.co/OFzeRGJwl7
• Code: https://t.co/rcGwx22ybk

I attached an image comparing AlphaCodium with direct prompting using different models.

580

796

73K
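Steps 4 and 5 of the summary above (extra tests, then a run-and-fix loop) can be sketched without any model at all. This is a toy under stated assumptions, not AlphaCodium's code: the "model" is replaced by a hardcoded, ranked list of candidate functions, "improving the code" is simulated by moving on to the next candidate, and the factorial problem, the test cases, and every function name are hypothetical.

```python
def run_tests(solution, tests):
    """Return the (input, expected) pairs that the candidate fails."""
    failures = []
    for arg, expected in tests:
        try:
            if solution(arg) != expected:
                failures.append((arg, expected))
        except Exception:
            failures.append((arg, expected))
    return failures

def iterate_until_passing(candidates, tests):
    """Try ranked candidates in order; stop at the first that passes every test."""
    for attempt, candidate in enumerate(candidates, start=1):
        if not run_tests(candidate, tests):
            return candidate, attempt
    return None, len(candidates)

def buggy_factorial(n):
    result = 1
    for k in range(1, n):  # off by one: stops at n - 1
        result *= k
    return result

def fixed_factorial(n):
    result = 1
    for k in range(1, n + 1):
        result *= k
    return result

public_tests = [(1, 1)]               # stands in for the given public tests
generated_tests = [(0, 1), (5, 120)]  # stands in for step 4's extra tests
all_tests = public_tests + generated_tests

chosen, attempt = iterate_until_passing(
    [buggy_factorial, fixed_factorial], all_tests
)
print(chosen.__name__, attempt)
```

The buggy candidate passes the single public test and is only rejected by the additional tests, which is the motivation the summary gives for generating them.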

Adrien Pacifico @psyfico

almost 2 years ago

@AA_Avocats Presumably the same will apply to the measures on the RIO this coming October 11?

104

psyfico retweeted

j'dis ça j'dis rien ⏚ @jdicajdisrien

almost 2 years ago

My neighbor worked for 20 years as a secretary in a car repair shop. In June she went to the rectorat (the regional education authority), and after a 10-minute interview she became a primary school teacher. In a week she will have a class in a school with a post reserved for contract staff (that is, schools that are not too difficult, which are therefore withdrawn from and blocked for tenured teachers). She will get 4 days of training this week. The rectorat asks them not to tell parents that they are not tenured. 1 teacher in 10 is a contract teacher. Some of them are surely very good. My neighbor is nice. But who can believe that mass recruitment of this kind, together with nonexistent training, could fail to have a harmful effect on your children's schooling?

463

333

789K

psyfico retweeted

Philipp Heimberger @heimbergecon

almost 2 years ago

This is a very useful reading list of recent advances in econometrics.

284

152K

psyfico retweeted

Gael Varoquaux 🦋 @GaelVaroquaux

almost 2 years ago

⚡️ CARTE: toward table foundation models⚡️ https://t.co/8bfcc5cNOc Why foundation models for tables are hard, and why we have made significant headway with “CARTE” Published at #ICML2024 🧵 1/7

116

16K

psyfico retweeted

œconomicus @adewed00

almost 2 years ago

No. https://t.co/lzQHvZlptg

499

534

172K

psyfico retweeted

polars data

@DataPolars

almost 2 years ago

We are happy to announce Python Polars 1.0! https://t.co/cV8EFLRxyM

625

142

59K

psyfico retweeted

François Malaussena @malopedia

almost 2 years ago

I can't sleep. So I'm going to write. What I think Macron is attempting, and how we can get out of it.

279

25K

11K

psyfico retweeted

Charlie Marsh

@charliermarsh

about 2 years ago

Home Assistant (68k stars) migrated to uv. They now save over five hours of execution time on each build...

250

215K

psyfico retweeted

Matt Harrison

@__mharrison__

about 2 years ago

I enjoyed the talk "Accelerating Pandas with Zero Code Change using RAPIDS cuDF" at #GTC2024.

One of Pandas' major drawbacks is its lack of a "query engine," which leads to eager execution of all operations. More modern tools like Polars and DuckDB are designed around a query engine, resulting in significantly faster performance for tasks such as grouping.

By simply using cuDF, you can transform slow Pandas code into fast code, often achieving a 2-10x improvement over Polars and DuckDB.

People often ask me which tool they should use, and the answer is usually more complex than a single sentence.

If you're looking to boost the speed of your Pandas code today, cuDF is the simplest way to achieve significant performance gains without having to rewrite ANY of your code.

10K
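The contrast the post above draws between eager execution and a query engine can be illustrated with a toy in plain Python. This sketch does not use or imitate Pandas, Polars, DuckDB, or cuDF; `LazyFrame` here is a hypothetical class written for this example, and its one-rule "optimizer" (run filters before derived columns) is only valid under the assumption that filters refer to source columns and derived columns never overwrite them.

```python
counter = {"calls": 0}

def expensive_total(row):
    """Stand-in for a costly per-row computation; counts how often it runs."""
    counter["calls"] += 1
    return row["price"] * row["qty"]

class LazyFrame:
    """Toy lazy table: records operations and runs them only on collect()."""

    def __init__(self, rows, plan=()):
        self.rows = rows
        self.plan = plan

    def with_column(self, name, fn):
        return LazyFrame(self.rows, self.plan + (("map", name, fn),))

    def filter(self, column, value):
        return LazyFrame(self.rows, self.plan + (("filter", column, value),))

    def collect(self):
        # One-rule optimizer: apply every filter before any derived column.
        rows = self.rows
        for kind, column, value in self.plan:
            if kind == "filter":
                rows = [r for r in rows if r[column] == value]
        for kind, name, fn in self.plan:
            if kind == "map":
                rows = [{**r, name: fn(r)} for r in rows]
        return rows

rows = [
    {"city": "Marseille", "price": 2, "qty": 3},
    {"city": "Lille", "price": 5, "qty": 1},
    {"city": "Marseille", "price": 1, "qty": 4},
]

# Eager: each step runs immediately, so the costly column is built for all rows.
eager = [{**r, "total": expensive_total(r)} for r in rows]
eager = [r for r in eager if r["city"] == "Marseille"]
eager_calls = counter["calls"]

# Lazy: the same steps in the same order, but reordered when collected.
counter["calls"] = 0
lazy = (
    LazyFrame(rows)
    .with_column("total", expensive_total)
    .filter("city", "Marseille")
    .collect()
)
lazy_calls = counter["calls"]
print(eager_calls, lazy_calls, eager == lazy)
```

Both versions return the same rows, but the planned version skips the costly computation for the row that the filter discards.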
