Two economists just published a mathematical proof that AI will destroy the economy.
Not might. Not could. Will — if nothing changes.
The paper is called "The AI Layoff Trap." Published March 2, 2026. Wharton School, University of Pennsylvania. Boston University. Peer reviewed. Mathematically modeled.
The conclusion is one sentence.
"At the limit, firms automate their way to boundless productivity and zero demand."
An economy that produces everything. And sells it to nobody.
Here is how you get there.
A company fires 500 workers and replaces them with AI. A competitor fires 700 to keep up. Another fires 1,000. Every company is behaving rationally. Every company is following the incentives correctly. And every company is building a trap for itself.
Because the workers who were fired were also customers.
When they lose their jobs faster than the economy can absorb them, they stop spending. Consumer demand falls. Companies respond by cutting costs — which means automating more workers — which means less spending — which means more falling demand — which means more automation.
The loop has no natural exit.
The researchers tested every proposed solution. Universal basic income. Capital income taxes. Worker equity participation. Upskilling programs. Corporate coordination agreements.
Every single one failed in the model.
The only intervention that worked: a Pigouvian automation tax — a per-task levy charged every time a company replaces a human with AI, forcing them to price in the demand they are destroying before they pull the trigger.
No government has implemented this. No major economy is seriously discussing it.
Meanwhile the numbers are already tracking the curve. 100,000 tech workers laid off in 2025. 92,000 more in the first months of 2026. Jack Dorsey fired half of Block's workforce and said publicly: "Within the next year, the majority of companies will reach the same conclusion."
Nobody is doing anything wrong. Companies are following their incentives perfectly. That is exactly the problem.
Rational behavior. At scale. Simultaneously. With no mechanism to stop it.
Two economists built the math. The math leads to one place.
Source: Falk & Tsoukalas · Wharton School + Boston University ·
Become a World Bank Group Pioneer.
𝐀𝐩𝐩𝐥𝐢𝐜𝐚𝐭𝐢𝐨𝐧𝐬 𝐟𝐨𝐫 𝐨𝐮𝐫 𝐈𝐧𝐭𝐞𝐫𝐧𝐬𝐡𝐢𝐩 𝐏𝐫𝐨𝐠𝐫𝐚𝐦 𝐨𝐩𝐞𝐧 𝐉𝐚𝐧 𝟏𝟗 – 𝐅𝐞𝐛 𝟏𝟕, 𝟐𝟎𝟐𝟔.
Join a live info session to learn how to apply and ask questions:
• Jan 21, 11 AM ET (HQ+LAC) – https://t.co/s51ZQsqdqS
• Jan 27, 8 AM ET (EU+ME) – https://t.co/DXvcXcc0g7
• Feb 4, 9 PM ET (Asia) – https://t.co/kgqzfotWc9
Start your journey: https://t.co/fbREbid8tr
How Much Should We Trust Synthetic Control Estimates?
Economists are often concerned with causality. We would like to know the effect of X on Y, and we would like to know what causes what. The ideal case for discovering causality is one where the receipt of treatment X is totally uncorrelated with their attributes. If it is correlated, then we would be unable to say whether it was really the treatment X which caused the change in Y, or the attributes which they had. We could try and control for attributes – which is to say, we compare only the effect within certain types – but this means we are unable to rule out unobservable attributes affecting the results.
In a perfect world, we randomize. Economists like randomization because, conditional upon being in the sample which is randomized, attributes both observed and unobserved are uncorrelated with treatment. If you want to know the effect of a medication, then you just give medicine to some and a placebo to others. Nice, clean, easy. Alternatively, we can have some variable Z which is correlated with X and uncorrelated with Y conditional on X – an instrumental variable.
We often do not have randomization available, however. Suppose that we are studying the effect of an increase in the minimum wage on unemployment in the hypothetical state of “New Jersey”. There are two basic ways to study this. The first is to construct a structural model of workers, specify their utility functions, the conduct with which firms compete, and so on, and then simulate various counterfactuals. This just kicks identification down the road, though – you now need exogenous shocks to identify all these primitives.
The other is to just look at what happened. Card and Krueger (1994) studied this exact scenario in New Jersey. Their argument for identification is that the areas of Philadelphia in Pennsylvania are affected by the same trends as the areas in New Jersey, with the sole exception that one saw a change in the minimum wage. If you are willing to buy that assumption, then they have found the true effect of minimum wage laws.
However, it is often not clear that the proper control is. Suppose that Pennsylvania had different trends or levels in unemployment before. Can we compare it anyway? Can we compare it to any other state in the union? Differences-in-differences is also generally more believable when there is a clean and plausibly exogenous break between treated and untreated units. The regions that Card and Krueger studied bordered each other, and I’m inclined to believe a study which examines places on either side of border like Kansas-Nebraska, but what should we do if the affected regions are far away from any border? If you are considering the effect of a minimum wage hike in California, with the vast majority of its workers far away from any state boundary, what can you use as the control?
What’s more, it’s not even clear that bordering is a good thing. In order for the local average treatment effect to equal the “true” effect, we have to assume no spillovers between units. One can imagine a world in which employment plummets in New Jersey, but it’s only because people go work across the border in Pennsylvania. A minimum wage hike in both places would not have the same effect.
To deal with this, Alberto Abadie and Javier Gardeazabal (2003) devised a method called synthetic control. They are interested in what the effect of the ETA’s campaign of terrorism in the Basque country was on economic development. Rather than take any one particular place to be the control unit, they instead took a weighted average of Spanish provinces that most closely matched the behavior of the Basque province before. Specifically, they said the Basque country, out of 16 provinces, was closest to 85% Catalonia and 15% Madrid. Computing the weights is actually a computationally difficult problem, which entails a nested fixed-point procedure that you may have seen before if you are familiar with the industrial organization literature. This “synthetic” control produces a more realistic control than any particular region. The key reference is Alberto Abadie writing in the Journal of Economic Literature.
That is what I knew coming into writing this. I have always been a bit intensely suspicious of synthetic control, because of the degrees of freedom it affords the researcher. If at first you don’t have a result, you can try, try again. The point of writing this, then, was to document my process of learning about it.
I shall number my concerns so that we can refer back to how an advocate for synthetic control would address them, starting with the ones I was concerned about going in.
You can control which units go into your donor pool of possible control units, and how you divide up those units. In the case of the effect of terrorism in the Basque country, you can choose whether you use provinces or local municipalities, or whether foreign provinces should be included.
If there are many different combinations of units which arrive similarly matched pre-trends, you have the choice of which combination to use. Ideally, you would arrive at a single combination of best fit, but this actually needn’t happen. With enough potential control units, it is possible for there to be an infinite number of combinations of control units which produce the pre-event pattern, all of which will have different after event trends.
You have discretion over what pre-trends you care about. Suppose you’re examining the economic development of a country. Do you weight on years of education? Test scores? GDP? The commodities they export? Or any of a dozen possible other factors, some of which may be causally related to each other. The more factors you include, the more units in the donor pool you need.
If the control units are affected by transitory shocks, then matching on pre-trends doesn’t do anything at all. You observe Z, which are the realized values, but don’t observe mu, which are the shocks. So, if you’re matching on GDP, but don’t include type of industry, you will get entirely spurious differences. Basically there’s no reason for pretrends to continue unless they were caused by the same factors. Note that including a longer panel need not improve this at all. Differences-in-differences faces the same problem, but the point of comparing areas right next to each other is to account for all of the shared shocks.
Related to objection 4, and to be honest just a clarification of what it means, there’s no guarantee that the closest fit is actually the best comparison. Imagine a room with a dimmer that is slowly being slid from 0 to 1 brightness, and on average the room is ½ bright. With a large enough pool of rooms which have their dimmers being randomly flipped up and down between 0 and 1, we should be able to construct a sample which exactly mirrors the tested room. Let’s suppose the treatment happens and there’s no effect, but the room continues increasing in brightness from 1 to 2. Meanwhile the randomly flipping rooms stay between 0 and 1. We have gotten a totally spurious result despite a perfect fit of pretrends.
Objection 2 is, I think, the easiest one to reject. If researchers do not meddle with the algorithm to find the best fit, and there is a unique solution, then there is no room for meddling besides the usual publication bias reasons. I sincerely doubt that many researchers are tweaking how the algorithm computes in order to get significant results; and if they are, they really should be doing something better than reg x, y (complicated).
There are also some basic robustness checks which you see in every paper using it. First, you change the year in which the shock took place. Your results should now show spurious results for any year you test. Second, you can “leave one out” of your donor pool – essentially testing for outlier having outsized effects. You need to pass these hurdles to get published.
There are still problems with researcher degrees of freedom, which have been tested with Monte Carlo simulations. Ferman, Pinto, and Possebom (2020) tests how much cherrypicking is possible in the choice of variables, which is objection 3. First, the good news. The more periods to fit onto, the less scope there is for cherry-picking. This also answers objection 4 – as periods go to infinity, you can abstract away from transitory shocks. Now the bad news: there is still considerable scope for cherry-picking in the choice of predictor variables. They run simulations on data in the Current Population Survey, rather in the spirit of Bertrand-Duflo-Mullainathan (2004). If no cherry-picking, they should reject the null 5% of the time – they actually reject it 14%. Worse still, there will be serial correlation in your data over time, which has to be addressed ad hoc. (They set the parameter equal to .5). The more correlated with itself it is, the less able you are to get to minimize bias. And above all, the bias from objection 4 is what it converges too asymptotically as the number of periods goes to infinity, but in almost all use cases you are quite far from that happening.
Pickett, Hill, and Cowan (2022) is in a similar vein, and aims to dispel certain myths about synthetic control. The first problem is that the optimization procedure is actually different on Stata vs R, which leads to finding different results. (!!) The covariates of objection 4 are also empirically relevant, given the number of periods we have to work with. And objection five (their myth no. 3) really does matter. If you include too much, you end up fitting on noise and getting spurious differences. I’m not entirely sure I understand the solutions to overfitting on noise. My understanding is that they are essentially taking an average of the data they have. I don’t know enough to comment.
But enough of work with simulations. How about the robustness of published studies? Unfortunately, there is not nearly as much as I would have hoped for, but what results there are are hopeful. The best one I found was McClelland and Gault (2017), which throws a bunch of stuff at Abadie, Diamond, and Hainmueller (2010) and finds it comes out alright. Otherwise, though, I was unable to find something systematically assessing the trustworthiness
To sum up: I think objection 1 is a serious one, but afflicts differences-in-differences too. Your choice of donors should be intuitively reasonable, and ideally you should show that your results are robust to taking in a different bunch of donors. Objection 2 is not, in practice, a serious one. Objection 3 and 4 are two sides of the same coin, and are a fundamental weakness. To the extent that you keep things transparent by only including one outcome variable, the more vulnerable you are to transient shocks; but the more you try to account for transient shocks, the more you can get pointless curvefitting. It seems solvable only by good judgement, just the same as differences-in-differences.
I think synthetic controls do have extraordinary potential. I entirely agree with people like Susan Athey and Guido Imbens when they describe it as one of the most important innovations in econometrics in this century. However, it does not replace the hard work of arguing for exogeneity that you find in differences-in-differences papers. Whatever your weights are, they should be intuitive and sensible in the same way that differences-in-differences should be.
Further, taking the results requires an element of trust, and quite a lot of work in trying out all the different possible specifications. For that reason, I think we should not use this for trivial questions. It would be much better for society to have a few, very good and incredibly rigorous synthetic control papers on the very most pressing questions, than a firehose of synthetic control papers on every event, no matter how trivial. Results should jump off the page, and you turn to synthetic control to figure out by how much, rather than to see if there is any result at all.
9 Podcasts PhDs, Postdocs and Researchers should follow
Drowning in research? Need help with your career?
Here's a lifeline for PhDs, researchers, and academics.
Transform your workout, coffee break, or commute into powerful inspiration
Beach day at Grafton Lake by BGSA. Another fun treat by your favorite club🤩 What other way to wrap up the summer semester other than a barbecue x beach day filled with all sorts of fun activities?
Don’t forget to bring sunscreen!
The RPI-BGSA is inviting you to our Graduate School Application Workshop, a free, virtual session designed to equip you with the tools, clarity, and confidence to take your next academic step. Take the first step toward your academic future with the support of BGSA at RPI.
🌎 PhD students: apply to 2025 Berkeley/Sloan Summer School in Environmental/Energy Economics! 🔗
Info + application link below 👇
🗓️Deadline: May 14 | Program: Aug 18–22 at Berkeley
w @auffhammer, @BorensteinS, TCarleton, MFowlie, KJack, ...
Are you PhD student interested in AI/ML & economics?
Apply to the Machine Learning in Economics Summer Institute at Chicago Booth, Aug 7-13. Two-part program includes lectures from experts + 2-day conference on cutting-edge research.
Deadline March 28, plz RT. Link in reply.
2025 BUDGET : Allocations to Various Sectors or Programmes
1. GH¢13.85 billion: Allocation for the Big Push Programme.
2. GH¢499.8 million: Allocation for the No-Academic-Fee policy for first-year students in public tertiary institutions.
3. *GH¢292.4 million*: Allocation for the distribution of free sanitary pads to female students in primary and secondary schools.
4. GH¢242.5 million: Allocation to support victims of the Akosombo dam spillage.
5. GH¢200 million: Allocation to support victims of the tidal wave disaster in the Ketu South constituency.
6. GH¢3.5 billion: Allocation for the free secondary education program.
7. GH¢564.6 million: Allocation for comprehensive provision of free curricula-based textbooks.
8. GH¢1.788 billion: Allocation for the School Feeding Programme.
9. GH¢145.5 million: Allocation for the Capitation Grant.
10. GH¢203 million: Allocation for the payment of teacher trainee allowances.
11. GH¢480 million: Allocation for the payment of nursing trainee allowances.
12. GH¢9.93 billion: Allocation for the National Health Insurance Scheme (NHIS).
13. GH¢2.81 billion: Allocation for the Ghana Road Fund.
14. GH¢7.51 billion: Allocation for the District Assembly Common Fund (DACF).
15. GH¢1.5 billion: Allocation for Agriculture for Economic Transformation Agenda (AETA).
16. GH¢51.3 million: Allocation as seed fund for the establishment of the Women’s Development Bank.
17. GH¢300 million: Allocation for the National Apprenticeship Programme.
18. GH¢100 million: Allocation for the ‘Adwumawura’ Programme.
19. GH¢100 million: Allocation for the National Coders Programme
The Office of the Chief Economist at Microsoft is seeking a PhD intern with a background or interest in econometrics of experimentation, price elasticity estimation, and recalibration of ML models using estimates from experiments.
The five best popular books you can read to understand economics:
1. The Undercover Economist
2. Mastering Metrics
3. Good Economics for Hard Times
4. The Myth of the Rational Market
5. The Return of Depression Economics and the Crisis of 2008
@BlessingAddae5 is a walking library of Ghanaian proverbs! 📖😂 He can recite almost 50 at a go. If wisdom had a PhD, he’d already have it!
Oh wait… he’s working on one in Applied Economics and Policy at RPI.
Need financial advice and a proverb to go with it? He’s your guy! 🔥
The world has probably passed “peak air pollution”—
Global emissions of local air pollutants have probably passed their peak.
The chart shows estimates of global emissions of pollutants such as sulphur dioxide (which causes acid rain), nitrogen oxides, and black and organic carbon.
These pollutants are harmful to human health and can also damage ecosystems.
It looks like emissions have peaked for almost all of these pollutants. Global air pollution is now falling, and we can save many lives by accelerating this decline.
The exception is ammonia, which is mainly produced by agriculture. Its emissions are still rising.
These estimates come from the Community Emissions Data System (CEDS).
(This Daily Data Insight was written by @_HannahRitchie.)