Philip E. Tetlock

new book *Talent: How to Identify Energizers, Winners, and Creatives Around the World*, https://t.co/7bU5cTWLzc, Conversations with Tyler, The Free Press.

22 days ago

Is it possible to spot a good forecast by its rationale? We used LLMs to score the reasoning behind 55,000+ forecasts and test the link between forecast accuracy and written rationales. We found that: • Causal reasoning is much more prevalent than statistical argumentation • It's easier to identify poor forecasters rather than excellent ones • Human ratings of rationale quality can be unreliable. 🧵A thread on the results:

Research_FRI's tweet photo. Is it possible to spot a good forecast by its rationale?

We used LLMs to score the reasoning behind 55,000+ forecasts and test the link between forecast accuracy and written rationales.

We found that:

• Causal reasoning is much more prevalent than statistical argumentation
• It's easier to identify poor forecasters rather than excellent ones
• Human ratings of rationale quality can be unreliable.

🧵A thread on the results:

2

45

12

24

5K

Who to follow

tylercowen

@tylercowen

Robin Hanson

@robinhanson

Let’s skip witty banter & talk deep Qs. Books: https://t.co/hpZgEm5DBI https://t.co/iFs9C3J2Ek Advisor @_futarchy @MetaDAOProject @butterygg

Steven Pinker

@sapinker

Cognitive scientist at Harvard.

PTetlock retweeted

about 1 month ago

On a subset of ForecastBench questions, an LLM has matched superforecaster performance for the first time. A submission from @GoogleDeepMind, named “green tree,” is now #1 on dataset questions on ForecastBench, our AI forecasting benchmark. Superforecasters remain #1 overall.

Research_FRI's tweet photo. On a subset of ForecastBench questions, an LLM has matched superforecaster performance for the first time.

A submission from @GoogleDeepMind, named “green tree,” is now #1 on dataset questions on ForecastBench, our AI forecasting benchmark.

Superforecasters remain #1 overall. https://t.co/i5R0pFOeoK

5

84

15

27

200K

3 months ago

Examining the quality of the methodology & participants--& depth of the questions-- this study is the most ambitious subjective-probability forecasting exercise I've ever seen. Agree or disagree with the conclusions, this is a state-of-the-art benchmark to beat

3 months ago

We completed the most comprehensive study of how economists and AI experts think AI will affect the U.S. economy. They predict major AI progress—but no dramatic break from economic trends: GDP growth rates similar to today's and a moderate decline in labor force participation. However, when asked to consider what would happen in a world with extremely rapid progress in AI capabilities by 2030, they predict significant economic impacts by 2050: • Annualized GDP growth of 3.5% (compared to 2.4% in 2025) • A labor force participation rate of 55% (roughly 10 million fewer jobs) • 80% of wealth held by the top 10% (highest since 1939) 🧵 Here's what we found:

Research_FRI's tweet photo. We completed the most comprehensive study of how economists and AI experts think AI will affect the U.S. economy.

They predict major AI progress—but no dramatic break from economic trends: GDP growth rates similar to today's and a moderate decline in labor force participation.

However, when asked to consider what would happen in a world with extremely rapid progress in AI capabilities by 2030, they predict significant economic impacts by 2050:

• Annualized GDP growth of 3.5% (compared to 2.4% in 2025)
• A labor force participation rate of 55% (roughly 10 million fewer jobs)
• 80% of wealth held by the top 10% (highest since 1939)

🧵 Here's what we found:

20

645

190

713

381K

2

178

41

169

47K

PTetlock retweeted

3 months ago

We completed the most comprehensive study of how economists and AI experts think AI will affect the U.S. economy. They predict major AI progress—but no dramatic break from economic trends: GDP growth rates similar to today's and a moderate decline in labor force participation. However, when asked to consider what would happen in a world with extremely rapid progress in AI capabilities by 2030, they predict significant economic impacts by 2050: • Annualized GDP growth of 3.5% (compared to 2.4% in 2025) • A labor force participation rate of 55% (roughly 10 million fewer jobs) • 80% of wealth held by the top 10% (highest since 1939) 🧵 Here's what we found:

20

645

190

713

381K

PTetlock retweeted

3 months ago

We built an interactive forecasting tool so you can see how your forecasts about AI's economic impact compare with forecasts from the economists in our sample. When you submit your forecasts, you'll receive a summary image that you can share on X or elsewhere. https://t.co/eoKvzy4u9G

0

25

6

14

6K

4 months ago

A recurring forecasting failure (you can almost take this one to the bank).

Michael Malice

@michaelmalice

4 months ago

I have underestimated to what extent people interested in politics believe the good guys are perpetually on the verge of defeating the bad guys once and for all. here is Rush Limbaugh in 1993 predicting the imminent implosion of liberalism:

michaelmalice's tweet photo. I have underestimated to what extent people interested in politics believe the good guys are perpetually on the verge of defeating the bad guys once and for all.

here is Rush Limbaugh in 1993 predicting the imminent implosion of liberalism: https://t.co/NRs5blQPc4

173

1K

110

134

77K

4

25

4

6

6K

4 months ago

Variants of this question are popping up a lot nowadays: How much should the moral character of an artist or scholar or scientist sway our assessments of the value of their creative output?

Rob Henderson

@robkhenderson

over 3 years ago

While Marx's wife was pregnant, he impregnated their housemaid. To hide this from Marx's wife so she wouldn't get angry, Engels pretended he was the father. Then they gave the kid away. Both Marx and Engels abandoned him, pretending the kid didn't exist. https://t.co/imbZ76TjBY

robkhenderson's tweet photo. While Marx's wife was pregnant, he impregnated their housemaid. To hide this from Marx's wife so she wouldn't get angry, Engels pretended he was the father. Then they gave the kid away. Both Marx and Engels abandoned him, pretending the kid didn't exist. https://t.co/imbZ76TjBY https://t.co/M8DgIKfOFR

74

4K

694

718

0

13

45

4

13

10K

5 months ago

My collaborators in @Research_FRI have released latest AI progress forecasts from LEAP (Longitudinal Expert AI Panel). Experts continue to underestimate benchmark progress but also expect big increases in tech company valuations & data center buildout. For details: https://t.co/Ji5CucVrIG

Igor Grossmann, PhD @igi.bsky.social @psywisdom

6 months ago

What do experts and superforecasters think about the future of AI research and development? In Wave 4 of the Longitudinal Expert AI Panel (LEAP), we asked top AI experts to forecast progress in AI R&D, hiring, company valuations, data center buildout, and more. Here’s what you need to know 🧵

Research_FRI's tweet photo. What do experts and superforecasters think about the future of AI research and development?

In Wave 4 of the Longitudinal Expert AI Panel (LEAP), we asked top AI experts to forecast progress in AI R&D, hiring, company valuations, data center buildout, and more.

Here’s what you need to know 🧵

2

39

11

24

11K

1

24

7

9

5K

PTetlock retweeted

6 months ago

Can we predict the past? 🔮 New open-access paper in American Historical Review proposes 'Retrodiction'—using gaps in the archival record to test historical theories. Interdisciplinary collab feat. David Gill, Marc Trachtenberg, @PTetlock @cendripetalfrce, and more 🧵👇

2

28

9

15

5K

PTetlock retweeted

James Pethokoukis ⏩️⤴️

@JimPethokoukis

6 months ago

A really important piece "AI needs fewer prophets and more predictions" @PTetlock https://t.co/qdqc7yCFPE

0

6

2

4K

PTetlock retweeted

7 months ago

@Research_FRI is dedicated to improving signal-to-noise ratios in heated debates. LEAP, Longitudinal Expert AI Panel, elicits testable forecasts of AI progress each month. First-wave results will frustrate extremists at both ends of opinion spectrum but if you love nuance...

1

24

7

9

7K

7 months ago

@Research_FRI is dedicated to improving signal-to-noise ratios in heated debates. LEAP, Longitudinal Expert AI Panel, elicits testable forecasts of AI progress each month. First-wave results will frustrate extremists at both ends of opinion spectrum but if you love nuance...

7 months ago

In Bloomberg today, FRI Chief Scientist, @PTetlock argues that the debate over AI progress needs falsifiable forecasts rather than broad statements: "The debate over the future of AI deserves fewer overconfident proclamations and more precise, policy-relevant predictions. Careful forecasting surveys, of the kind I’ve spent my career working on, can test arguments against reality and, in time, affirm the voices that consistently deliver reliable, actionable guidance."

Research_FRI's tweet photo. In Bloomberg today, FRI Chief Scientist, @PTetlock argues that the debate over AI progress needs falsifiable forecasts rather than broad statements:

"The debate over the future of AI deserves fewer overconfident proclamations and more precise, policy-relevant predictions. Careful forecasting surveys, of the kind I’ve spent my career working on, can test arguments against reality and, in time, affirm the voices that consistently deliver reliable, actionable guidance."

1

28

7

8

9K

1

24

7

9

7K

The Dark Fiddling Pirate Jussim

7 months ago

The more tendentious the topic, the harder it is to consummate an adversarial collaboration. So the team deserves a lot of credit for pulling this off on “implicit bias”—& publishing in a high-rejection-rate journal. But many issues are left dangling. I hope there is a Round 2.

@PsychRabble

7 months ago

Terrific social psych paper ( how often do you hear that from me?). Adversarial collab on how much measures of "implicit bias" predict racial discrim. tl;dr: N>2000 1. Pro-Black discrimination>pro-White discrimination 2. Implicit measures predicted discrim (std reg coeff=.16) a hair above their agreed-upon threshold for considering the effect trivial (.15), controlling for explicit prejudice. 3. Explicit prej powerfully predicted discrim. Authors include @jordanaxt, @ImHardcory, @PTetlock, @tomstello https://t.co/NlUUUGKded ht @robsica

2

85

16

50

12K

0

16

3

9

6K

8 months ago

Will wisdom of elite forecasting crowd on LEAP out-perform famous AI accelerationists & skeptics? First-wave results: LEAP experts are consistently less bullish on AI progress than tech CEOs, like Altman, but more bullish than educated public

8 months ago

Today, we are launching the most rigorous ongoing source of expert forecasts on the future of AI: the Longitudinal Expert AI Panel (LEAP). We’ve assembled a panel of 339 top experts across computer science, AI industry, economics, and AI policy. Roughly every month—for the next three years—they’ll provide precise, falsifiable forecasts on the trajectory of AI capabilities, adoption, and impact. Our results cover where experts predict major effects of AI, where they expect less progress than AI industry leaders, and where they disagree. LEAP experts forecast major effects of AI by 2030, including: ⚡ 7x increase in AI’s share of U.S. electricity use (1% -> 7%) 🖥️ 9x increase in AI-assisted work hours (2% -> 18%) By 2040, experts predict: 👥30% of adults will use AI for companionship daily 🏆60% chance that AI will solve or substantially assist in solving a Millennium Prize Problem 🚂32% chance that AI will have been at least as impactful as a "technology of the millennium," like the printing press or the Industrial Revolution. 🧵Read on for more insights and results

Research_FRI's tweet photo. Today, we are launching the most rigorous ongoing source of expert forecasts on the future of AI: the Longitudinal Expert AI Panel (LEAP).

We’ve assembled a panel of 339 top experts across computer science, AI industry, economics, and AI policy.

Roughly every month—for the next three years—they’ll provide precise, falsifiable forecasts on the trajectory of AI capabilities, adoption, and impact.

Our results cover where experts predict major effects of AI, where they expect less progress than AI industry leaders, and where they disagree.

LEAP experts forecast major effects of AI by 2030, including:

⚡ 7x increase in AI’s share of U.S. electricity use (1% -> 7%)
🖥️ 9x increase in AI-assisted work hours (2% -> 18%)

By 2040, experts predict:
👥30% of adults will use AI for companionship daily
🏆60% chance that AI will solve or substantially assist in solving a Millennium Prize Problem
🚂32% chance that AI will have been at least as impactful as a "technology of the millennium," like the printing press or the Industrial Revolution.

🧵Read on for more insights and results

17

255

83

145

88K

2

74

7

20

11K

PTetlock retweeted

8 months ago

Today, we are launching the most rigorous ongoing source of expert forecasts on the future of AI: the Longitudinal Expert AI Panel (LEAP). We’ve assembled a panel of 339 top experts across computer science, AI industry, economics, and AI policy. Roughly every month—for the next three years—they’ll provide precise, falsifiable forecasts on the trajectory of AI capabilities, adoption, and impact. Our results cover where experts predict major effects of AI, where they expect less progress than AI industry leaders, and where they disagree. LEAP experts forecast major effects of AI by 2030, including: ⚡ 7x increase in AI’s share of U.S. electricity use (1% -> 7%) 🖥️ 9x increase in AI-assisted work hours (2% -> 18%) By 2040, experts predict: 👥30% of adults will use AI for companionship daily 🏆60% chance that AI will solve or substantially assist in solving a Millennium Prize Problem 🚂32% chance that AI will have been at least as impactful as a "technology of the millennium," like the printing press or the Industrial Revolution. 🧵Read on for more insights and results

17

255

83

145

88K

8 months ago

When I first read “When Prophecy Fails” in grad school—eons ago—I wondered whether this case study was “too neat/ theoretically convenient to be true.” Prof warned “don’t let anyone else hear you say that. Festinger is sacrosanct.” Another reason for my fascination with taboo topics.

Robin Hanson

@robinhanson

8 months ago

"When Prophecy Fails … the case was misrepresented. The cult did not persist, proselytize, or reinterpret its failure as a spiritual triumph. Its leader recanted, the group disbanded, and belief dissolved."

8

127

8

39

69K

12

416

45

105

65K