PDS

Verified account

@TopadoDuran

Modern Global Macro Investor. Macro AI Systems Engineer. Macroeconomics. Markets. Fed Watcher. China Macro. AI. Computer Science. Opinions are my own.

Manhattan, NY

Joined October 2009

1.8K Following

13.9K Followers

3.5K Posts

Pinned Tweet

3 months ago

Here is a Citrini critique from a Macro-AI lens. Citrini's "2028 Intelligence Crisis" direction of labor market impact is sensible. The velocity is wrong. His scenario timeline is off by ~ 5 years + because his AI capability assumptions are anecdotal, not grounded in the math of how models actually improve. His core assumptions: 1) The AI feedback loop is sustained by "AI Investment Increases & AI Capabilities Improve" — implying that as long as you throw money at compute, you get better models that unlock capabilities replacing humans at a given task 2) He extrapolates from what he's observed anecdotally about AI agents to "smooth sailing" toward autonomous multi-week agents by 2027. This is a guess, not an informed view Let's rigorize his feedback loop with some Macro-AI: scaling laws → capabilities → task automation → displacement The relationship between scaling laws and capabilities is determined by a "Capability Transfer" function. This relationship is non-linear and task-dependent. Capabilities don't emerge linearly from loss improvements — they phase-transition at specific thresholds (I model this with a sigmoid in pic below). Du et al. (2024) validates pre-training loss as the sufficient proxy for capability emergence below task-specific thresholds. The hard tasks Citrini envisions — autonomous multi-week work by 2028 — likely have thresholds near the irreducible loss floor (E ≈ 1.82 nats). But here's the deeper issue: next-token prediction is a proxy objective. A model can approach E without acquiring the causal reasoning, goal persistence, and calibrated uncertainty that autonomous work demands. The frontier labs' pivot to post-training (RLHF, inference-time compute, tool use) is an implicit admission of this gap. All this to say, some hard tasks may not be solvable by scaling compute alone - which is what Citrini assumes. If we know the capability transfer thresholds and map them to loss levels (and thus compute), we can estimate when complex tasks become feasible. (yup -> we can actually model this) Using 0.40 OOM/year in compute growth and 0.30 OOM/year in algorithmic efficiency gains, I model that professional replacement (defined workflows: customer service, routine legal, basic analysis) starts mid-2026 — but cognitive displacement (multi-week autonomy, novel problem solving) only arrives ~2031. Citrini's crisis requires cognitive displacement. So his timeline starts at 2031 at earliest. **AND** we haven't even addressed two more constraints that push it further: agent reliability decays exponentially with task complexity and the training data wall. If I can vibe code myself into a web dev, I'll upload the interactive model at https://t.co/kdlYUwTjvt for you to stress-test the assumptions yourself.

TopadoDuran's tweet photo. Here is a Citrini critique from a Macro-AI lens.

Citrini's "2028 Intelligence Crisis" direction of labor market impact is sensible. The velocity is wrong. His scenario timeline is off by ~ 5 years + because his AI capability assumptions are anecdotal, not grounded in the math of how models actually improve.

His core assumptions:
1) The AI feedback loop is sustained by "AI Investment Increases & AI Capabilities Improve" — implying that as long as you throw money at compute, you get better models that unlock capabilities replacing humans at a given task
2) He extrapolates from what he's observed anecdotally about AI agents to "smooth sailing" toward autonomous multi-week agents by 2027. This is a guess, not an informed view

Let's rigorize his feedback loop with some Macro-AI:

scaling laws → capabilities → task automation → displacement

The relationship between scaling laws and capabilities is determined by a "Capability Transfer" function. This relationship is non-linear and task-dependent. Capabilities don't emerge linearly from loss improvements — they phase-transition at specific thresholds (I model this with a sigmoid in pic below). Du et al. (2024) validates pre-training loss as the sufficient proxy for capability emergence below task-specific thresholds.

The hard tasks Citrini envisions — autonomous multi-week work by 2028 — likely have thresholds near the irreducible loss floor (E ≈ 1.82 nats). But here's the deeper issue: next-token prediction is a proxy objective. A model can approach E without acquiring the causal reasoning, goal persistence, and calibrated uncertainty that autonomous work demands. The frontier labs' pivot to post-training (RLHF, inference-time compute, tool use) is an implicit admission of this gap. All this to say, some hard tasks may not be solvable by scaling compute alone - which is what Citrini assumes.

If we know the capability transfer thresholds and map them to loss levels (and thus compute), we can estimate when complex tasks become feasible. (yup -> we can actually model this)

Using 0.40 OOM/year in compute growth and 0.30 OOM/year in algorithmic efficiency gains, I model that professional replacement (defined workflows: customer service, routine legal, basic analysis) starts mid-2026 — but cognitive displacement (multi-week autonomy, novel problem solving) only arrives ~2031. Citrini's crisis requires cognitive displacement. So his timeline starts at 2031 at earliest.

**AND** we haven't even addressed two more constraints that push it further: agent reliability decays exponentially with task complexity and the training data wall.

If I can vibe code myself into a web dev, I'll upload the interactive model at https://t.co/kdlYUwTjvt for you to stress-test the assumptions yourself.

5

95

7

102

298K

8 days ago

@steipete @0x_vibevortex What’s best is what the operator can make work best.. claude code >> codex for me

1

1

0

0

201

TopadoDuran retweeted

13 days ago

Q: How are job postings for software engineers rising rapidly despite AI agents automating coding? A: Because there’s far more code to manage than ever before. We’re already seeing a 14x YoY increase in GitHub commits, and it’s accelerating. AI has dramatically lowered the cost of writing code, so it’s now being used across far more businesses, applications, and use cases. We’re at the beginning of a massive productivity boom driven by the proliferation of bespoke software throughout the entire economy. Coding has been AI’s breakout use case this year. The fact that it’s increased demand for software engineers — rather than decreased it — should call into question the entire “AI will cause mass job loss” narrative.

DavidSacks's tweet photo. Q: How are job postings for software engineers rising rapidly despite AI agents automating coding?

A: Because there’s far more code to manage than ever before. We’re already seeing a 14x YoY increase in GitHub commits, and it’s accelerating.

AI has dramatically lowered the cost of writing code, so it’s now being used across far more businesses, applications, and use cases.

We’re at the beginning of a massive productivity boom driven by the proliferation of bespoke software throughout the entire economy.

Coding has been AI’s breakout use case this year. The fact that it’s increased demand for software engineers — rather than decreased it — should call into question the entire “AI will cause mass job loss” narrative.

832

9K

1K

2K

2M

18 days ago

@karpathy Goat..

0

0

0

0

478

Who to follow

Verified account

CIO at @UnlimitedFnds | PM of $HFGM, $HFEQ, $HFMF, $HFND | Fmr @Bridgewater IC | One of the few "sane" voices on #fintwit | Comments not investment advice

Verified account

Boaz Weinstein is the founder of Saba Capital Management.

Verified account

https://t.co/SgaSuGdrox macro & beta @2Graybeards for beta. Both for investor education, Brevan Howard, Bridgewater, Salomon, Dad of 4. Go Penn, No tweet is advice

18 days ago

0

0

0

0

421

27 days ago

Interesting -> .html vs .md

29 days ago

https://t.co/MXt5XS4xBX

1K

17K

2K

34K

14M

1

7

1

8

5K

about 1 month ago

The markets can be irrational for a long time. People investing are not at frontier labs or are not using AI beyond a conversational capability. A litmus test. Ask market participants - who think vibe coding will replace SAAS, who is going to verify the additional billions lines of codes that an AI system at scale can produce per month (maybe per week)? Automation comes only after verification, so tell me how less software engineers or non specialists (software companies) are going to verify codebases that are growing of orders of magnitude a month (a week)?

1

3

0

2

242

about 1 month ago

@Geiger_Capital 🤝

0

0

0

0

485

TopadoDuran retweeted

U.S. Department of Labor @USDOL

about 1 month ago

Today, we launched the AI Registered Apprenticeship Innovation Portal, a one-stop resource to help organizations build AI literacy and develop AI-focused apprenticeship programs. Under @POTUS’ leadership, we’re positioning American Workers to lead in the age of AI 🇺🇸 https://t.co/Wc76dKSIcW

USDOL's tweet photo. Today, we launched the AI Registered Apprenticeship Innovation Portal, a one-stop resource to help organizations build AI literacy and develop AI-focused apprenticeship programs.

Under @POTUS’ leadership, we’re positioning American Workers to lead in the age of AI 🇺🇸
https://t.co/Wc76dKSIcW

138

4K

812

2K

238K

about 1 month ago

Brilliant Thank u @dwarkesh_sp

about 1 month ago

Did a very different format with @reinerpope – a blackboard lecture where he walks through how frontier LLMs are trained and served. It's shocking how much you can deduce about what the labs are doing from a handful of equations, public API prices, and some chalk. It’s a bit technical, but I encourage you to hang in there - it’s really worth it. There are less than a handful of people who understand the full stack of AI, from chip design to model architecture, as well as Reiner. It was a real delight to learn from him. Recommend watching this one on YouTube so you can see the chalkboard. – How batch size affects token cost and speed – How MoE models are laid out across GPU racks – How pipeline parallelism spreads model layers across racks – Why Ilya said, “As we now know, pipelining is not wise.” – Because of RL, models may be 100x over-trained beyond Chinchilla-optimal – Deducing long context memory costs from API pricing – Convergent evolution between neural nets and cryptography

7

227

16

195

35K

0

9

0

5

1K

about 1 month ago

@nfergus @scmallaby @demishassabis 💯

0

1

0

0

661

about 1 month ago

@dylan522p @AnthropicAI Same

0

0

0

0

687

about 1 month ago

@Jason I’m in

0

0

0

0

271

about 2 months ago

@emollick @alexolegimas @danielrock @joshgans @robseamans Related to the topic Yann LeCun is directionally right in my view

3 months ago

Here is a Citrini critique from a Macro-AI lens. Citrini's "2028 Intelligence Crisis" direction of labor market impact is sensible. The velocity is wrong. His scenario timeline is off by ~ 5 years + because his AI capability assumptions are anecdotal, not grounded in the math of how models actually improve. His core assumptions: 1) The AI feedback loop is sustained by "AI Investment Increases & AI Capabilities Improve" — implying that as long as you throw money at compute, you get better models that unlock capabilities replacing humans at a given task 2) He extrapolates from what he's observed anecdotally about AI agents to "smooth sailing" toward autonomous multi-week agents by 2027. This is a guess, not an informed view Let's rigorize his feedback loop with some Macro-AI: scaling laws → capabilities → task automation → displacement The relationship between scaling laws and capabilities is determined by a "Capability Transfer" function. This relationship is non-linear and task-dependent. Capabilities don't emerge linearly from loss improvements — they phase-transition at specific thresholds (I model this with a sigmoid in pic below). Du et al. (2024) validates pre-training loss as the sufficient proxy for capability emergence below task-specific thresholds. The hard tasks Citrini envisions — autonomous multi-week work by 2028 — likely have thresholds near the irreducible loss floor (E ≈ 1.82 nats). But here's the deeper issue: next-token prediction is a proxy objective. A model can approach E without acquiring the causal reasoning, goal persistence, and calibrated uncertainty that autonomous work demands. The frontier labs' pivot to post-training (RLHF, inference-time compute, tool use) is an implicit admission of this gap. All this to say, some hard tasks may not be solvable by scaling compute alone - which is what Citrini assumes. If we know the capability transfer thresholds and map them to loss levels (and thus compute), we can estimate when complex tasks become feasible. (yup -> we can actually model this) Using 0.40 OOM/year in compute growth and 0.30 OOM/year in algorithmic efficiency gains, I model that professional replacement (defined workflows: customer service, routine legal, basic analysis) starts mid-2026 — but cognitive displacement (multi-week autonomy, novel problem solving) only arrives ~2031. Citrini's crisis requires cognitive displacement. So his timeline starts at 2031 at earliest. **AND** we haven't even addressed two more constraints that push it further: agent reliability decays exponentially with task complexity and the training data wall. If I can vibe code myself into a web dev, I'll upload the interactive model at https://t.co/kdlYUwTjvt for you to stress-test the assumptions yourself.

TopadoDuran's tweet photo. Here is a Citrini critique from a Macro-AI lens.

Citrini's "2028 Intelligence Crisis" direction of labor market impact is sensible. The velocity is wrong. His scenario timeline is off by ~ 5 years + because his AI capability assumptions are anecdotal, not grounded in the math of how models actually improve.

His core assumptions:
1) The AI feedback loop is sustained by "AI Investment Increases & AI Capabilities Improve" — implying that as long as you throw money at compute, you get better models that unlock capabilities replacing humans at a given task
2) He extrapolates from what he's observed anecdotally about AI agents to "smooth sailing" toward autonomous multi-week agents by 2027. This is a guess, not an informed view

Let's rigorize his feedback loop with some Macro-AI:

scaling laws → capabilities → task automation → displacement

The relationship between scaling laws and capabilities is determined by a "Capability Transfer" function. This relationship is non-linear and task-dependent. Capabilities don't emerge linearly from loss improvements — they phase-transition at specific thresholds (I model this with a sigmoid in pic below). Du et al. (2024) validates pre-training loss as the sufficient proxy for capability emergence below task-specific thresholds.

The hard tasks Citrini envisions — autonomous multi-week work by 2028 — likely have thresholds near the irreducible loss floor (E ≈ 1.82 nats). But here's the deeper issue: next-token prediction is a proxy objective. A model can approach E without acquiring the causal reasoning, goal persistence, and calibrated uncertainty that autonomous work demands. The frontier labs' pivot to post-training (RLHF, inference-time compute, tool use) is an implicit admission of this gap. All this to say, some hard tasks may not be solvable by scaling compute alone - which is what Citrini assumes.

If we know the capability transfer thresholds and map them to loss levels (and thus compute), we can estimate when complex tasks become feasible. (yup -> we can actually model this)

Using 0.40 OOM/year in compute growth and 0.30 OOM/year in algorithmic efficiency gains, I model that professional replacement (defined workflows: customer service, routine legal, basic analysis) starts mid-2026 — but cognitive displacement (multi-week autonomy, novel problem solving) only arrives ~2031. Citrini's crisis requires cognitive displacement. So his timeline starts at 2031 at earliest.

**AND** we haven't even addressed two more constraints that push it further: agent reliability decays exponentially with task complexity and the training data wall.

If I can vibe code myself into a web dev, I'll upload the interactive model at https://t.co/kdlYUwTjvt for you to stress-test the assumptions yourself.

5

95

7

102

298K

0

3

0

0

1K

about 2 months ago

@ylecun @Ph_Aghion @erikbryn 💯 Most macroeconomists thou are not close enough to the algorithmic capabilities of the models so they struggle understanding the capability transfer function @ylecun

3 months ago

Here is a Citrini critique from a Macro-AI lens. Citrini's "2028 Intelligence Crisis" direction of labor market impact is sensible. The velocity is wrong. His scenario timeline is off by ~ 5 years + because his AI capability assumptions are anecdotal, not grounded in the math of how models actually improve. His core assumptions: 1) The AI feedback loop is sustained by "AI Investment Increases & AI Capabilities Improve" — implying that as long as you throw money at compute, you get better models that unlock capabilities replacing humans at a given task 2) He extrapolates from what he's observed anecdotally about AI agents to "smooth sailing" toward autonomous multi-week agents by 2027. This is a guess, not an informed view Let's rigorize his feedback loop with some Macro-AI: scaling laws → capabilities → task automation → displacement The relationship between scaling laws and capabilities is determined by a "Capability Transfer" function. This relationship is non-linear and task-dependent. Capabilities don't emerge linearly from loss improvements — they phase-transition at specific thresholds (I model this with a sigmoid in pic below). Du et al. (2024) validates pre-training loss as the sufficient proxy for capability emergence below task-specific thresholds. The hard tasks Citrini envisions — autonomous multi-week work by 2028 — likely have thresholds near the irreducible loss floor (E ≈ 1.82 nats). But here's the deeper issue: next-token prediction is a proxy objective. A model can approach E without acquiring the causal reasoning, goal persistence, and calibrated uncertainty that autonomous work demands. The frontier labs' pivot to post-training (RLHF, inference-time compute, tool use) is an implicit admission of this gap. All this to say, some hard tasks may not be solvable by scaling compute alone - which is what Citrini assumes. If we know the capability transfer thresholds and map them to loss levels (and thus compute), we can estimate when complex tasks become feasible. (yup -> we can actually model this) Using 0.40 OOM/year in compute growth and 0.30 OOM/year in algorithmic efficiency gains, I model that professional replacement (defined workflows: customer service, routine legal, basic analysis) starts mid-2026 — but cognitive displacement (multi-week autonomy, novel problem solving) only arrives ~2031. Citrini's crisis requires cognitive displacement. So his timeline starts at 2031 at earliest. **AND** we haven't even addressed two more constraints that push it further: agent reliability decays exponentially with task complexity and the training data wall. If I can vibe code myself into a web dev, I'll upload the interactive model at https://t.co/kdlYUwTjvt for you to stress-test the assumptions yourself.

TopadoDuran's tweet photo. Here is a Citrini critique from a Macro-AI lens.

Citrini's "2028 Intelligence Crisis" direction of labor market impact is sensible. The velocity is wrong. His scenario timeline is off by ~ 5 years + because his AI capability assumptions are anecdotal, not grounded in the math of how models actually improve.

His core assumptions:
1) The AI feedback loop is sustained by "AI Investment Increases & AI Capabilities Improve" — implying that as long as you throw money at compute, you get better models that unlock capabilities replacing humans at a given task
2) He extrapolates from what he's observed anecdotally about AI agents to "smooth sailing" toward autonomous multi-week agents by 2027. This is a guess, not an informed view

Let's rigorize his feedback loop with some Macro-AI:

scaling laws → capabilities → task automation → displacement

The relationship between scaling laws and capabilities is determined by a "Capability Transfer" function. This relationship is non-linear and task-dependent. Capabilities don't emerge linearly from loss improvements — they phase-transition at specific thresholds (I model this with a sigmoid in pic below). Du et al. (2024) validates pre-training loss as the sufficient proxy for capability emergence below task-specific thresholds.

The hard tasks Citrini envisions — autonomous multi-week work by 2028 — likely have thresholds near the irreducible loss floor (E ≈ 1.82 nats). But here's the deeper issue: next-token prediction is a proxy objective. A model can approach E without acquiring the causal reasoning, goal persistence, and calibrated uncertainty that autonomous work demands. The frontier labs' pivot to post-training (RLHF, inference-time compute, tool use) is an implicit admission of this gap. All this to say, some hard tasks may not be solvable by scaling compute alone - which is what Citrini assumes.

If we know the capability transfer thresholds and map them to loss levels (and thus compute), we can estimate when complex tasks become feasible. (yup -> we can actually model this)

Using 0.40 OOM/year in compute growth and 0.30 OOM/year in algorithmic efficiency gains, I model that professional replacement (defined workflows: customer service, routine legal, basic analysis) starts mid-2026 — but cognitive displacement (multi-week autonomy, novel problem solving) only arrives ~2031. Citrini's crisis requires cognitive displacement. So his timeline starts at 2031 at earliest.

**AND** we haven't even addressed two more constraints that push it further: agent reliability decays exponentially with task complexity and the training data wall.

If I can vibe code myself into a web dev, I'll upload the interactive model at https://t.co/kdlYUwTjvt for you to stress-test the assumptions yourself.

5

95

7

102

298K

0

5

0

3

2K

about 2 months ago

@Geiger_Capital These weekend drops are a wall street staple 🤝

0

1

0

0

936

about 2 months ago

There's a dimension of the AI frontier that gets zero airtime next to compute scaling and Blackwell trained models — and it's where the actual alpha lives. Hermeneutics is the study of interpretation — not what a text says, but what it means. For 200 years it's been confined to philosophy departments. It now has a direct application in markets that nobody is exploiting. The entire financial NLP ecosystem — $9B deployed — operates at what hermeneutics would call the 'surface' and 'concept' layers. What was literally said, what category it maps to. This is textual processing, not interpretation. Interpretation begins at the mechanism layer: what causal model does this framing imply? 'Softening' through demand destruction and 'softening' through supply normalization are the same word pointing at completely different forward worlds. The concept layer can't distinguish them. The hermeneutic layer must. Below mechanism: principle (what foundational framework is active) and axiom (what's assumed without being stated). The market's coverage at these layers is sub 10% — and the participants processing there are doing it in their heads, unscalable. The processing cliff between concept and mechanism is a measured 44 points of F1 accuracy. That cliff is the boundary between text processing and interpretation. Most alpha theories rely on attention asymmetry. This relies on interpretive asymmetry — a gap that requires architectural change, not redirected attention. The LLM paradigm makes it possible to perform hermeneutic operations computationally — holding multiple interpretive horizons open, tracking epistemic modality, detecting framework shifts across time. But only when architected as structured extraction systems, not conversational tools. The gap between those two architectures is the gap between reading and interpreting. Markets have always rewarded the latter. They've just never been able to scale it.

TopadoDuran's tweet photo. There's a dimension of the AI frontier that gets zero airtime next to compute scaling and Blackwell trained models — and it's where the actual alpha lives.

Hermeneutics is the study of interpretation — not what a text says, but what it means. For 200 years it's been confined to philosophy departments. It now has a direct application in markets that nobody is exploiting.

The entire financial NLP ecosystem — $9B deployed — operates at what hermeneutics would call the 'surface' and 'concept' layers. What was literally said, what category it maps to. This is textual processing, not interpretation.

Interpretation begins at the mechanism layer: what causal model does this framing imply? 'Softening' through demand destruction and 'softening' through supply normalization are the same word pointing at completely different forward worlds. The concept layer can't distinguish them. The hermeneutic layer must.

Below mechanism: principle (what foundational framework is active) and axiom (what's assumed without being stated). The market's coverage at these layers is sub 10% — and the participants processing there are doing it in their heads, unscalable.

The processing cliff between concept and mechanism is a measured 44 points of F1 accuracy. That cliff is the boundary between text processing and interpretation. Most alpha theories rely on attention asymmetry. This relies on interpretive asymmetry — a gap that requires architectural change, not redirected attention.

The LLM paradigm makes it possible to perform hermeneutic operations computationally — holding multiple interpretive horizons open, tracking epistemic modality, detecting framework shifts across time. But only when architected as structured extraction systems, not conversational tools. The gap between those two architectures is the gap between reading and interpreting.

Markets have always rewarded the latter. They've just never been able to scale it.

3

136

5

50

394K

2 months ago

@Geiger_Capital Nice Easy money.. good call Geiger

0

8

0

0

808

2 months ago

@WhiteHouse Countries will just give a call to Iran.. basically we gave them agency over the strait and wasted ton of tax payer money.. Anyways Let’s get back to work Mr President

1

14

0

0

3K

2 months ago

@pmarca Some macro ai math

3 months ago

Here is a Citrini critique from a Macro-AI lens. Citrini's "2028 Intelligence Crisis" direction of labor market impact is sensible. The velocity is wrong. His scenario timeline is off by ~ 5 years + because his AI capability assumptions are anecdotal, not grounded in the math of how models actually improve. His core assumptions: 1) The AI feedback loop is sustained by "AI Investment Increases & AI Capabilities Improve" — implying that as long as you throw money at compute, you get better models that unlock capabilities replacing humans at a given task 2) He extrapolates from what he's observed anecdotally about AI agents to "smooth sailing" toward autonomous multi-week agents by 2027. This is a guess, not an informed view Let's rigorize his feedback loop with some Macro-AI: scaling laws → capabilities → task automation → displacement The relationship between scaling laws and capabilities is determined by a "Capability Transfer" function. This relationship is non-linear and task-dependent. Capabilities don't emerge linearly from loss improvements — they phase-transition at specific thresholds (I model this with a sigmoid in pic below). Du et al. (2024) validates pre-training loss as the sufficient proxy for capability emergence below task-specific thresholds. The hard tasks Citrini envisions — autonomous multi-week work by 2028 — likely have thresholds near the irreducible loss floor (E ≈ 1.82 nats). But here's the deeper issue: next-token prediction is a proxy objective. A model can approach E without acquiring the causal reasoning, goal persistence, and calibrated uncertainty that autonomous work demands. The frontier labs' pivot to post-training (RLHF, inference-time compute, tool use) is an implicit admission of this gap. All this to say, some hard tasks may not be solvable by scaling compute alone - which is what Citrini assumes. If we know the capability transfer thresholds and map them to loss levels (and thus compute), we can estimate when complex tasks become feasible. (yup -> we can actually model this) Using 0.40 OOM/year in compute growth and 0.30 OOM/year in algorithmic efficiency gains, I model that professional replacement (defined workflows: customer service, routine legal, basic analysis) starts mid-2026 — but cognitive displacement (multi-week autonomy, novel problem solving) only arrives ~2031. Citrini's crisis requires cognitive displacement. So his timeline starts at 2031 at earliest. **AND** we haven't even addressed two more constraints that push it further: agent reliability decays exponentially with task complexity and the training data wall. If I can vibe code myself into a web dev, I'll upload the interactive model at https://t.co/kdlYUwTjvt for you to stress-test the assumptions yourself.

TopadoDuran's tweet photo. Here is a Citrini critique from a Macro-AI lens.

Citrini's "2028 Intelligence Crisis" direction of labor market impact is sensible. The velocity is wrong. His scenario timeline is off by ~ 5 years + because his AI capability assumptions are anecdotal, not grounded in the math of how models actually improve.

His core assumptions:
1) The AI feedback loop is sustained by "AI Investment Increases & AI Capabilities Improve" — implying that as long as you throw money at compute, you get better models that unlock capabilities replacing humans at a given task
2) He extrapolates from what he's observed anecdotally about AI agents to "smooth sailing" toward autonomous multi-week agents by 2027. This is a guess, not an informed view

Let's rigorize his feedback loop with some Macro-AI:

scaling laws → capabilities → task automation → displacement

The relationship between scaling laws and capabilities is determined by a "Capability Transfer" function. This relationship is non-linear and task-dependent. Capabilities don't emerge linearly from loss improvements — they phase-transition at specific thresholds (I model this with a sigmoid in pic below). Du et al. (2024) validates pre-training loss as the sufficient proxy for capability emergence below task-specific thresholds.

The hard tasks Citrini envisions — autonomous multi-week work by 2028 — likely have thresholds near the irreducible loss floor (E ≈ 1.82 nats). But here's the deeper issue: next-token prediction is a proxy objective. A model can approach E without acquiring the causal reasoning, goal persistence, and calibrated uncertainty that autonomous work demands. The frontier labs' pivot to post-training (RLHF, inference-time compute, tool use) is an implicit admission of this gap. All this to say, some hard tasks may not be solvable by scaling compute alone - which is what Citrini assumes.

If we know the capability transfer thresholds and map them to loss levels (and thus compute), we can estimate when complex tasks become feasible. (yup -> we can actually model this)

Using 0.40 OOM/year in compute growth and 0.30 OOM/year in algorithmic efficiency gains, I model that professional replacement (defined workflows: customer service, routine legal, basic analysis) starts mid-2026 — but cognitive displacement (multi-week autonomy, novel problem solving) only arrives ~2031. Citrini's crisis requires cognitive displacement. So his timeline starts at 2031 at earliest.

**AND** we haven't even addressed two more constraints that push it further: agent reliability decays exponentially with task complexity and the training data wall.

If I can vibe code myself into a web dev, I'll upload the interactive model at https://t.co/kdlYUwTjvt for you to stress-test the assumptions yourself.

5

95

7

102

298K

0

3

0

1

1K

Last Seen Users on Sotwe

Trends for you

Most Popular Users