Lin Yuwu

@linyuwu

Documenting the coming impact of AI on governments and industries | Study of expertise | Science of science | Open Government |

Taiwan

Joined February 2011

155 Following

54 Followers

141 Posts

Lin Yuwu @linyuwu

9 days ago

Some data from Anthropic on the increasing impact and improvements of their models.

Anthropic

@AnthropicAI

9 days ago

Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. https://t.co/OVVPJO7VQx

29K

15K

18M

Lin Yuwu @linyuwu

22 days ago

This is the right pattern to deal with systemic AI risks going forward. Have the frontier models play societal defense exclusively until systemic risks are sufficiently minimized or eliminated, depending on the severity of the risks.

Anthropic

@AnthropicAI

22 days ago

Last month we launched Project Glasswing, our collaborative AI cybersecurity initiative. Since then, we and our partners have found more than ten thousand high- or critical-severity vulnerabilities in essential software.

518

652

Lin Yuwu @linyuwu

24 days ago

An internal OpenAI model disproved a famous math belief that experts thought was true for decades. This is the "move 37" for math that many have been waiting for. "https://t.co/KkssFMEAFo"

Lin Yuwu @linyuwu

2 months ago

Intelligence in essence is good pathfinding so in hindsight it's clear next character prediction was going to yield some form of artificial intelligence

Who to follow

0xJeff

@0xJeff

Researcher/Investor - sharing insights from 200+ convos with founders & builders at https://t.co/c7GxaWoasV | Ex-TradFi

CITY INDEX

@CityIndex

Global provider of #Forex & #CFD trading. Saracens sponsor #YourSaracens 70% of retail CFD accounts lose money

Jay

@JozsefSzalma

Once built a novel neural network, to resolve a Jira ticket. Personal opinions expressed.

Lin Yuwu @linyuwu

4 months ago

@DominiqueCAPaul @grok why is China's contribution to GitHub declining

Lin Yuwu @linyuwu

4 months ago

@elonmusk @peterrhague @grok who do you agree with more?

258

Lin Yuwu @linyuwu

5 months ago

Both Grok-4.2 and ChatGPT-5.2 crossed the important milestone of solving previously unsolved math problems at roughly the same time (Jan 2026).

linyuwu's tweet photo. Both Grok-4.2 and ChatGPT-5.2 crossed the important milestone of solving previously unsolved math problems at roughly the same time (Jan 2026). https://t.co/uhIV4LPn94

Paata Ivanisvili

@PI010101

5 months ago

Disclaimer: I had given early access to internal beta version of Grok 4.20 It found a new Bellman function for one of the problems I’d been working on with my student N. Alpay. The problem reduces to identifying the pointwise maximal function U(p,q) under two constraints and understanding the behavior of U(p,0). In our paper https://t.co/pgJw9MaEA1 we proved U(p,0)\geq I(p), where I(p) is the Gaussian isoperimetric profile, I(p) ~ p\sqrt{log(1/p)} as p ~ 0. After ~5 minutes, Grok 4.20 produced an explicit formula U(p,q) = E \sqrt{q^2+\tau}, where \tau is the exit time of Brownian motion from (0,1) starting at p. This yields U(p,0)=E\sqrt{\tau} ~ p log(1/p) at p ~ 0, a square root improvement in the logarithmic factor. Any significance of this result? It will not tell you how to change the world tomorrow. Rather, it gives a small step toward understanding what is going on with averages of stochastic analogs of derivatives (quadratic variation) of Boolean functions: how small can they be? More precisely, this gives a sharp lower bound on the L1 norm of the dyadic square function applied to indicator functions 1_A of sets A \subset [0,1]. In my previous tweet about Takagi function, we saw that the sharp lower bound on ||S_1(1_A)||_1 miraculously coincides with Takagi function of |A| which (surprisingly to me) is related to the Riemann hypothesis. Here, we obtain a sharp lower bound on ||S_2(1_A)||_1 given by E \sqrt{\tau}, where Brownian motion starts at |A|. This function belongs to the family of isoperimetric-type profiles, but unlike the fractal Takagi function, it is smooth and does not coincide with the Gaussian isoperimetric profile. Finally, in harmonic analysis it is known that the square function is not bounded in L^1. The question here was more about curiosity: how exactly does it blow up when tested on Boolean functions 1_A. Previously, the best known lower bound was |A|(1-|A|) (Burkholder—Davis—Gandy). In our paper, we obtained |A| (1-|A|)\sqrt{log(1/(|A|(1-|A|)))}. This new Grok’s Bellman function gives |A| (1-|A|) \log(1/(|A|(1-|A|))) and this bound is actually sharp.

$PI010101's tweet photo. Disclaimer: I had given early access to internal beta version of Grok 4.20 It found a new Bellman function for one of the problems I’d been working on with my student N. Alpay. The problem reduces to identifying the pointwise maximal function U(p,q) under two constraints and understanding the behavior of U(p,0). In our paper https://t.co/pgJw9MaEA1 we proved U(p,0)\geq I(p), where I(p) is the Gaussian isoperimetric profile, I(p) ~ p\sqrt{log(1/p)} as p ~ 0. After ~5 minutes, Grok 4.20 produced an explicit formula U(p,q) = E \sqrt{q^2+\tau}, where \tau is the exit time of Brownian motion from (0,1) starting at p. This yields U(p,0)=E\sqrt{\tau} ~ p log(1/p) at p ~ 0, a square root improvement in the logarithmic factor. Any significance of this result? It will not tell you how to change the world tomorrow. Rather, it gives a small step toward understanding what is going on with averages of stochastic analogs of derivatives (quadratic variation) of Boolean functions: how small can they be? More precisely, this gives a sharp lower bound on the L1 norm of the dyadic square function applied to indicator functions 1_A of sets A \subset [0,1]. In my previous tweet about Takagi function, we saw that the sharp lower bound on ||S_1(1_A)||_1 miraculously coincides with Takagi function of |A| which (surprisingly to me) is related to the Riemann hypothesis. Here, we obtain a sharp lower bound on ||S_2(1_A)||_1 given by E \sqrt{\tau}, where Brownian motion starts at |A|. This function belongs to the family of isoperimetric-type profiles, but unlike the fractal Takagi function, it is smooth and does not coincide with the Gaussian isoperimetric profile. Finally, in harmonic analysis it is known that the square function is not bounded in L^1. The question here was more about curiosity: how exactly does it blow up when tested on Boolean functions 1_A. Previously, the best known lower bound was |A|(1-|A|) (Burkholder—Davis—Gandy). In our paper, we obtained |A| (1-|A|)\sqrt{log(1/(|A|(1-|A|)))}. This new Grok’s Bellman function gives |A| (1-|A|) \log(1/(|A|(1-|A|))) and this bound is actually sharp.$

167

258

567

Lin Yuwu @linyuwu

over 1 year ago

Reinforcement learning as a method may be inherently truth-seeking. From the DeepSeek models, it seems reinforcement learning with reasoning traces may purge biases from the models as the full R1 model is more objective and truthful than the distillations and the V3 model.

Lin Yuwu @linyuwu

over 1 year ago

So while it is conceivable for a crash of prices for existing goods and services to occur because the demand for such goods and services is limited by the human population, a correspondent long-term crash in wages is not a likely outcome. 3/

Lin Yuwu @linyuwu

over 1 year ago

In a post-AI economy: A crash in prices of existing goods and services without the concurrent crash in wages is the more likely scenario for several reasons: 1. There is a near-infinite amount of human-specific activity humans can engage in. 1/

Marc Andreessen 🇺🇸

@pmarca

over 1 year ago

A world in which human wages crash from AI -- logically, necessarily -- is a world in which productivity growth goes through the roof, and prices for goods and services crash to near zero. Consumer cornucopia. Everything you need and want for pennies.

11K

785

Lin Yuwu @linyuwu

over 1 year ago

2. In an abundance scenario, if there were not enough private sector jobs (a big if), the governments can step in to provide public sector jobs, financed by the additional revenue potential necessitated by said abundant scenario. 2/

Lin Yuwu @linyuwu

over 1 year ago

This means that closed models have no moats unless restriction to access is placed on the "chain of thought" traces as OpenAI is already doing. 2/2

Lin Yuwu @linyuwu

over 1 year ago

Extracting "intelligence" from AI models: From the DeepSeek V3 paper (https://t.co/xwjoXxPNKe) we also see that "intelligence" could be extracted/distilled from models quite efficiently, using 800k samples (very roughly tens to hundreds of billions of tokens). 1/2

linyuwu's tweet photo. Extracting "intelligence" from AI models:

From the DeepSeek V3 paper (https://t.co/xwjoXxPNKe) we also see that "intelligence" could be extracted/distilled from models quite efficiently, using 800k samples (very roughly tens to hundreds of billions of tokens).

1/2 https://t.co/sflpRtffKn

Lin Yuwu @linyuwu

over 1 year ago

Regarding "AGI", the key insight that o1, o3, and R1 proved is that valuable synthetic data that future models can be trained on can be produced by expending computational power alone (by means of reinforcement learning). That was what Ilya saw.

Lin Yuwu @linyuwu

over 1 year ago

The latest scores from Chatbot Arena LLM Leaderboard were just released, and the open-source model DeepSeek R1 is on par with the frontier models. DeepSeek R1 is 25 point ELO points behind the top-ranking model Gemini 2.0 Flash-Thinking-Exp-01-02.

linyuwu's tweet photo. The latest scores from Chatbot Arena LLM Leaderboard were just released, and the open-source model DeepSeek R1 is on par with the frontier models.

DeepSeek R1 is 25 point ELO points behind the top-ranking model Gemini 2.0 Flash-Thinking-Exp-01-02. https://t.co/ik92viXvQO

134

Lin Yuwu @linyuwu

over 1 year ago

@fchollet Humans solve Arc-AGI-1 problems through physical/visual intuitions gained from interacting with the physical world, a domain the models have not had access to. A blind human from birth may not do well on Arc-AGI-1 problems.

151

Lin Yuwu @linyuwu

almost 2 years ago

The release of model o1 is the first public confirmation that the path to "AGI" is clear and it is only bound by compute as this point.

Lin Yuwu @linyuwu

almost 2 years ago

In what may be the biggest breakthrough since ChatGPT, OpenAI's releases model o1. It uses reinforcement learning techniques to achieve a quantum leap over previous models in math, coding, and other reasoning capabilities/accuracy.

linyuwu's tweet photo. In what may be the biggest breakthrough since ChatGPT, OpenAI's releases model o1. It uses reinforcement learning techniques to achieve a quantum leap over previous models in math, coding, and other reasoning capabilities/accuracy. https://t.co/TZSddbCr2x

107

Lin Yuwu @linyuwu

about 2 years ago

In the current debate around "synthetic data", part of the confusion is the result of experts using the same term to describe different things. Synthetic data should just be defined as all novel data obtained by means of compute.

Jim Fan

@DrJimFan

about 2 years ago · Gilroy

Does AlphaZero count as training on synthetic data? There’s no human grandmaster data at all. AlphaZero expands its strategies & wisdom indefinitely with self-driven exploration and compute. The input is just a simple Go/Chess simulator that implements the game rules.

370

104

125K

Lin Yuwu @linyuwu

about 2 years ago

@AndrewYNg Useful synthetic data can also come from increasingly elaborate simulations

839

Lin Yuwu

@linyuwu

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users