Lingjun_capital

@LingjunCapital

Joined April 2023

214 Following

61 Followers

1.4K Posts

LingjunCapital retweeted

Holger Zschaepitz

@Schuldensuehner

4 days ago

One of the most striking charts out there: the SOX index has risen sixfold since 2020, while SOX earnings per share have climbed fivefold. In other words, most of the rally has been backed by real earnings growth, not just higher valuations. The key question now is whether these profits will actually come through and whether they can keep rising. (HT Goldman Sachs)

Schuldensuehner's tweet photo. One of the most striking charts out there: the SOX index has risen sixfold since 2020, while SOX earnings per share have climbed fivefold. In other words, most of the rally has been backed by real earnings growth, not just higher valuations. The key question now is whether these profits will actually come through and whether they can keep rising. (HT Goldman Sachs)

414

41K

LingjunCapital retweeted

Richard Sutton

@RichardSSutton

4 days ago

A new and possibly controversial perspective: In this video, I explain the sense in which generative AI trained by supervised learning is incapable of making novel discoveries. https://t.co/zin5QbbT9N The text of the speech: AI Creativity and Discovery Good day ladies and gentlemen. I regret that I am unable to be with you all today to engage in a back-and-forth discussion, but I am nevertheless pleased to be able to share with you, via this recording, some high-level thoughts about the current and future state of artificial intelligence, and in particular about AI’s relationship to science and mathematics, which is, as I understand it, the central focus of this meeting and of the SAIR Foundation. I would like to start with an old joke; I am sure you have heard it before. It is the one about the researcher whose work is being evaluated, and the review comes back, and says “This work is both novel and good. Unfortunately, the parts that are good are not novel, and the parts that are novel are not good.” My first point about AI is that this assessment applies exactly to large parts of AI as we know it today. Not all of today’s AI, but a large part of it. Pretty much all of what we mean by “Generative AI”---which includes large language models, and the images and video models, and even the new methods for learning world models. All of these AIs take large numbers of examples and produce a “model” which behaves similar to the examples, that is, which generates text like people, or images like artists or nature, and videos like we find on the internet. Don’t get me wrong, Generative AI can be extremely useful. No doubt about that. But the assessment of the joke still applies. These systems can produce output that is both novel and good, but not at the same time. In many ways this is just absolutely not a problem. When we ask an AI for an answer from the internet, or to summarize a document, we don’t want it to be novel. We are happy if the quality of the answer, the goodness, comes from the source material—from the people who wrote the document or the articles on the internet. If the AI’s answer is novel it means it is going beyond the source material, adding something beyond it. This is what we call “hallucinations”. In most cases, we don’t like it when the AI makes something up, when it adds something novel. One exception, of course, is when we are looking not for facts or reality, but for fiction and entertainment. We might ask for a bedtime story for a child, or an image based on existing images on the internet but which is nevertheless different and distinct from them. In these cases, it is never easy for us to know how creative the AI is actually being, as we do not know how close the AI’s story, poem, or image is to the source material. In a real practical sense we can not know this because the internet is too big, the possible sources that the AI may draw upon are too numerous. When we ask for a fiction or novelty, the AI can give it to us because its processing is in part stochastic. Every decision can go multiple ways and will go different ways and produce a different trajectory every time. The trajectory can be random—and thus novel—or it can be based on the training data—and thus “good” because the training data is good, sourced from people or reality. Thus, the trajectory is either novel or good—based on randomness or based on data—but never both at the same time. Really, I think it is okay if the output of Generative AI is never good and novel at the same time. For the researcher in the joke this is a devastating criticism, but for most things it is not, and for Generative AI it is not. Generative AI is meant to be a mimic. This is what supervised learning is for. Generative AI can be extremely useful, even when it just mimics, if it is faster, or cheaper, or smaller, or more customizable, or more copy-able, than the thing being mimicked. It is okay if Generative AI cannot be both novel and good at the same time. It is still a transformative technology. But it is a limitation. And remember we are here to use AI for science and mathematics, and for these areas the assessment of the reviewer in the joke is devastating. For these areas we need true creativity and discovery. Generative AI—or Mimicking AI—will never get where us there. For these we need something more, and indeed we have something more in other parts of AI. We have many AI systems which can give us more. We have AlphaGo with its world-changing move 37, or AlphaZero with its brilliant original chess-playing style. We have GT-Sophy that drives simulated racecars better than any human. We have AlphaFold and AlphaProof and Claude-Code, which have brought true advances in science, mathematics, and programming. We have RL-Lyft which optimizes the assignment of cars to passengers in the ride-hailing business. All these systems have found things that are both novel and good. And, truth be told, some language models have been augmented in ways that make them more than Generative AI based on supervised learning. All these systems have some additional features that make them capable of true creativity and true discovery. It is important for us to recognize what this is—and that it is not present in ordinary, garden-variety Generative AI. It is something that can not come from just supervised learning, from learning from examples. What is it? Well, it is a simple thing, a commonsense thing. It is not new. We have many names for it, but unfortunately none of them are very good names. I will call it Discovery. Basically, Discovery is just the idea of trying many things and seeing which of them work, then keeping those that worked the best. Evolution by natural selection works this way. The scientific method works this way. And just ordinary life and learning works this way. We try things and remember what works. What could be more obvious? In this behavioral case, psychology has two names for it— “instrumental learning” and “operant conditioning”—and in machine learning it is what we mean by “reinforcement learning”. We also see the idea of Discovery in planning and combinatorial search—anything that involves the idea of “generate and test”. The essence of Discovery is to combine three steps: 1. Variation, 2. Evaluation, and 3. Selective retention. Of course, I am not the first to say this. I am not the first to point out that this combination of steps is key to science, to evolution by natural selection, and to animal behavior. I think particularly of papers by Donald Campbell, by Daniel Dennett, and by Gary Cziko. What is new in my remarks is to directly relate the idea of Discovery to modern AI to help us see that it is not present in supervised learning or Generative AI—in particular, that Discovery is not present in backpropagation or gradient descent. Let me say explicitly what is missing from Generative AI. As we have remarked, these systems do have a stochastic aspect, so they do generate a variety of trajectories and behavior. What is missing is the Evaluation step. The generator was pre-trained by supervised learning, leaving no way at runtime to Evaluate what it generates. And of course without Evaluation there can be no Selective retention, and thus no Discovery. The variation can bring novelty, but without evaluation there is no Discovery, and arguably, no creativity. That is, I would say that creativity requires that the new things generated be Evaluated. Without evaluation, and retention of the best, there is nothing created. The novelty flickers into existence but, if its value is unrecognized, it flickers away and is lost. In many cases, Evaluation is done by people to make a discovery. As when we have Generative AI make many pictures for us, and then we pick the one that we like the best. The human+AI system completes the discovery. In many other cases, the Evaluation comes from a clear objective. Some moves lead to checkmate, some steps lead to a proof, some actions result in high reward, some genotypes make more copies, some theories explain the data better. Some prefer the Variation step to be called Blind variation, where “blind” here means that it is uninformed, a shot in the dark. It does not need to be completely uninformed; a good scientist does not select theories to test at random. But neither can it be completely informed and determined. There must be some uncertainty about where the answer lies in order for there to be a discovery. In practice, the variation is partly informed and partly blind, but it is the blind part that corresponds to the discovery. Now let us briefly go all the way to modern deep learning, to the backpropagation algorithm. At first it might seem that backpropagation is incapable of discovery because it is deterministic and thus incapable of variation. But this is not correct. The weight updates of backprop are deterministic, but the weights are initialized to small random values. The random initialization is often downplayed, but in fact it is a necessary form of variation; it must be done properly to get good performance. In backprop this Variation is done once, at network initialization, so its effect is temporary, and later the network may lose its ability to learn. This is the weakness of deep learning that is alleviated with a new algorithm that my group presented in Nature a couple of years ago. Our “continual backpropagation” made one small change: every so often a less-used neuron would be re-initialized to small random weights. This allows the variation to continue and plasticity to be retained. Although there is much more to be said about Creativity and Discovery, this is the key point: they are more than supervised learning, more than pattern recognition, more than prediction, and more than world modeling. Those things are important, but they alone will not bring us to discovery. Discovery requires Evaluation from a person or from an explicit goal, and only in the latter case will we attain full autonomy. So that is my call to arms. If we want the full power of AI scientists, then we should share the goals with them so they can create, evaluate, discover, and in these ways fully participate in achieving the goals. Let’s be bold! Let’s fully automate Creativity and Discovery!

257

601K

LingjunCapital retweeted

思维怪怪

@0xLogicrw

3 days ago

强化学习之父 Richard Sutton 发布最新演讲《AI Creativity and Discovery》，指出仅靠监督预训练的生成式 AI 本质上是「模仿式 AI」（Mimicking AI），无法在科学和数学领域实现真正的创造与发现。 Sutton 此前曾发推以 26 字重申「苦涩的教训」（The Bitter Lesson）立场，呼吁专注于搜索与学习等能随算力扩展的通用方法，而非人类知识。 Sutton 阐明，真正的发现与创造必须包含变异（Variation）、评估（Evaluation）和选择性保留（Selective Retention）。生成式 AI 虽有采样随机性带来的变异，但运行时（runtime）缺乏评估机制，无法选择性保留有价值的生成。这导致生成式 AI 的输出要么因随机而新颖，要么因数据而优秀，但无法同时做到新颖且优秀，试图超越源材料便会产生幻觉。 Sutton 强调，发现的本质是基于明确目标函数的「尝试与保留」，这存在于强化学习、规划和组合搜索中，但纯预测和监督学习并不具备。即使是反向传播算法（Backpropagation），在运行时也是确定性的，变异仅发生在权重初始化阶段，导致神经网络在训练中逐渐丧失可塑性（Plasticity）。为此，他介绍了此前发表于《自然》（Nature）的持续反向传播算法（Continual Backpropagation），通过定期随机重置低使用率神经元的权重，使网络得以维持持续变异与学习能力。 Sutton 最后呼吁，若要实现全自动自主 AI 科学家，人类必须将科学探索的衡量标准写成算法可度量的「目标函数」共享给模型。只有让 AI 在运行时自主进行「生成尝试、利用目标打分、选择性保留」的完整闭环，不再依赖人类在屏幕前充当裁判，才能真正自动化科学发现与创造。

LingjunCapital retweeted

Millie Marconi

@MillieMarconnni

3 days ago

A British biologist looked at 200,000 years of human history and found that the entire reason humans broke out of poverty was not intelligence, not language, not even agriculture, but one mechanism so simple a 6-year-old could explain it. His name is Matt Ridley. He is a zoologist by training, an evolutionary biologist by career, and in 2010 he wrote a book called The Rational Optimist that quietly argued the most important fact about human progress had been hiding in plain sight for the entire history of economics. Naval Ravikant has been telling people to read everything Ridley has ever written for the last 15 years. The reason is the argument inside this one book. For 200,000 years, anatomically modern humans walked around with the same brain you have right now. Same skull size. Same neural architecture. Same raw capacity for language, planning, and abstract thought. For roughly 190,000 of those years, almost nothing happened. Generation after generation lived and died inside the same Stone Age toolkit their great-great-grandparents had used. Then somewhere around 50,000 years ago, the line on the chart of human progress started to tick upward. Then it bent. Then it exploded. The question Ridley spent years on was the only question that mattered. What changed. It was not the brain. The brain had been the same for 190,000 years. It was not language, which had existed long before the takeoff. It was not even agriculture, which arrived only 10,000 years ago and was actually preceded by the upward bend, not the cause of it. What changed was that humans started trading with strangers. This sounds too small to be the answer. Ridley argues that it is the answer to almost everything. The moment one human exchanged a useful object with another human from a different group, something happened that no other species on earth had ever done. Two ideas that had developed in isolation came into contact. The flint knapper learned what the spear maker had figured out. The fisherman from the coast learned what the hunter from the forest had figured out. The two pieces of knowledge fused into something neither side could have produced alone. Ridley calls this ideas having sex. The phrase sounds frivolous and it is meant to. The point is that ideas, like genes, get better when they combine with other ideas from different lineages. An idea sitting inside one head, no matter how brilliant the head, eventually hits a ceiling. The same idea exposed to ten thousand other ideas does something genes do under sexual reproduction. It mixes. It recombines. It produces offspring nobody planned. The cleanest proof of this argument is the most uncomfortable case study in the book. Tasmania. Around 10,000 years ago, rising sea levels cut Tasmania off from mainland Australia. A population of roughly 4,000 humans was now isolated on an island, with no possibility of contact with the rest of humanity. They had the same brains. The same language. The same starting toolkit as their cousins 150 kilometers north. The natural experiment was now running. What happened next is something no economist or geneticist had ever predicted. The mainland Australians kept inventing. Boomerangs. Spear-throwers. Fishing nets. Bone needles for sewing fitted clothes. Watercraft with paddles. Their technology compounded slowly across the centuries. The Tasmanians went the other way. They did not just fail to invent the new tools their cousins were developing. They started losing the tools they already had. Fishing was abandoned within a few thousand years. Bone tools disappeared. Fitted clothing disappeared. They forgot how to make fire from scratch and started carrying lit firebrands from camp to camp instead, relighting their fires from a neighbor's whenever their own went out. By the time European explorers arrived in the 17th century, the Tasmanians had the simplest toolkit of any human society ever recorded. Their material culture had gone backward for 8,000 years. The archaeologist Rhys Jones called it a slow strangulation of the mind. Joseph Henrich at Harvard later proved with formal mathematical models that there was nothing wrong with Tasmanian brains. There was something wrong with their network. A toolkit requires a critical mass of people exchanging skills to maintain itself. The act of teaching a skill is imperfect. Every generation loses a small percentage of what the last generation knew. If your population is large enough and trading widely enough, those losses get caught and corrected by someone else who still remembers. If your population shrinks below a certain threshold and stops mixing with outsiders, the small losses compound until entire technologies disappear. This is the part that should haunt anyone reading this in 2026. Intelligence is not a property of the individual brain. Intelligence is a property of the network the brain is connected to. A genius in isolation will produce less than a mediocre thinker inside a dense exchange of other mediocre thinkers. The thing your ancestors needed in order to break out of 190,000 years of stagnation was not better brains. It was better connections between brains they already had. The implication for any individual is direct and uncomfortable. If you are smart and isolated, you will be outproduced by people half as smart who are connected. The most successful people in any field are almost never the smartest people in it. They are the ones positioned at the intersection of the most idea flows. They are reading more authors than their competitors. They are talking to more people from more disciplines. They are in the rooms where ideas from different lineages bump into each other. Ridley ends the book on the line that sounds optimistic but is actually a warning its this "The future will be invented by people who connect ideas, not by people who guard them."

MillieMarconnni's tweet photo. A British biologist looked at 200,000 years of human history and found that the entire reason humans broke out of poverty was not intelligence, not language, not even agriculture, but one mechanism so simple a 6-year-old could explain it.

His name is Matt Ridley.

He is a zoologist by training, an evolutionary biologist by career, and in 2010 he wrote a book called The Rational Optimist that quietly argued the most important fact about human progress had been hiding in plain sight for the entire history of economics.

Naval Ravikant has been telling people to read everything Ridley has ever written for the last 15 years. The reason is the argument inside this one book.

For 200,000 years, anatomically modern humans walked around with the same brain you have right now. Same skull size. Same neural architecture. Same raw capacity for language, planning, and abstract thought.

For roughly 190,000 of those years, almost nothing happened. Generation after generation lived and died inside the same Stone Age toolkit their great-great-grandparents had used. Then somewhere around 50,000 years ago, the line on the chart of human progress started to tick upward. Then it bent. Then it exploded.

The question Ridley spent years on was the only question that mattered. What changed.

It was not the brain. The brain had been the same for 190,000 years. It was not language, which had existed long before the takeoff. It was not even agriculture, which arrived only 10,000 years ago and was actually preceded by the upward bend, not the cause of it.

What changed was that humans started trading with strangers.

This sounds too small to be the answer. Ridley argues that it is the answer to almost everything. The moment one human exchanged a useful object with another human from a different group, something happened that no other species on earth had ever done.

Two ideas that had developed in isolation came into contact. The flint knapper learned what the spear maker had figured out. The fisherman from the coast learned what the hunter from the forest had figured out. The two pieces of knowledge fused into something neither side could have produced alone.

Ridley calls this ideas having sex. The phrase sounds frivolous and it is meant to. The point is that ideas, like genes, get better when they combine with other ideas from different lineages.

An idea sitting inside one head, no matter how brilliant the head, eventually hits a ceiling. The same idea exposed to ten thousand other ideas does something genes do under sexual reproduction. It mixes. It recombines. It produces offspring nobody planned.

The cleanest proof of this argument is the most uncomfortable case study in the book. Tasmania.

Around 10,000 years ago, rising sea levels cut Tasmania off from mainland Australia. A population of roughly 4,000 humans was now isolated on an island, with no possibility of contact with the rest of humanity. They had the same brains. The same language. The same starting toolkit as their cousins 150 kilometers north. The natural experiment was now running.

What happened next is something no economist or geneticist had ever predicted.

The mainland Australians kept inventing. Boomerangs. Spear-throwers. Fishing nets. Bone needles for sewing fitted clothes. Watercraft with paddles. Their technology compounded slowly across the centuries.

The Tasmanians went the other way. They did not just fail to invent the new tools their cousins were developing. They started losing the tools they already had. Fishing was abandoned within a few thousand years. Bone tools disappeared. Fitted clothing disappeared. They forgot how to make fire from scratch and started carrying lit firebrands from camp to camp instead, relighting their fires from a neighbor's whenever their own went out.

By the time European explorers arrived in the 17th century, the Tasmanians had the simplest toolkit of any human society ever recorded. Their material culture had gone backward for 8,000 years.

The archaeologist Rhys Jones called it a slow strangulation of the mind.

Joseph Henrich at Harvard later proved with formal mathematical models that there was nothing wrong with Tasmanian brains. There was something wrong with their network. A toolkit requires a critical mass of people exchanging skills to maintain itself.

The act of teaching a skill is imperfect. Every generation loses a small percentage of what the last generation knew. If your population is large enough and trading widely enough, those losses get caught and corrected by someone else who still remembers.

If your population shrinks below a certain threshold and stops mixing with outsiders, the small losses compound until entire technologies disappear.

This is the part that should haunt anyone reading this in 2026.

Intelligence is not a property of the individual brain. Intelligence is a property of the network the brain is connected to. A genius in isolation will produce less than a mediocre thinker inside a dense exchange of other mediocre thinkers.

The thing your ancestors needed in order to break out of 190,000 years of stagnation was not better brains. It was better connections between brains they already had.

The implication for any individual is direct and uncomfortable. If you are smart and isolated, you will be outproduced by people half as smart who are connected.

The most successful people in any field are almost never the smartest people in it. They are the ones positioned at the intersection of the most idea flows. They are reading more authors than their competitors. They are talking to more people from more disciplines. They are in the rooms where ideas from different lineages bump into each other.

Ridley ends the book on the line that sounds optimistic but is actually a warning its this "The future will be invented by people who connect ideas, not by people who guard them."

235

531K

Who to follow

alexb

@alexsanderb83

professional fortniter

LingjunCapital retweeted

DittoJo

@DittoJoBrr

3 days ago

看Druckenmiller的访谈，他刚开始买Nvidia的时候甚至都不知道该怎么拼，也不知道Nvidia的收入算多还是算少。他不是AI产业链的专家，不是那种看透未来十年的愿景型投资者；他真正厉害的是二阶的模式识别的能力：他懂的不是AI产业本身，而是市场面对结构性的变革会如何反应，以及对当下在市场认知结构变化中的位置有大致的判断。所以他说自己会先投资再研究，因为尽管对产业的具体了解并不是全面的，但是对目前市场的认知结构是有一种直觉式的认识，这才是真正AI代替不了的判断力啊。

LingjunCapital retweeted

小C哈哈哈

@dashen9999

4 days ago

为什么现在很多人推崇国外原版教材？说实话，美国教科书是真的很能打。能打到什么程度？细细的读了你根本就不用老师教。选一本大出版社（HMH， Mcgraw Hill，Pearson）的教材，一章一章的跟着学，你就无敌了。不撒点干货都有点对不起看我帖子的人。事实上很多美国大学都有免费网课. 哈佛网课：https://t.co/9uL3ed1PU7 史丹佛网课：https://t.co/0JIyS4ZCdA 卡内基梅隆：https://t.co/48GxzsvhJI 但我个人最最喜欢的还是可汗学院，尤其是文科，开着车听课即使不看黑板也很舒服:https://t.co/qRMCDbsMmf 祝学子们天道酬勤, 不负华年。

366

79K

LingjunCapital retweeted

goldenlabubuwatch

@pandawatch88

5 days ago

Pfizer CEO on the Norges interview, he goes to bed and wakes up with two things in his mind: "China and AI." common theme too with other pharma CEO interviews. He said a few interesting things, below, and before you start screaming how much you dislike this guy (imagine thinking Greeks ever cared about being liked) just discount his message according to your bias. For me, a lot of what he is saying is probably pumped on purpose to send a message to his regulators (wake up or China will eat our lunch). But still worth incorporating into your thinking. 1-In his view, Chinese pharma strength right now is in early stage (discovery, chemistry, testing, preclinical) "for every one area we have a company working on something novel the Chinese have ten companies working on it too" and on R&D productivity "they do things 3 times faster and at half the cost" 2-Another strength is the technical capability of the regulator, paraphrasing "they spent years copying the FDA, iterating, their process mirrors it, but is now faster, and with an ability to quickly distinguish garbage from diamonds" (the Novartis guy also mentioned something like this) 3-And final strength, most surprising for many: he says it is a great place to invest because of how they have developed a very strong IP protection system (imagine that) "the judiciary system can resolve IP issues very quickly, if someone challenges your patent you are not having uncertainty for 10 years" 4-He says Chinese pharma weakness is they still don't have global clinical development teams neither global commercialization teams. That is what makes all these licensing deals with global pharmas possible: ---usd200bn deals announced over the past couple of years, latest actually on Friday by Pfizer usd650m + usd10bn licensing deal with Innovent for 12 early stage oncology candidates 5-However, he thinks licensing is not a sustainable strategy for global pharmas, because at some point Chinese pharmas will stop selling their drugs to global players, and instead go to market themselves to keep the profit 6-So the only way long term is to be better than them, and that requires an exponential change to productivity, since the benchmark is the above "3 times faster, half the cost" (hint to the regulator there) 7-Finally after 30min of talking he decides to directly point at the regulator saying "stop spending 80% of your time and effort in slowing down China, because it is not going to work, and instead spend 80% of your time an effort in helping us accelerate" Now, from an investing perspective: 8-There is a big mismatch between China biotech "insider vibes" from CEOs, industry participants, and commercial newsflow, vs share price performance 9-Innovent stock was up 11% on Friday on the back of the Pfizer deal, but has been rangebound for almost 1 year, and recently starting to break lower. Similar story with Beone or Akeso. Other China pharma bluechips like Hengrui are already down 30% (from big multiples true). ---On the fundamental front, Innovent and Beigene had 2025 as their first year of profitability, with top line growing at 40%. They trade at 10x 6x P/S respectively 10-Similar story around services, where only Wuxi Apptec is working, while XDC and Biologics are rangebound (despite XDC saying they are fully booked till 2030, almost as if they were making datacenter gas turbines), and tier two names like tigermed, pharmaron, joinn starting to break lower. 11-Medical equipment is even worse, from big guys like Mindray to smaller like Tofflon. 12-Even AI pharma pump names like Insilico are breaking (this company btw gives me forbes 40 under 40 vibes, though I confess I haven't bothered to look in detail, it may be a fantastic business, you tell me) So, a lot of good stuff happening for people, new good cheaper drugs, not a lot of good stuff happening for investors. May be a global sector issue, may be a flow issue, which generalist in its right mind is selling Nvidia to buy Chinese pharmas.

pandawatch88's tweet photo. Pfizer CEO on the Norges interview, he goes to bed and wakes up with two things in his mind: "China and AI." common theme too with other pharma CEO interviews.

He said a few interesting things, below, and before you start screaming how much you dislike this guy (imagine thinking Greeks ever cared about being liked) just discount his message according to your bias.

For me, a lot of what he is saying is probably pumped on purpose to send a message to his regulators (wake up or China will eat our lunch). But still worth incorporating into your thinking.

1-In his view, Chinese pharma strength right now is in early stage (discovery, chemistry, testing, preclinical) "for every one area we have a company working on something novel the Chinese have ten companies working on it too" and on R&D productivity "they do things 3 times faster and at half the cost"

2-Another strength is the technical capability of the regulator, paraphrasing "they spent years copying the FDA, iterating, their process mirrors it, but is now faster, and with an ability to quickly distinguish garbage from diamonds" (the Novartis guy also mentioned something like this)

3-And final strength, most surprising for many: he says it is a great place to invest because of how they have developed a very strong IP protection system (imagine that) "the judiciary system can resolve IP issues very quickly, if someone challenges your patent you are not having uncertainty for 10 years"

4-He says Chinese pharma weakness is they still don't have global clinical development teams neither global commercialization teams. That is what makes all these licensing deals with global pharmas possible:
---usd200bn deals announced over the past couple of years, latest actually on Friday by Pfizer usd650m + usd10bn licensing deal with Innovent for 12 early stage oncology candidates

5-However, he thinks licensing is not a sustainable strategy for global pharmas, because at some point Chinese pharmas will stop selling their drugs to global players, and instead go to market themselves to keep the profit

6-So the only way long term is to be better than them, and that requires an exponential change to productivity, since the benchmark is the above "3 times faster, half the cost" (hint to the regulator there)

7-Finally after 30min of talking he decides to directly point at the regulator saying "stop spending 80% of your time and effort in slowing down China, because it is not going to work, and instead spend 80% of your time an effort in helping us accelerate"

Now, from an investing perspective:

8-There is a big mismatch between China biotech "insider vibes" from CEOs, industry participants, and commercial newsflow, vs share price performance

9-Innovent stock was up 11% on Friday on the back of the Pfizer deal, but has been rangebound for almost 1 year, and recently starting to break lower. Similar story with Beone or Akeso. Other China pharma bluechips like Hengrui are already down 30% (from big multiples true).
---On the fundamental front, Innovent and Beigene had 2025 as their first year of profitability, with top line growing at 40%. They trade at 10x 6x P/S respectively

10-Similar story around services, where only Wuxi Apptec is working, while XDC and Biologics are rangebound (despite XDC saying they are fully booked till 2030, almost as if they were making datacenter gas turbines), and tier two names like tigermed, pharmaron, joinn starting to break lower.

11-Medical equipment is even worse, from big guys like Mindray to smaller like Tofflon.

12-Even AI pharma pump names like Insilico are breaking (this company btw gives me forbes 40 under 40 vibes, though I confess I haven't bothered to look in detail, it may be a fantastic business, you tell me)

So, a lot of good stuff happening for people, new good cheaper drugs, not a lot of good stuff happening for investors. May be a global sector issue, may be a flow issue, which generalist in its right mind is selling Nvidia to buy Chinese pharmas.

146

127

55K

LingjunCapital retweeted

yibie

@yibie

3 days ago

训练小模型：2026 年最被低估的 AI 技能 2026 年 5 月 11 日，一个叫 CJ Zafir 的人发了一条推文。他想教普通人 fine-tune 开源模型。 2538 个赞，316 次转发，178,000 次观看。这条推文炸了。不是因为他发明了什么新东西——Unsloth 早在 2023 年就开源了，Hugging Face 上的 fine-tuning 教程有几百篇。炸了是因为他在一个所有 AI 公司都在比赛造「更大模型」的时候，反手指向了完全相反的方向：小模型。你自己训练的。在 Colab 上。花几十美元。两个月过去了。CJ 没有停。他把这条推文变成了一套完整的方法论：Codex 5.5 做大脑规划工作流，DeepSeek v4 Pro 做肌肉批量生成数据，Unsloth 做训练引擎，Qwen 3.5 4B 做基座模型。然后他用这套方法论，花了 $173 训练出一个 4B 模型，在垂直任务上精度 96%+，击败了 20 倍大的模型。再后来他做了个 voice agent 的工具调用 fine-tune——成本 $11。从 $80 到 $173 到 $11。这不是渐进式优化。这是在重置整个 AI 部署的经济学。一、CJ 方法论：为什么值得认真对待 CJ 不是第一个倡导 fine-tuning 的人。但他做对了一件事：他把整个流程的成本打到了个人开发者可以承受的水平，并且公开了每一步。工具链 | 角色 | 工具 | 成本 | |------|------|------| | 规划（Orchestrator） | Codex 5.5 | 订阅内 | | 数据生成（Generator） | DeepSeek v4 Pro | API 按量 | | 训练框架 | Unsloth | 免费开源 | | 训练硬件 | Google Colab Pro A100 80GB | ~$0.60/hr | | 推理部署 | llama.cpp / Ollama | 免费 | 数据流水线 CJ 最核心的洞察不是「用什么工具」，而是「怎么造数据」：「"Low quality data = Low quality model performance"」他的流水线是自进化的：Codex 设计工作流 → DeepSeek 批量生成数据行 → 每批次过 Quality Gates → Codex 根据上一批结果优化生成规格。结果是越跑越快、越便宜、质量越高。 $80 生成了 1 亿+参数的高质量数据集。不是简单 paraphrase，是「手工打造」每一行——Codex 设计的多样化 prompt 模板 + DeepSeek 的高速执行 + 可编程的质量门槛。模型选择 CJ 实测了 Gemma 4 和 Qwen 3.5 全系列后，结论很明确：「Qwen 3.5 9B 和 4B 是垂直训练的完美基座模型。Gemma 4 在垂直任务上不及 Qwen 3.5。」他的 Qwen 3.5 4B fine-tune 在精度和延迟上击败了 20 倍大的模型。不是什么小众 benchmark——是他自己在垂直任务上实测的结果。二、这不是 CJ 一个人的实验——行业数据在同时发声 CJ 的方法论看起来像「个人黑客技巧」，但独立的数据正在从各个方向验证同一个趋势。市场规模 SLM（Small Language Model，1B-13B 参数）市场 2024 年达到 65 亿美元，年复合增长率 25.7%。SLM 在大多数企业任务上已经匹配或超过 GPT-3.5 级别性能，只需一小部分成本。性能阿里巴巴的 Qwen 3.5 9B 在 MMLU-Pro、GPQA Diamond 和 multilingual MMMLU 三个基准上击败了 OpenAI 的 gpt-oss-120B——13 倍参数差距，小模型胜。另一项独立的 brand normalization 案例中，一个小模型达到了 90.6% 准确率，超越了 GPT-5.2 和 Gemini 3 Pro。企业已经在省钱——真正的省钱 | 公司 | 方案 | 效果 | |------|------|------| | Knowunity | 用 Distil Labs fine-tune SLM | 推理成本降 68% | | 某客服 agent | 从通用大模型切换到 fine-tuned SLM | 月成本从 $13,000 降到 $400 | | EliseAI | 专门训练的垂直小模型用于住房和医疗 | 推理成本降 60%，延迟降 80% | | AT&T | 专用 SLM 做客服和欺诈检测 | 数百万美元节省 | 400 美元一个月。一个客服 agent。这是几个工程师的年收入可以覆盖无限次 API 调用的部署方案。三、「数据集工厂」才是真正的护城河 CJ 在 5 月 22 日发了一条推文，八个词：「"The moat might not be the fine-tune itself. It might be the dataset factory."」这是整个小模型运动最重要的洞察。Fine-tuning 技能本身正在被民主化——Unsloth 已经能在 3GB VRAM 上训练，Colab 免费提供 GPU，CJ 连 prompt 模板都公开了。当人人都能做 fine-tune 的时候，差异化从「会不会做」转移到了「数据从哪里来」。中文社区的 AKIRAXCLAW 在一篇分析里补了一个关键观察：「「CJ 的工作流在 2026 年多数是业界共识，但真正难点不是工具——而是你有没有 200 笔干净数据 + 自动评估脚本。」」在 CJ 的体系里，Codex 的核心角色是 orchestrator，不是因为它比 DeepSeek 聪明，而是因为它能设计数据流水线、构建 Quality Gates、自进化优化生成规格——它在管理数据生产的质量系统，而不仅仅是生产数据。四、从边缘到生产线：部署已经在发生 SLM fine-tune 的部署门槛已经降到消费级硬件。Unsloth 2.0 支持 GGUF 量化导出 + llama.cpp / Ollama 部署。一个 3B 参数量化到 INT4 后占用 1.5-2GB 内存，可以在树莓派 5 上运行。Qwen 3-0.6B 在 Pixel 8 和 iPhone 15 Pro 上实现约 40 tokens/s 推理。 2024 年你可能需要一台 A100 才能跑 fine-tune。2026 年你可以在 Colab 上训练、在手机上推理。这种从「数据中心才能碰」到「个人设备上运行」的变化，是 Platform Shift 级别的。而 NVIDIA 在推动的「Data Flywheel Blueprint」——用生产流量持续生成训练数据，自动 fine-tune，自动部署——正在把这个流程变成企业基础设施。NVInfo AI 用这个方案在三个月内把内部知识助手的质量提升到覆盖 30,000 名员工。五、成本轨迹：从「公司预算」到「午饭钱」 CJ 的公开成本记录是最好的注脚： | 日期 | 项目 | 成本 | |------|------|------| | 05-13 | 1 亿+参数数据集生成 | ~$80 | | 05-27 | Qwen 3.5 4B 完整 fine-tune | ~$173 | | 05-28 | Qwen 3.5 tool calling for voice agent | $11 | $11。这在硅谷大概是一顿午饭。在印度、东南亚、非洲，是一个独立开发者完全负担得起的预算。fine-tune 已经从「大公司才有资格做的研发项目」变成了「个人开发者用午休时间能完成的 side project」。六、结论：学会训练你的第一个模型 CJ 给的学习路线图异常简单： 1. 从 1B-4B 小模型开始，不要一上来就搞大模型 2. 用 Colab Pro（A100 ≈ $0.60/hr），不要急着买 GPU 3. 先 fine-tune 7-10 个模型积累经验（SFT → LoRA/QLoRA → GRPO/DPO） 4. 用 Codex 做规划，DeepSeek 做数据生成 5. 理解量化（GGUF）、本地推理（llama.cpp）、KV 缓存他甚至在 5 月 23 日直接发布了一个「复制粘贴就能用」的 prompt——把整个学习路径封装成一段对话，扔进 Codex 或 ChatGPT，让 AI 带你从 beginner 到 advanced。这就是 2026 年 AI 技能民主化的真实面貌：不是每个人都需要学会训练模型。但学会训练小模型的人，不再需要 10 人团队和 10 万美元预算。通用大模型的军备竞赛还在继续。GPT-5、Claude Opus 5、Gemini 3 会越来越强。但越来越多的真实场景正在证明：一个花了 $173 训练的 4B 小模型，在你自己的数据上，比你花 $200/月订阅的通用大模型更好用。不是因为小模型更聪明。是因为它只做了你需要的那一件事。参考来源：CJ Zafir X 推文系列 (2026.05.11-05.28)；AKIRAXCLAW 中文分析；Alibaba Qwen 3.5 benchmark (VentureBeat)；Unsloth 2.0 文档；NVIDIA Data Flywheel Blueprint；AT&T / EliseAI / Knowunity / https://t.co/7r0fApoHPJ 企业案例；Manicode / Arendil 独立 benchmark；SLM Market Report 2024-2026

342

94K

LingjunCapital retweeted

✧ 𝕀𝔸𝕄𝔸𝕀 ✧

@iamai_omni

5 days ago

看到任何行业新闻，都可以试试和这篇文章一起贴给 AI，让它帮你寻找市场的 alpha，人类自身的认知和智能在 AI 面前太弱了，不用外挂大脑越来越难赚钱。

173

306

59K

LingjunCapital retweeted

Hedgeye

@Hedgeye

6 days ago

Cash flow no longer covers the AI capex bill, so hyperscalers are funding it with record debt: Hyperscaler bond issuance has soared to $150 billion YTD, more than the prior two years combined.

Hedgeye's tweet photo. Cash flow no longer covers the AI capex bill, so hyperscalers are funding it with record debt:

Hyperscaler bond issuance has soared to $150 billion YTD, more than the prior two years combined. https://t.co/tcM6XbzaVp

639

174

225

97K

LingjunCapital retweeted

Goshawk Trades

@GoshawkTrades

6 days ago

Dan Loeb started Third Point with $3 million. it's now $24 billion. when asked which books shaped how he invests, he named four, each one representing a different phase of his evolution as an investor: phase 1. event-driven deep value: "You Can Be a Stock Market Genius" by Joel Greenblatt. "most of the people i know in that world use that as their framework." spin-offs, privatizations, post-reorganization equities. buying things cheap that nobody else was doing the work on. ....

375

634

58K

LingjunCapital retweeted

Kathryn Wu

@kathrynwu1

6 days ago

One thing that feels different between the US and East Asia: In a lot of East Asian systems, your trajectory gets determined very early. People who perform well academically early tend to stay ahead in very linear ways. In the US, the variance feels much wider. You see people peak later, reinvent themselves, switch paths entirely, start companies, become creators, unexpectedly win. Maybe that’s part of the American Dream mindset: the belief that your future doesn’t have to be fully decided at 15.

177

299

123K

LingjunCapital retweeted

AYi

@AYi_AInotes

6 days ago

史上最贵的一觉：CEO醒了，欠了Anthropic 5亿美元

271

105

291K

LingjunCapital retweeted

Jeff Weniger

@JeffWeniger

6 days ago

This market has left PDD, Alibaba and JD for dead, with a total market capitalization across all three down to 15.5% of Amazon's market cap. They also got this low on January 10, 2025, at which point Chinese stocks proceeded to destroy the S&P 500.

JeffWeniger's tweet photo. This market has left PDD, Alibaba and JD for dead, with a total market capitalization across all three down to 15.5% of Amazon's market cap. They also got this low on January 10, 2025, at which point Chinese stocks proceeded to destroy the S&P 500. https://t.co/UpBKajYSum

234

38K

LingjunCapital retweeted

Thierry from arvy 🇨🇭

@ThierryBorgeat

7 days ago

In the last 30 days alone: – Microsoft cancelled most of its Claude Code licenses, citing cost – Uber burned through its entire 2026 AI budget in 4 months – Uber's COO publicly said AI costs are "harder to justify" – A Fortune 20 CEO ordered token spending to be "dramatically slashed" – One company spent $500M in a single month on Claude with no usage limits – H200 rental prices crashed from $7/hr to $4/hr in three weeks The stocks: still at all-time highs. How long can the market keep ignoring the receipts?

ThierryBorgeat's tweet photo. In the last 30 days alone:

– Microsoft cancelled most of its Claude Code licenses, citing cost
– Uber burned through its entire 2026 AI budget in 4 months
– Uber's COO publicly said AI costs are "harder to justify"
– A Fortune 20 CEO ordered token spending to be "dramatically slashed"
– One company spent $500M in a single month on Claude with no usage limits
– H200 rental prices crashed from $7/hr to $4/hr in three weeks

The stocks: still at all-time highs.

How long can the market keep ignoring the receipts?

428

565

275K

LingjunCapital retweeted

Polymarket

@Polymarket

7 days ago

NEW: Amazon has reportedly scrapped its internal AI leaderboard as costs soared, with a senior executive telling staff: “don’t use AI just for the sake of using AI.”

458

21K

LingjunCapital retweeted

a16z @a16z

8 days ago

OpenAI and Anthropic are effectively telling the market they can't solve every problem with a generic AI coworker. You don't pour billions into massive forward-deployed joint ventures if you think the next model release is going to take care of it. In the cloud supercycle, semis led and software followed (and you didn't need Qualcomm or ARM to tell you the value was migrating up the stack). In AI, the infra layer itself is telling us the application layer is a separate, massive opportunity they can't fully capture. a16z's @joeschmidtiv on why the app layer isn't dead: https://t.co/84QN5Mj9T3

a16z's tweet photo. OpenAI and Anthropic are effectively telling the market they can't solve every problem with a generic AI coworker.

You don't pour billions into massive forward-deployed joint ventures if you think the next model release is going to take care of it.

In the cloud supercycle, semis led and software followed (and you didn't need Qualcomm or ARM to tell you the value was migrating up the stack).

In AI, the infra layer itself is telling us the application layer is a separate, massive opportunity they can't fully capture.

a16z's @joeschmidtiv on why the app layer isn't dead: https://t.co/84QN5Mj9T3

182

420

LingjunCapital retweeted

Thierry from arvy 🇨🇭

@ThierryBorgeat

7 days ago

The price to rent an Nvidia H200 just collapsed from $7/hr to $4/hr in three weeks. A -40% drop in the cost of the single most strategic asset in tech. When the underlying commodity that powers your entire thesis loses 40% of its value in a month, that usually means one of two things: supply finally caught up, or demand was never as deep as the headlines said. Either way, somebody is selling. So why is the AI trade still pricing in scarcity?

ThierryBorgeat's tweet photo. The price to rent an Nvidia H200 just collapsed from $7/hr to $4/hr in three weeks.

A -40% drop in the cost of the single most strategic asset in tech.

When the underlying commodity that powers your entire thesis loses 40% of its value in a month, that usually means one of two things: supply finally caught up, or demand was never as deep as the headlines said.

Either way, somebody is selling.

So why is the AI trade still pricing in scarcity?

298

598

LingjunCapital retweeted

Dividendology

@dividendology

9 days ago

Think valuations look crazy right now? This is what they looked like during the 'Dot Com' bubble:

699

106

239

101K

LingjunCapital retweeted

Thierry from arvy 🇨🇭

@ThierryBorgeat

9 days ago

- Semis are 22% of $SPX - Big AI is 40% - The 41 AI-stocks in the $SPX are 61% So, yes, the biggest bubble concentration ever - railroads - is challenged today.

281

38K

Lingjun_capital

@LingjunCapital

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users