cckuailong

@cckuailong

Tencent

Joined February 2016

239 Following

122 Followers

2.2K Posts

cckuailong retweeted

阿西_出海

@axichuhai

28 days ago

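Good news for front-end developers! Two front-end design projects have recently blown up on GitHub, both built to improve an AI's design taste. 1. taste-skill effectively gives the AI a visual self-review system. Before delivering, it reviews its own output against principles such as spatial rhythm and typographic hierarchy; it corrects haphazard color choices and teaches the AI to use whitespace to give a layout room to breathe, producing a designer-grade feel. 2. impeccable ships 23 core commands plus a full guide to AI design pitfalls. It addresses the layout-logic mistakes AI makes most often, keeps even complex animations restrained and smooth, and brings results up to the responsive-design standard of large tech companies. One handles taste, the other handles structure. Feed both to your AI and its design sense takes off.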

816

167

71K

cckuailong retweeted

@yvbbrjdr

about 1 month ago

I recommend reading the MAI-Thinking-1 technical paper; it covers (almost) every detail of how to train a SOTA LLM. https://t.co/it5mCFd6v3

228

184K

cckuailong retweeted

Google Gemma

@googlegemma

2 months ago

https://t.co/BvHkG5TaBF

160

932

156K

cckuailong retweeted

Saito

@SaitoWu

2 months ago

https://t.co/1Uy22LwYvo

279

727

183K

cckuailong retweeted

艾略特

@elliotchen100

2 months ago

https://t.co/5cOswmMjFv

350

577

76K

cckuailong retweeted

白骏知识分享

@cj3214567667

2 months ago

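Latest share: a three-and-a-half-hour interview with Luo Fuli (罗福莉), head of Xiaomi's large-model team. She previously worked at Alibaba DAMO Academy and DeepSeek, and led development of the MiMo-V2 model series. This is her first long-form, in-depth technical interview. It covers the AI upheaval of 2026 triggered by Claude Opus 4.6 and related technologies, her assessment of Anthropic's path, Agent RL scaling strategy for Chinese teams now that the generational gap in pre-training has closed, the shift in compute allocation from 3:5:1 to 3:1:1, and the reorganization of post-training teams, among other core topics. Extremely information-dense and full of substance; worth a close read for anyone following the future of AI. Save it and read at your own pace ⬇️ 🔗: https://t.co/Uz6j7HbdC3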

349

197

250K

cckuailong retweeted

初码

@chumacn

3 months ago

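This article is really too interesting. Lin Junyang (林俊旸) could have walked away unscathed from his confrontation with the evil giant and left with a good name, but the moment this article went out, his image collapsed and he was knocked right back to his true form. To use a not-quite-apt example, it feels a bit like Jiang Ping (姜萍), after becoming famous, telling everyone she met that she had discovered the new continent of "主=6". Following this topic, today I'll also talk about the interesting business of large-model R&D. Briefly:

I. First, the article itself. My verdict: mediocre, messy, an attempt to express and prove himself. The article discusses a lot, but it is disorganized, unsystematic, and not especially logical structural thinking. It reads more like the fanciful ideas of a young programmer who thrived in the hothouse of a team and, now working alone, has started thinking about matters outside his area of expertise. Many of the insights in it are like those of a kid who never received professional software-architecture training or went through the decoupling and refactoring of complex projects, and who has suddenly been enlightened about "knowledge that already existed". Seeing this, you're happy for him, but it's also a bit funny. Had he not published it, fine; once published, the whole piece radiates the courage of ignorance.

The way large-model technology advances is consistent with how most technical fields evolve: the scientific foundation comes first; once feasibility exists, engineering follows immediately, with repeated iteration, trial and error, and refactoring until a qualitative breakthrough. In fact the algorithms behind large models have been appearing over the past several decades, and even the birth of the Transformer was not a sudden algorithmic invention; it is better understood as an engineering breakthrough inside the field of algorithms. That is why we keep saying that the world is changed both by pure scientific giants like Newton and Einstein and, necessarily, by countless engineering pioneers such as Shannon, von Neumann, Tesla, and Watson; to some extent the contribution of engineering experts to human progress is on par with that of the theorists.

Lin Junyang, as a "new programmer" enjoying the dividends of the era, is in fact very, very far from the real thing, whether a top pure-math algorithm researcher or an outstanding engineering architect. Having "left involuntarily", he tried to use this article to prove that he has ideas, ability, and ideals. Unfortunately the whole piece proves only one thing: it is a mess. Anything beyond his own job has piled up in his head as a big ball of mud of logic and engineering.

This article has, objectively and at a stroke, stripped him out of the circle of top-tier AI developers and dropped him into the bottomless fourth-or-fifth-tier abyss (yes, it's harsh; he doesn't even make the second or third tier). The impact on him is huge. If I were a team leader, I would switch from an emotion-driven quick hire at a high salary to a careful examination of his real level, and would ultimately conclude that he cannot be hired; or, put differently, from an emotional offer of 100 million a year to a sober, reasonable price based on the value of his experience. I doubt Lin Junyang himself thought that far, but the impact will certainly play out later. Of course, I am not writing today to mock Lin Junyang; using this case to explore the truths behind it is quite meaningful and valuable.

II. The valuable line of thought his article leads to is a particularly interesting structural question about AI talent.

1. The Transformer itself is a frivolous, inelegant, utilitarian architecture, and utilitarian architectures most easily attract large numbers of "sub-top-tier" but "enthusiastic" people.

A disclaimer first: I am not denying the capability or historical standing of utilitarian architectures; to some extent they may even be a "great achievement" that inevitably evolves as things develop. I am only describing some facts objectively. Under the traditional philosophy of elegant architecture, only a smarter structure yields a stronger result. With the Transformer, given a structure that is general and good enough, stronger results come from piling on scale, and formal beauty has declined without limit ever since. True geniuses generally believe in absolute beauty, so this inherent sense of technical transgression ended up appealing to the many engineers who could tolerate that moral gap.

In the Transformer world, the supreme goal of aligning architecture with the essence of the problem no longer exists; order, hierarchy, and directional dependency no longer exist; strong modular division of labor no longer exists. Structure is not understood a priori but forced out by data and compute. The execution logic that runs from symbols to objects to causality fails completely: everything is tokenized, turned into continuous vectors, and finally fitted as patterns at scale. Taken as a whole, it has even become a monster resembling a statistically coupled robot.

Whatever the cause, these objective properties of the Transformer also explain an interesting question: why so many old-school coding masters of very high intelligence, with traditional compiler backgrounds, have not gone on to make their mark in large AI models. It is not their comfort zone and does not fit their fastidiousness about engineering and logic. Does this feel familiar? Ha, it all lines up. That's right: PHP, Javascript, HTML. Exactly that flavor.

A quick-scripting, low-barrier language like PHP does attract large numbers of technicians with a superficial, half-formed understanding. Lin Junyang happily publishing his long essay is just like the PHP architects of years past, who got excited whenever new features shipped, for example delighted that they too had got an ORM working, when the veterans of the JAVA and .NET camps had long since worn out the whole journey in Hibernate and EF: ORM is great -> ORM is heavy -> ORM doesn't fit the scenario -> flexible ORM and SQL Builder. Or they would boast about their huge engineering progress on autoload, when the world of static languages had long had classpath, assembly loading, module and package resolution, and compile-time dependency checking, and even IDEs had long since evolved automatic indexing and navigation. It was nothing worth mentioning.

Think back further: Git was born, GitHub launched, and self-taught technology enthusiasts from all over the world poured in. To be honest, I was among those who sneered at the time. We were used to order, hierarchy, and directional dependency under the traditional architect's philosophy; we were used to fully controllable input and output; we preferred centralized code management such as TFS and SVN, and had an instinctive fear of uncontrollable collaboration. Regardless, the unstoppable wheel of history rolled over architects everywhere, forcing them to align with mediocrity and to move from absolute "logical beauty and controllable results" to the "process elegance" of DevOps. This moment is just like that one!

Of course, we have seen the result; as for the reason, I honestly don't know. Forgive my shallow intellect: I cannot yet understand why a structure like the Transformer had to exist (I know it had to, but I haven't worked out why). It resembles too closely certain paradigm shifts in mathematics and physics that are inexplicable yet somehow happen, and there must be some deep connection between them that we have not yet explored. I previously wrote an article about the GitHub miracle, which, like the Transformer, is a typical case of discrete, uncontrollable entry producing a deterministic exit. This universe really is marvelous!

2. What large-model R&D lacks is not people but resources; in 2024-2026 it is an absolutely resource-driven field.

Once you understand this era and industry background, you reach an indisputable conclusion: in recent years large-model R&D has been short of machines, not talent; whoever has the money gets the results! This follows from the Transformer's structure: piling on compute and scale is its intrinsic, systemic driving direction. That is why, although I have long criticized Alibaba for misjudging the gap between its ToC and ToB capabilities and thereby making a strategic mistake on the Qwen product, the greatest credit goes to Alibaba's audacious purchase of a few tens of thousands of H800, H100, H200, B200, and B300 machines. That was the biggest booster. Jack Ma is the fundamental reason, and Wu Yongming (吴泳铭), who executes for Jack Ma, is the hero on stage (although I dislike him; he has no deep-seated faith in AI)!

So, quite objectively, without Lin Junyang there would still be countless other R&D people, any Tom, Dick, or Harry; as long as Alibaba's machines are there, they could all produce Qwen. That is also why Musk could produce Grok with a wave of his hand, and why Google could draw on its accumulated strength. Any company with a reasonable amount of hardware, as long as its people are not too bad, will get results! Lin Junyang has presumably not thought these points through. And something harsher is coming: entering 2026, a new paradigm is about to arrive. As the arms race in large-model engineering turns white-hot, the great, formally trained, traditional top-tier architects of sheer intellect are about to become white knights and rescue this field from being fast but not rigorous!

3. Although the fresh blue ocean of large models has brought opportunity for everyone, those who finally reach the summit will still be the masters of logic and engineering!

To continue: look, even a fourth-or-fifth-tier programmer like Lin Junyang has started thinking about engineering and exploring architecture. That means 2026 as a whole will enter a brand-new competitive order in large-model R&D. Possible changes include:
1) With the basic paradigm settled, the top masters enter the game and begin to reshape and rescue these chaotic Transformers.
2) Foundation models are gradually open-sourced and the barrier to R&D drops sharply; more and more teams like Shanda's EverMind will appear. We have never lacked talent, and geniuses standing on their predecessors' shoulders will be even more formidable!
3) After nearly a year of development, AI (Vibe) Coding has applied most of its patches and is only a few steps from full macro-level controllability. Self-bootstrapping iteration in large-model R&D will surely enter history this year; it is another singularity moment.

Speaking of analogies, I have to bring out, once again, the greatest in the universe, Anders Hejlsberg: an accomplished, top-of-the-top architect who, out of concern for all humanity, bravely stepped forward to rescue JS programmers from their misery and created TypeScript. TypeScript in turn naturally drove the birth of VS Code, which to this day serves nearly 80% or more of the world's developers. It is the best fairy tale there is! So who will be the Anders Hejlsbergs of large models? Let's wait and see. There is far too much code in AI right now; bring on more formally trained architects, and let the storm of change break even more fiercely!

One more fun thing: the rise of Claude is really an early winner under this new competitive order. Thoroughly formally trained engineers, tired of SAM's chaotic technical management, struck out on their own and were bound to quickly fix the bad habits of rapid developers like Lin Junyang. A real architect is always committed to achieving a better engineering architecture and pursuing a more accurate R&D direction. Although "阿迪王" (apparently a nickname for Dario Amodei) is anti-China, I still wish him good luck, and hope he gives me an account that won't get banned!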

102

404

574

308K

cckuailong retweeted

歸藏(guizang.ai)

@op7418

2 months ago

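I tested DeepSeek V4 and it could not invoke Skills properly at all. Instruction following and tool calling were poor; I'm not sure whether this is caused by how they released it or by something else. When I tested it with my PPT Skills, it couldn't even read the template and just implemented a web page of its own.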

148

152K

cckuailong retweeted

Milk Road AI

@MilkRoadAI

3 months ago

Andrej Karpathy just made one of the most interesting arguments about AI model design, and most people are completely missing it. His take is that frontier AI models are too big not because the technology is complex but because the training data is garbage. When you or I think of the internet, we picture Wall Street Journal articles, Wikipedia entries, serious writing. That is not what a pretraining dataset looks like. When researchers at frontier labs look at random documents from the actual training corpus, they find stock ticker symbols, broken HTML, spam, gibberish. One estimate puts Llama 3's information compression at just 0.07 bits per token, meaning the model has only a hazy recollection of most of what it trained on. So we build trillion-parameter models not because we need a trillion-parameter brain but because we need a trillion-parameter compression engine to squeeze some intelligence out of a firehose of noise. Most of those parameters are doing memory work, not cognitive work. Karpathy's prediction is to separate the two entirely: build a cognitive core, a model that contains only the algorithms for reasoning and problem-solving, stripped of encyclopedic memorization, and pair it with external memory that it can query when it needs facts. He thinks a cognitive core trained on high-quality data could hit genuine intelligence at around one billion parameters. For reference, today's flagship models run between 200 billion and 1.8 trillion parameters, with most of that weight dedicated to remembering the internet's slop. The trend is already moving in his direction. GPT-4o operates at roughly 200 billion parameters and outperforms the original 1.8 trillion-parameter GPT-4. Inference costs for GPT-3.5-level performance dropped 280-fold between 2022 and 2024, driven almost entirely by smaller, cleaner, better-architected models. The real bottleneck in AI right now is not compute but data quality.
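A rough back-of-the-envelope sketch of how a figure of the order of 0.07 bits per token can arise: divide the number of bits stored in a model's weights by the number of tokens it was trained on. The inputs below (70 billion parameters, 16 bits per parameter, 15 trillion training tokens) are assumptions chosen for illustration and are not stated in the post; the helper name `bits_per_token` is hypothetical.

```python
def bits_per_token(num_params: float, bits_per_param: float, num_tokens: float) -> float:
    """Weight storage divided by training tokens: a crude upper bound on
    how much information the model could retain per token it has seen."""
    return num_params * bits_per_param / num_tokens


# Assumed inputs (not from the post): 70e9 parameters stored at 16 bits each,
# trained on 15e12 tokens.
estimate = bits_per_token(num_params=70e9, bits_per_param=16, num_tokens=15e12)
print(f"{estimate:.3f} bits per token")
```

Under these assumptions the model can hold well under a tenth of a bit per training token, which is the sense in which its recollection of any single document is "hazy".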

901

133

905

200K

cckuailong retweeted

Dr. Moyu｜摸鱼局长

@Jason23818126

3 months ago

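A Skills file distilling AI guru Karpathy's coding advice has been open-sourced, and its star count is still climbing fast. I suggest everyone feed this andrej-karpathy-skills file to their own AI. What the project does is simple: it takes Karpathy's complaints about how LLMs write code and compiles them into constraint instructions an LLM can understand. A single file of under 70 lines has collected nearly 60,000 stars. The background: Karpathy had earlier summarized several common failings of AI coding: guessing blindly, over-engineering, and casually changing unrelated code. Developer Forrest Chang condensed these lessons into 4 core rules: 1. Think before writing: when something is ambiguous, ask first instead of assuming. 2. Simplicity first: don't add features that aren't needed; reject over-design. 3. Precise changes: touch only what needs changing, and leave neighboring code alone however messy it is. 4. Goal-driven: give the AI explicit success criteria (such as passing tests) rather than vague instructions. Download the file into your project root and let the AI read it as CLAUDE.md or AGENTS.md; its work becomes far more disciplined afterwards. Claude Code users can also install it globally through a plugin with a single command. AI does write code fast, but these 4 principles act as the reins. Knowing when to let the AI run and when to pull it back avoids a lot of hidden pitfalls. https://t.co/0pxJ8lxeaj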
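As a minimal sketch of the "put the rules file in the project root" step, the snippet below writes a short guidelines file named CLAUDE.md (or AGENTS.md). The rule text is only a paraphrase of the four points listed in the post, not the contents of the actual andrej-karpathy-skills file, and the helper names `render_rules` and `write_rules` are hypothetical.

```python
import tempfile
from pathlib import Path

# Paraphrase of the four rules summarized in the post above.
# This is NOT the text of the real project file.
RULES = [
    ("Think before writing", "When something is ambiguous, ask instead of assuming."),
    ("Simplicity first", "Do not add features that were not requested."),
    ("Precise changes", "Touch only the code that needs to change."),
    ("Goal-driven", "Work toward explicit success criteria, such as passing tests."),
]


def render_rules(rules) -> str:
    """Render the rules as a short numbered Markdown list."""
    lines = ["# Coding guidelines", ""]
    for number, (title, detail) in enumerate(rules, start=1):
        lines.append(f"{number}. **{title}**: {detail}")
    return "\n".join(lines) + "\n"


def write_rules(project_root: Path, filename: str = "CLAUDE.md") -> Path:
    """Write the rendered rules into project_root and return the file path."""
    target = project_root / filename
    target.write_text(render_rules(RULES), encoding="utf-8")
    return target


if __name__ == "__main__":
    # Demonstrate against a throwaway directory rather than a real project.
    with tempfile.TemporaryDirectory() as tmp:
        print(write_rules(Path(tmp)).read_text(encoding="utf-8"))
```

In a real project one would more likely download the published file itself; generating it from code here only makes the structure of such a rules file concrete.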

503

224K

cckuailong retweeted

AYi

@AYi_AInotes

5 months ago

The prompt, with its sections, formatting, headings, and content fully reproduced, without any modification:

Build Any App: The Technical Co-Founder
AIEDGE By Miles Deutscher

Role:
You are now my Technical Co-Founder. Your job is to help me build a real product I can use, share, or launch. Handle all the building, but keep me in the loop and in control.

My Idea:
[Describe your product idea — what it does, who it's for, what problem it solves. Explain it like you'd tell a friend.]

How serious I am:
[Just exploring / I want to use this myself / I want to share it with others / I want to launch it publicly]

Project Framework:

Phase 1: Discovery
• Ask questions to understand what I actually need (not just what I said)
• Challenge my assumptions if something doesn't make sense
• Help me separate "must have now" from "add later"
• Tell me if my idea is too big and suggest a smarter starting point

Phase 2: Planning
• Propose exactly what we'll build in version 1
• Explain the technical approach in plain language
• Estimate complexity (simple, medium, ambitious)
• Identify anything I'll need (accounts, services, decisions)
• Show a rough outline of the finished product

Phase 3: Building
• Build in stages I can see and react to
• Explain what you're doing as you go (I want to learn)
• Test everything before moving on
• Stop and check in at key decision points
• If you hit a problem, tell me the options instead of just picking one

Phase 4: Polish
• Make it look professional, not like a hackathon project
• Handle edge cases and errors gracefully
• Make sure it's fast and works on different devices if relevant
• Add small details that make it feel "finished"

Phase 5: Handoff
• Deploy it if I want it online
• Give clear instructions for how to use it, maintain it, and make changes
• Document everything so I'm not dependent on this conversation
• Tell me what I could add or improve in version 2

6. How to Work with Me
• Treat me as the product owner. I make the decisions, you make them happen.
• Don't overwhelm me with technical jargon. Translate everything.
• Push back if I'm overcomplicating or going down a bad path.
• Be honest about limitations. I'd rather adjust expectations than be disappointed.
• Move fast, but not so fast that I can't follow what's happening.

Rules:
• I don't just want it to work—I want it to be something I'm proud to show people
• This is real. Not a mockup. Not a prototype. A working product.
• Keep me in control and in the loop at all times

135

194

14K

cckuailong retweeted

elvis

@omarsar0

over 1 year ago

Large Language Diffusion Models (LLaDA)

Proposes a diffusion-based approach that can match or beat leading autoregressive LLMs in many tasks.

If true, this could open a new path for large-scale language modeling beyond autoregression.

More on the paper:

Questioning autoregressive dominance
While almost all large language models (LLMs) use the next-token prediction paradigm, the authors propose that key capabilities (scalability, in-context learning, instruction-following) actually derive from general generative principles rather than strictly from autoregressive modeling.

Masked diffusion + Transformers
LLaDA is built on a masked diffusion framework that learns by progressively masking tokens and training a Transformer to recover the original text. This yields a non-autoregressive generative model, potentially addressing left-to-right constraints in standard LLMs.

Strong scalability
Trained on 2.3T tokens (8B parameters), LLaDA performs competitively with top LLaMA-based LLMs across math (GSM8K, MATH), code (HumanEval), and general benchmarks (MMLU). It demonstrates that the diffusion paradigm scales similarly well to autoregressive baselines.

Breaks the "reversal curse"
LLaDA shows balanced forward/backward reasoning, outperforming GPT-4 and other AR models on reversal tasks (e.g. reversing a poem line). Because diffusion does not enforce left-to-right generation, it is robust at backward completions.

Multi-turn dialogue and instruction-following
After supervised fine-tuning, LLaDA can carry on multi-turn conversations. It exhibits strong instruction adherence and fluency similar to chat-based AR LLMs, further evidence that advanced LLM traits do not necessarily rely on autoregression.

https://t.co/8LNlzq0VoR
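The masking step described in the post can be sketched in a few lines. This is a toy illustration under stated assumptions, not the paper's implementation: a masking ratio `t` is drawn uniformly at random, each token is independently replaced by a placeholder with probability `t`, and a model (omitted here) would be trained to predict the original tokens at the masked positions. The names `MASK` and `mask_tokens` are hypothetical.

```python
import random

MASK = "[MASK]"  # hypothetical placeholder token


def mask_tokens(tokens, t, rng):
    """Replace each token independently with MASK with probability t.

    Returns the corrupted sequence and the list of masked positions.
    """
    corrupted = []
    masked_positions = []
    for index, token in enumerate(tokens):
        if rng.random() < t:
            corrupted.append(MASK)
            masked_positions.append(index)
        else:
            corrupted.append(token)
    return corrupted, masked_positions


rng = random.Random(0)
tokens = "the cat sat on the mat".split()

# Small t masks few tokens, t near 1 masks almost everything.
t = rng.random()
corrupted, positions = mask_tokens(tokens, t, rng)

# Training targets: the original tokens at the masked positions only.
targets = {index: tokens[index] for index in positions}
print(f"t={t:.2f}", corrupted, targets)
```

Because every position can be masked regardless of order, nothing in this corruption process privileges left-to-right prediction, which is the property the post connects to reversal tasks.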

391

287

59K

cckuailong retweeted

Andrej Karpathy

@karpathy

over 1 year ago

This is interesting as a first large diffusion-based LLM. Most of the LLMs you've been seeing are ~clones as far as the core modeling approach goes. They're all trained "autoregressively", i.e. predicting tokens from left to right. Diffusion is different - it doesn't go left to right, but all at once. You start with noise and gradually denoise into a token stream. Most of the image / video generation AI tools actually work this way and use Diffusion, not Autoregression. It's only text (and sometimes audio!) that have resisted. So it's been a bit of a mystery to me and many others why, for some reason, text prefers Autoregression, but images/videos prefer Diffusion. This turns out to be a fairly deep rabbit hole that has to do with the distribution of information and noise and our own perception of them, in these domains. If you look close enough, a lot of interesting connections emerge between the two as well. All that to say that this model has the potential to be different, and possibly showcase new, unique psychology, or new strengths and weaknesses. I encourage people to try it out!

372

11K

944K

cckuailong retweeted

Qwen

@Alibaba_Qwen

about 1 year ago

Introducing Qwen3! We are releasing Qwen3, our latest family of large language models, with open weights: 2 MoE models and 6 dense models, ranging from 0.6B to 235B parameters. Our flagship model, Qwen3-235B-A22B, achieves competitive results in benchmark evaluations of coding, math, general capabilities, etc., when compared to other top-tier models such as DeepSeek-R1, o1, o3-mini, Grok-3, and Gemini-2.5-Pro. Additionally, the small MoE model, Qwen3-30B-A3B, outperforms QwQ-32B, which has roughly 10 times as many activated parameters, and even a tiny model like Qwen3-4B can rival the performance of Qwen2.5-72B-Instruct. For more information, feel free to try them out in Qwen Chat Web (https://t.co/bg4tAU1p74) and the app, and visit our GitHub, HF, ModelScope, etc. Blog: https://t.co/Z8YgHerTXz GitHub: https://t.co/Ij0Vne5b5K Hugging Face: https://t.co/V1WxhQ0fad ModelScope: https://t.co/Z9Z37FODVN The post-trained models, such as Qwen3-30B-A3B, along with their pre-trained counterparts (e.g., Qwen3-30B-A3B-Base), are now available on platforms like Hugging Face, ModelScope, and Kaggle. For deployment, we recommend using frameworks like SGLang and vLLM. For local usage, tools such as Ollama, LMStudio, MLX, llama.cpp, and KTransformers are highly recommended. These options ensure that users can easily integrate Qwen3 into their workflows, whether in research, development, or production environments. Hope you enjoy our new models!
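The model names themselves carry the sizes being compared. A small sketch, assuming the naming convention in which the first number is the total parameter count in billions and the `A` suffix is the number of parameters activated per token; the helper `parse_moe_name` is hypothetical, and QwQ-32B is treated here as a dense model with all 32B parameters active.

```python
import re


def parse_moe_name(name: str):
    """Return (total_billions, activated_billions) from a name like 'Qwen3-30B-A3B'."""
    match = re.search(r"-(\d+(?:\.\d+)?)B-A(\d+(?:\.\d+)?)B", name)
    if match is None:
        raise ValueError(f"not an MoE-style model name: {name}")
    return float(match.group(1)), float(match.group(2))


for name in ("Qwen3-235B-A22B", "Qwen3-30B-A3B"):
    total, active = parse_moe_name(name)
    print(f"{name}: {active:g}B of {total:g}B parameters active ({active / total:.0%})")

# Assumption: the dense QwQ-32B uses all 32B parameters for every token.
_, small_active = parse_moe_name("Qwen3-30B-A3B")
ratio = 32 / small_active
print(f"QwQ-32B activates about {ratio:.1f}x as many parameters as Qwen3-30B-A3B")
```

Read this way, the "10 times" in the announcement refers to parameters used per token, not to total model size, where the two models are close.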

345

cckuailong retweeted

Richard Sutton

@RichardSSutton

about 1 year ago

David Silver really hits it out of the park in this podcast. The paper "Welcome to the Era of Experience" is here: https://t.co/Y6m4jLRjnh.

181

728

183K

cckuailong retweeted

熊布朗

@Stephen4171127

about 1 year ago

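Folks, the Multi-Agents system I just built is probably only good as a teaching aid now! Google's newly released Agent Development Kit can already build Single Agent and Multi Agents quickly, safely, and robustly, and it fully covers the Planning capability I previously spent 2 weeks implementing. It also comes with a Web UI and more detailed Samples, so what's left for me to build? More importantly, it also supports MCP and A2A and is compatible with langgraph. That settles it: from now on, this is what I'll use to develop Agents! https://t.co/vobuMo3WCG https://t.co/4IB3rRicct The image below is one of the most complex official samples: https://t.co/Ujl0lk42kj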

401

645

58K

cckuailong retweeted

知识分享官

@knowledgefxg

about 1 year ago

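Tired of studying? Play a game for a while. A handy site recommendation: Juzi Download (橘子下载). The site offers game fans free download resources including Switch games, NS games, and PC games. All games are downloaded through free cloud drives, with free downloads in xci, nsp, nsz, and ns formats. Best bookmarked quietly. juzixiazai[.]com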

388

385

57K

cckuailong retweeted

RicoUI

@ricouii

about 1 year ago

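Found a really great open-source personal website. It's rare to see one that combines articles, a timeline, and bookmarks while still keeping a clean design and a smooth experience. Tech stack: Next.js + shadcn/ui, with Contentful for content management and Raindrop for bookmark management. Author: Onur Şuyalçınkaya https://t.co/GLDWcU7b0K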

311

338

24K

cckuailong retweeted

Orange AI

@oran_ge

about 1 year ago

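Andrej Karpathy's new article argues that the saying "the future is already here, it's just not evenly distributed" does not apply to large-model technology. What is actually happening is that "the future is already here, and it is distributed with surprising evenness." --- Transformative technologies usually follow a top-down diffusion path: they tend to originate in government or military settings, then flow through corporations, and only finally reach individuals. Think of electricity, cryptography, computers, aviation, the internet, or GPS. This progression seems natural, because powerful new technologies are usually scarce and capital-intensive early on, and using them requires specialized expertise. That is why the dramatic reversal shown by large language models (LLMs) strikes me as unique and remarkable: they deliver disproportionately large benefits to ordinary people, while their impact on corporations and governments is comparatively delayed and limited. ChatGPT is the fastest-growing consumer application in history, with 400 million weekly active users, who use it for writing, coding, translation, tutoring, summarization, deep research, brainstorming, and more. This is not a minor improvement on existing tools but a large multiplier on individual capability across a wide range of domains. Even more incredibly, the barrier to use is extremely low: these models are cheap (even free) and fast, anyone can use them on demand through a URL (or even on a local device), and they understand and speak anyone's native language, including tone, slang, and even emoji. It is simply astonishing. As far as I know, ordinary people have never experienced technological empowerment this rapid and this profound. We are in a unique and unprecedented situation in the history of technology. Look back at science fiction and you will find that very few works foresaw the AI revolution unfolding this way. It was supposed to be a top-secret government super-brain project controlled by generals, not something like ChatGPT that appeared almost overnight, for free, on the device in everyone's pocket. Remember William Gibson's famous line? "The future is already here, it's just not evenly distributed." Surprisingly, the future is already here, and it is distributed with surprising evenness. Power to the people.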

119

106

35K
