CuiMao @cuimao - Twitter Profile

More of the iOS app loop, now inside Codex. The Build iOS Apps plugin lets Codex view and test your iOS app in the in-app browser, open SwiftUI previews, and hot reload edits without leaving Codex.

200

7K

524

4K

921K

10

20

0

5

11K

CuiMao retweeted

Primero @EsMonsieur

about 21 hours ago

もしもあの日ARMではなくてNVIDIAを買収していたら…

68

3K

142

137

417K

CuiMao

@CuiMao

about 5 hours ago

@StanTechAddict cool

0

2

0

3K

CuiMao

@CuiMao

about 7 hours ago

@hoshikihao 他们黄头发蓝眼睛对隐私很担心，说方块字的几乎无所谓罢了，

2

0

98

CuiMao

@CuiMao

about 8 hours ago

This is actually insane. Holy sh^t, what an invention. 😭

LM Studio @lmstudio

about 13 hours ago

Meet LM Studio's mobile app. Your local models, now in your pocket.

130

3K

429

1K

484K

5

27

0

12

12K

CuiMao

@CuiMao

about 7 hours ago

@dotey @geniusvczh 😭可我古法时代没学会咋整啊，还要再去穿越一次体验一遍吗哈哈

6

5

0

1

2K

CuiMao

@CuiMao

about 7 hours ago

@zjpeng94 @DarioAmodei just a joke！ hah😁😁

0

61

CuiMao

@CuiMao

about 18 hours ago

只要大家给我足够多的钱我就有百分百信心去收购anthroipic 真的只差钱了 @DarioAmodei

Frank Wang 玉伯

@lifesinger

about 18 hours ago

只要大家给我足够多的钱我就有百分百信心去收购 OpenAI 真的只差钱了 @sama

68

39

0

2

27K

45

26

0

9K

CuiMao

@CuiMao

about 7 hours ago

给我的感觉，就像这样😭😭

jason

@jxnlco

about 14 hours ago

insane ball knowledge in codex I just found out @wonforall has a skill called $kobe that spawns off 3 subagents to discuss / review his code, each of which is build to represent one of our principal engineers on tuned in on his past code reviews. I'm going to start doing this with @dkundel and @charlierguo for our docs...

22

286

5

217

23K

4

11

0

2

4K

CuiMao

@CuiMao

about 7 hours ago

@hoshikihao LLM并不是只能用来写代码而已，本地推理模型，远比你想象中的强大的多。

1

0

197

CuiMao

@CuiMao

about 8 hours ago

@GCsheng 只不过没有历史沉淀罢了，同样广告牌，香港的他有能如何解释呢。

3

16

1

0

2K

CuiMao

@CuiMao

about 8 hours ago

@GCsheng @UlrichFY 我来翻译一下姐妹们，现在我建议你回家和你们男朋友都去给我无理由发脾气，看看他的真实反应，如果他现在对你态度极差，那么以后有的你吃苦的！

1

0

272

CuiMao

@CuiMao

about 8 hours ago

@hsu_byron @imhaotian @jefffhj @du_peichao75719 @martin_ma_007 @YknZhu ，LMAO, these past few days I’ve become mutuals with so many people from the Imagine team that I low-key feel like I joined the team too. 😂😂😂

0

128

CuiMao

@CuiMao

about 8 hours ago

Omg I’m so happy my Imagine video made it into the show. It was literally only one second, but still hahaha.😃😃

Ethan He

@EthanHe_42

4 days ago

In @latentspacepod podcast, I shared my view on video generation, world models, LLMs, agents, continual learning and where the next frontier is. 1. Video models get most of their intelligence from language, not from video data. 2. Idea-to-code is fast now. The bottleneck is back to having enough compute to try every idea. 3. Iteration speed beats almost everything else in model development. 4. The next leap won't be a better video model. It'll be a video agent. 5. Diffusion will be the frontend of AGI, the LLM the backend. Generative UI will replace HTML/CSS: user intent straight to pixels. 6. Physical embodiment may become a tool a powerful AI picks up. Robotics may get solved by video-capable LLMs. 7. Continual learning may look like models that manage their own context, and even rewrite their own harness at test time. Thanks @swyx and @vibhuuuus for having me 🙏 https://t.co/mLuvbODJxA

22

359

38

211

114K

1

6

0

4K

CuiMao retweeted

Arena.ai

@arena

about 15 hours ago

Introducing Agent Arena: real-world agentic evals at scale. How do you evaluate agents doing actual work? We measure millions of live sessions where real users accomplish real tasks. On Arena, models now get web search, filesystem, and terminal tools to complete complex workflows: writing code, creating slide deck, researching the web, building apps, and analyzing documents. Every session produces rich signals. Users iterate with the agent turn-by-turn: approving, editing, correcting, praise or expressing frustration. The environment gives feedback too: shell errors, tool failures, recovery attempts, and more. Our leaderboard measures each model's agentic performance using causal inference across five signals: task success, steerability, error recovery, user praise vs. complaint, and tool hallucination. This leaderboard snapshot is built from 300K+ tasks, 2M+ tool calls, and 40M lines of code by agents. Top labs in Agent Arena: - #1 @OpenAI: GPT-5.5 (High) - #2 @AnthropicAI: Claude-Opus-4.7 (Thinking) - #3 @Zai_org: GLM-5.1 - #4 @GoogleDeepMind: Gemini-3.1-Pro - #5 @Kimi_Moonshot: Kimi-K2.6 More analysis in the thread, with the full technical blog below.