Yuhao Yang @itsyuhao - Twitter Profile

Pinned Tweet

over 1 year ago

🚀 Introducing Aria-UI – a cutting-edge grounding LMM for GUI agents with a lightning-fast 3.9B parameters activated backbone! 🌐 Try it yourself: https://t.co/bVV1XCqkGL 📄 Project page: https://t.co/FEjkBbqWpw 📂 Explore on GitHub: https://t.co/VIMGKVbti9

7

374

56

445

106K

Yuhao Yang

@itsyuhao

9 days ago

@KentonVarda So funny and btw I like CF Workers 😁

0

1

0

731

Yuhao Yang

@itsyuhao

11 days ago

Software 3.0: drive any hardware in secs 🙂‍↔️

Warren

@WarrenLau_

11 days ago

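I must be really bored. On a whim, I modified the monitor light bar I've used for almost 3 years. It is now integrated with Codex and has three main effects: 1. While the AI is thinking, it becomes a blue breathing light (it originally cycled through several colors, but that was too chaotic, so I switched to a single color, which is much more comfortable). 2. When human review is needed, it turns pink to signal that something needs review. 3. When the task is done, it turns a warm yellow that is easy on the eyes over long periods. No more staring at the computer: after dictating the input by voice and pressing Enter, I can walk away, happily do other things, and just watch for the light cues.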

126

876

68

429

169K

0

1

287

itsyuhao retweeted

Bowen Wang

@BowenWangNLP

12 days ago

RLVR has become the recipe for agentic post-training. But for Computer-Use Agents, the bottleneck is not the algorithm, it is the data. 🐌 🚀 We introduce CUA-Gym: a scalable, lightweight synthesis engine that turns arbitrary task queries into verifiable RLVR data for computer-use agents. The largest open CUA RLVR dataset to date: 🎯 32,122 verifiable RLVR tasks with programmatic setup scripts + rewards 🌐 110 environments: 16 desktop apps + 94 synthesized mock web apps 🏆 Qwen3.5-based CUA models trained with GSPO reach 72.6% on OSWorld-Verified and 56.6% on WebArena 📄 Paper: https://t.co/cdvHJPzgb1 🏠 Homepage: https://t.co/kvhaOQxNx7 🤗 Dataset: https://t.co/w5vOIRdchR 💻 Codebase: https://t.co/CcRlNTlS1c 🧩 Environments: https://t.co/fNZ6YAI8LD 🧵[1/6]

18

507

94

563

97K

Yuhao Yang

@itsyuhao

14 days ago

everyone building for agents should read 🫣

yan5xu

@yan5xu

15 days ago

https://t.co/6W06db6v5c

10

185

29

339

31K

2

27

2

46

13K

Yuhao Yang

@itsyuhao

16 days ago

CLI-Anything x nanobot @xubinrencs We made the first OPEN AppStore for agents!

nanobot

@nanobot_project

16 days ago

Your agent shouldn’t just chat about work. It should use the apps where work happens. We provide CLI Apps in nanobot via CLI-Anything. Install app adapters from Settings, mention them in chat, and let your agent use them safely. Available today as a source preview, and coming in the next release. This feels like a big step toward personal agents that actually do work.

3

54

6

27

28K

2

11

1

3

1K

Yuhao Yang

@itsyuhao

17 days ago

We got something even stronger than cuadriver!!

Bridge

@bridge_surf

17 days ago

https://t.co/iogcE5aNzc

9

620

37

856

87K

0

10

0

7

3K

Yuhao Yang

@itsyuhao

18 days ago

@DimitrisPapail yep that’s what your blog led us to rethink 👍👍

1

0

39

Yuhao Yang

@itsyuhao

19 days ago

remind me if anyone still cares about how multi-turn convs should be masked 🤔 last systematic review seems to be Instruction Tuning With Loss Over Instructions [Neurips'24]

Dimitris Papailiopoulos

@DimitrisPapail

20 days ago

https://t.co/n10GwfKYuY

55

989

129

1K

858K

2

3

0

3

2K

Yuhao Yang

@itsyuhao

21 days ago

Herdr is on fire 🔥🔥

Ding

@dingyi

22 days ago

These days I can't even be bothered to use tmux; using https://t.co/0Ki7Dlwk0k or https://t.co/dcu76qJ7ul directly is more convenient, and it suits lazy people.

103

743

61

1K

85K

0

2

0

555

itsyuhao retweeted

Junli Wang

@JunliWang2021

24 days ago

Digital agent learning needs massive rollouts. But digital agent rollouts are painfully slow due to heavy environments. 🐌 🚀 We introduce NanoRollout, a lightweight open infra (900 lines core code) for digital agent rollout at scale, validated with three workloads: 🏋️ Large batchsize (4K) SWE Agent RL -> surpasses DeepSWE-32B 🧪 250k+ distilled coding trajectories -> SOTA ≤32B open coding agent ⚡ Fast evaluation on coding/cua/unified agent -> finish Check our Blog: https://t.co/IBNqqbLqra

2

136

39

102

35K

itsyuhao retweeted

Jiayi Weng

@Trinkle23897

about 1 month ago

Codex grew programmatic policies with no neural nets: max score on Breakout, and SOTA-level scores on MuJoCo. Maybe heuristics were not too weak. Maybe they were just too expensive to maintain. Maybe it's the next paradigm. https://t.co/1ZaIneleuW

64

1K

234

1K

3M

Yuhao Yang

@itsyuhao

about 1 month ago

Okay this is called Software 3.0 now 😁

Yuhao Yang

@itsyuhao

about 2 months ago

Code-as-a-service is eating software from the bottom up. Anything that exists mainly to solve a narrow, repetitive workflow is vulnerable. Disk cleanup. File triage. Log analysis. Batch transforms. Data cleanup. Internal glue tools. I’m already doing these with Codex / Claude Code instead of dedicated apps. 😵 True or false: a lot of software isn’t a product moat, it’s just a temporary wrapper around a workflow that models can now execute directly

1

2

0

700

0

1

0

333

Yuhao Yang

@itsyuhao

about 1 month ago

@thoma_gu That means you’re using the real Opus 😄

0

1

0

220

itsyuhao retweeted

Tianyu Fan

@t1anyufan

about 1 month ago

https://t.co/OqN2ou8cEs

1

37

7

49

14K

Yuhao Yang

@itsyuhao

about 1 month ago

CLI-Anything × CLI-Hub v0.3.0 is out! A fun release for all of us.

With general-purpose agents plus CLI-Hub × CLI-Anything, we made one general agent handle complicated tasks that usually live in very different toolchains:
- a FreeCAD Curiosity-style rover
- a Blender orbital relay drone scene with motion
- a real 2026 game played through generated CLIs (shout out to @t1anyufan !!)

v0.3.0 brings together several pieces that made these demos easier to build, inspect, and share:
- meta preview bundles and trajectories
- updated skills and docs for agent usage
- more real-world complex software and services converted into agent-native forms

The common thread here is reachability. Once software becomes reachable to an agent, the agent can inspect it, control it, recover from mistakes, and gradually turn intent into artifacts.

CLI-Hub now includes 66 software harnesses across 30 categories, with more community contributions coming in. One fun signal from command usage: roughly 20% comes from humans, and 80% comes from agents.

A few takeaways from this round:

1. The CLI is only the transport layer.

Harness engineering is the elephant in the room: the hard and valuable work is turning existing software and services into agent-native surfaces that can be inspected, controlled, previewed, and recovered from.

2. Better harnesses improve agents without retraining them.

A good harness is like a bridge to a new world. The agent does not need a new brain for every destination; once the bridge is stable enough, it can carry over its existing planning, coding, debugging, and iteration skills, then use them to create new things in FreeCAD, Blender, video editors, games, and beyond.

3. Preview is for both humans and agents.

A preview trajectory gives the agent a feedback loop. It connects “I ran this command” to “the artifact now looks like this.” That makes long creative and build tasks less blind for agents, and much easier for people to follow.

4. CLI-Hub is becoming the distribution layer for these harnesses.

The goal is to move beyond one-off scripts for individual demos and make high-quality harnesses easier to find, install, test, improve, and reuse across projects.

Our current view: stronger harnesses are one of the most practical ways to expand what general agents can do today. Longer term, we expect agentic model training to absorb these environments and feedback loops more directly.

CLI-Hub: https://t.co/9yJ1yA0Kv4
Release: https://t.co/Q203cHUWyz

1

34

7

26

4K

itsyuhao retweeted

Huli

@hulitw

about 2 months ago

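Some takeaways from using AI for reverse engineering lately. This is where my view of AI started to change. I used to think AI was merely very capable in the software field; now it feels as if there is nothing AI cannot do in software. At least in reverse engineering, it completely exceeded my expectations and opened the eyes of this frog at the bottom of a well. Take anything apart, nothing is surprising anymore, everything can be reversed. https://t.co/TBm5Kup5kN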

16

515

72

515

48K

Yuhao Yang

@itsyuhao

about 2 months ago

We're literally CLI-ing anything and everything..🤔

Mishaal Rahman

@MishaalRahman

about 2 months ago

👩‍💻 A new tool for making Android apps is here: Android CLI! It's the primary interface for Android development from the terminal and is designed to make your agents more efficient, effective, & capable of following the latest development best practices!

6

206

23

72

14K

0

3

0

2

624
