OpenAdaptAI @openadaptai - Twitter Profile

Pinned Tweet

about 2 years ago

Here's the latest from @OpenAdaptAI, faster and more robust. No command line required! #AI #agent #OpenAI #GPT4 Free download at https://t.co/cDF7XTzTBc 🚀

2

24

4

32

5K

OpenAdaptAI retweeted

JJ

@JosephJacks_

3 months ago

Don’t get me wrong, I love Opus 4.6 But there is no fucking way I’m letting Anthropic control my computer That’s why we have open source

40

202

8

14

32K

OpenAdaptAI @OpenAdaptAI

5 months ago

New and improved! More coming soon.

0

2

0

84

OpenAdaptAI retweeted

Xinyuan Wang @xywang626

10 months ago

We are super excited to release OpenCUA — the first from 0 to 1 computer-use agent foundation model framework and open-source SOTA model OpenCUA-32B, matching top proprietary models on OSWorld-Verified, with full infrastructure and data.

🔗 [Paper] https://t.co/SYEio5ccNJ
📌 [Website] https://t.co/ma6bBuYiNM
🤖 [Models] https://t.co/7TVtIdjkmq
📊 [Data] https://t.co/N6tQQwQkhs
💻 [Code] https://t.co/ihr8TXmG6k

🌟 OpenCUA — comprehensive open-source framework for computer-use agents, including:
📊 AgentNet — first large-scale CUA dataset (3 systems, 200+ apps & sites, 22.6K trajectories)
🏆 OpenCUA model — open-source SOTA on OSWorld-Verified (34.8% avg success, outperforms OpenAI CUA)
🖥 AgentNetTool — cross-system computer-use task annotation tool
🏁 AgentNetBench — offline CUA benchmark for fast, reproducible evaluation

💡 Why OpenCUA?
Proprietary CUAs like Claude or OpenAI CUA are impressive🤯 — but there’s no large-scale open desktop agent dataset or transparent pipeline. OpenCUA changes that by offering the full open-source stack 🛠: scalable cross-system data collection, effective data formulation, model training strategy, and reproducible evaluation — powering top open-source models including OpenCUA-7B and OpenCUA-32B that excel in GUI planning & grounding.

Details of OpenCUA framework👇

14

464

103

253

165K

OpenAdaptAI retweeted

Xinyuan Wang @xywang626

10 months ago

🙌 Acknowledgement: We thank @ysu_nlp, @CaimingXiong, and the anonymous reviewers for their insightful discussions and valuable feedback. We are grateful to Moonshot AI for providing training infrastructure and annotated data. We also sincerely appreciate Jin Zhang, Hao Yang, Zhengtao Wang, and Yanxu Chen from the Kimi Team for their strong infrastructure support and helpful guidance. The development of our tool is based on the open-source projects DuckTrack @arankomatsuzaki and @OpenAdaptAI; we are very grateful for their commitment to the open-source community. Finally, we extend our deepest thanks to all annotators for their tremendous effort and contributions to this project. ❤️

0

21

3

1K

OpenAdaptAI retweeted

Rico Pagliuca @pagilgukey

about 1 year ago

Anybody looking for a GUI+ICL-->MCP library should definitely check out OmniMCP which puts Microsoft's Omniparser to use in generating GUI tool use APIs. Early days but pretty neat https://t.co/iJkRVDO57B

1

5

2

1

269

OpenAdaptAI retweeted

Python Hub

@PythonHub

almost 2 years ago

OpenAdapt AI-First Process Automation with Large Multimodal Models (LMMs). https://t.co/A3LMVJNyrF

0

15

4

1

2K

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

I prompted @openai's ChatGPT o3-mini-high and @DeepSeek's R1 to implement code for deploying @alibaba_qwen's Qwen2.5-VL.

Both agree that R1's implementation is "more comprehensive" and better "for production systems". https://t.co/3Rn0Yd3XaG

1

7

1

0

745

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

Qwen2.5-VL is the first open source multimodal model that appears to be able to accurately generate bounding box coordinates 🚀

Thank you @Alibaba_Qwen! Excited to integrate this in @OpenAdaptAI

https://t.co/XJwVgm991i https://t.co/LjXgOQCbY7

1

8

2

4

2K

OpenAdaptAI retweeted

Yujia Qin @TsingYoga

over 1 year ago

Check out our latest GUI Agent -> UI-TARS 🥳
A vision-language model surpasses GPT-4o & Claude Computer-Use

Paper, code, model ckpt, desktop APP are now open-sourced~
https://t.co/7umVHrnMds
https://t.co/f4973AmmQh https://t.co/T1zWbNxE5e

10

183

36

114

37K

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

https://t.co/oJq0bv7kKF > 🚀 Introducing Kimi k1.5 --- an o1-level multi-modal model 🤯

1

2

1

0

271

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

https://t.co/EXFqSIvRCg > DeepSeek-R1 achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks. https://t.co/Vgnovq9hsD We can run frontier models at home now.

1

6

2

1

598

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

Another day, another breakthrough:

Apply DCT to convert actions into frequency components, quantize them prioritizing low frequencies, then use autoregressive prediction in frequency order (low to high) to generate actions.

From @physical_int. May generalize to @OpenAdaptAI. https://t.co/khbuKqhwsh

1

7

1

2

333

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

@hwchase17 With @OpenAdaptAI you start and stop recording demonstrations of repetitive tasks via the tray icon. Show, don't tell. Perform, don't prompt.

0

3

1

0

247

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

Nice taxonomy of Agents from @huggingface smolagents

0

3

1

260

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

@JosvdWest @OpenAdaptAI Sure does! Mac and Win compatible.

1

3

1

209

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

@OpenAdaptAI @julien_c @Microsoft @AWS @Docker (venv) % python https://t.co/BUIUZmT8s9 http://34.206.53.77:7861 ~/Desktop/screenshot.png
Loaded as API: http://34.206.53.77:7861/ ✔
Parsed content:
...
2024-10-29 11:13:07.414 | INFO | __main__:predict:84 - Output image saved to: output_image.png https://t.co/LlKFjIv7gx

3

2

1

0

202

OpenAdaptAI @OpenAdaptAI

over 1 year ago

@abrichr @julien_c @Microsoft @AWS @Docker `(venv) % python https://t.co/poRKhZ7zuW start`

2

0

37

OpenAdaptAI retweeted

Richard Abrich @abrichr

over 1 year ago

@julien_c Deploy @Microsoft OmniParser to @AWS #EC2 automatically via @Docker and #GitHub actions: https://t.co/4NCWERHZnE

1

6

3

1

284

OpenAdaptAI retweeted

louis030195 | screenpipe (YC S26)

@louis030195

over 1 year ago

within the next year, AI will be able to ingest everything that ever happened on your computer check out this cool video about tools enabling this: @OpenAdaptAI @tooluseai @MikeBirdTech @OpenInterpreter @FieroTy @abrichr and @screenpipe :) https://t.co/umYIhLSthP

3

12

3

17K

OpenAdaptAI retweeted

Julien Chaumond

@julien_c

over 1 year ago

If I was starting a company today, I would look into productizing this model 👀

14

1K

83

1K

164K

