Copilot Arena @copilotarena - Twitter Profile

Pinned Tweet

over 1 year ago

Check out our findings in our latest preprint! A big thank you to everyone who's been using and voting on Copilot Arena. We couldn't have done it without you all♥️!

Wayne Chi

@iamwaynechi

over 1 year ago

What do developers 𝘳𝘦𝘢𝘭𝘭𝘺 think of AI coding assistants? In October, we launched @CopilotArena to collect user preferences on real dev workflows. After months of live service, we’re here to share our findings in our recent preprint. Here's what we have learned /🧵

iamwaynechi's tweet photo. What do developers 𝘳𝘦𝘢𝘭𝘭𝘺 think of AI coding assistants?

In October, we launched @CopilotArena to collect user preferences on real dev workflows. After months of live service, we’re here to share our findings in our recent preprint.

Here's what we have learned /🧵 https://t.co/mZIsMOY8Fe

3

160

32

125

71K

0

4

1

2K

CopilotArena retweeted

Jiseung Hong @jiseungh99

9 months ago

We are excited to launch the ⚔️PR Arena⚔️ leaderboard! Full results will be revealed after a certain milestone of community votes. Fix your GitHub issues for free and vote for better fix! 👉Leaderboard & Setup Guide: https://t.co/S1Oe3xXm6K

jiseungh99's tweet photo. We are excited to launch the ⚔️PR Arena⚔️ leaderboard!

Full results will be revealed after a certain milestone of community votes.

Fix your GitHub issues for free and vote for better fix!

👉Leaderboard & Setup Guide: https://t.co/S1Oe3xXm6K https://t.co/qhZj4cl7wu

1

24

9

6

6K

CopilotArena retweeted

Jiseung Hong @jiseungh99

9 months ago

Here are some tips for using ⚔️PR Arena⚔️ 1⃣ pr-arena🏷️ option is added automatically to Issue Labels for ease of use! 2⃣ You can use PR Arena in forked repositories. 3⃣ Don't like either fix? Select “neither” and no PR will be created. 👉Install here: https://t.co/bk19LcnBVf

1

14

2

4

4K

Copilot Arena @CopilotArena

9 months ago

📢Calling all developers who contributed votes in Copilot Arena, we need your help building the PR Arena leaderboard 🗳️. You will no longer be restricted to VSCode IDE--any GitHub repo with an open issue is fair game! Check out the thread below for details:

Jiseung Hong @jiseungh99

9 months ago

Introducing ⚔️PR Arena⚔️ - free AI coding agents to fix real GitHub issues. Claude Sonnet 4 vs Gemini 2.5 Pro… Who writes better pull requests? 👉 Install here: https://t.co/bk19LcnBVf Powered by @allhands_ai

4

78

12

22

32K

0

10

1

0

680

Copilot Arena @CopilotArena

10 months ago

Come meet our amazing little brother, Music Arena!

Chris Donahue @chrisdonahuey

10 months ago

Excited to share our beta release of Music Arena, a live evaluation platform for state-of-the-art AI music generation models! 🎧 Listen to the latest models and 🗳️ vote for your favorite ⚔️ https://t.co/XyXvzlOMcH ⭐️ https://t.co/9f3mvseEyu 📜 https://t.co/HqTg5PH09O

chrisdonahuey's tweet photo. Excited to share our beta release of Music Arena, a live evaluation platform for state-of-the-art AI music generation models!

🎧 Listen to the latest models and 🗳️ vote for your favorite

⚔️ https://t.co/XyXvzlOMcH
⭐️ https://t.co/9f3mvseEyu
📜 https://t.co/HqTg5PH09O https://t.co/vxNDB1A5h0

12

238

57

97

28K

0

9

3

0

758

Copilot Arena @CopilotArena

12 months ago

We’re featured in the new tech report on Mercury models! Check it out👇

Aditya Grover

@adityagrover_

12 months ago

Since our launch earlier this year, we are thrilled to witness the growing community around dLLMs. The Mercury tech report from @InceptionAILabs is now on @arxiv with more extensive evaluations: https://t.co/DnDxFvoX0E New model updates dropping later this week!

3

251

38

137

23K

0

7

3

0

852

Copilot Arena @CopilotArena

about 1 year ago

New result: Qwen-2.5-Coder jumps from 13th to joint 1st place with fill-in-the-middle (FiM)! Congrats to @Alibaba_Qwen 🥳 Also check out @lmarena_ai 's new UI 🖥️✨

CopilotArena's tweet photo. New result: Qwen-2.5-Coder jumps from 13th to joint 1st place with fill-in-the-middle (FiM)! Congrats to @Alibaba_Qwen 🥳

Also check out @lmarena_ai 's new UI 🖥️✨ https://t.co/VRMXGzNtxk

0

7

4

3

984

CopilotArena retweeted

Valerie Chen

@valeriechen_

about 1 year ago

Who is winning the race to claim the LLMs for SWE market? We share our thoughts based on our @CopilotArena work. See article below for current sentiments and what lies ahead 👇

0

20

3

2

2K

CopilotArena retweeted

Inception

@_inception_ai

about 1 year ago

We are launching our API in open beta! Visit the Inception Platform to create your account and get started using the first commercial-scale diffusion large language models (dLLMs). https://t.co/joTqBB0cZ4

8

136

30

47

65K

CopilotArena retweeted

CMU School of Computer Science @SCSatCMU

about 1 year ago

With so many AI coding assistants out there, it can be hard to keep track of ones that perform well on real-world tasks. CMU researchers developed Copilot Arena to do just that by crowdsourcing user ratings of LLM-written code. https://t.co/OVObru9h7b

0

10

4

3

1K

CopilotArena retweeted

Valerie Chen

@valeriechen_

about 1 year ago

@CopilotArena was featured in @SCSatCMU news! Featuring quotes from me, @iamwaynechi, @atalwalkar and @chrisdonahuey 🥳 📖Check out the article here: https://t.co/HAbOgZMPDp

0

19

5

0

2K

Copilot Arena @CopilotArena

about 1 year ago

A post about me?

ML@CMU @mlcmublog

about 1 year ago

https://t.co/cTVGkK59Dr How do real-world developer preferences compare to existing evaluations? A CMU and UC Berkeley team led by @iamwaynechi and @valeriechen_ created @CopilotArena to collect user preferences on in-the-wild workflows. This blogpost overviews the design and deployment of Copilot Arena + new insights into developer code preferences.

mlcmublog's tweet photo. https://t.co/cTVGkK59Dr

How do real-world developer preferences compare to existing evaluations? A CMU and UC Berkeley team led by @iamwaynechi and @valeriechen_ created @CopilotArena to collect user preferences on in-the-wild workflows. This blogpost overviews the design and deployment of Copilot Arena + new insights into developer code preferences.

0

18

8

3

4K

0

4

0

218

CopilotArena retweeted

Arena.ai

@arena

about 1 year ago

Check out @CopilotArena’s new Code Edit Leaderboard!

3

70

4

8

10K

Copilot Arena @CopilotArena

about 1 year ago

Try Copilot Arena for free here: https://t.co/9esMDbMc2g Leaderboard at: https://t.co/QrSrc9sNQ9 Paper at: https://t.co/VnEymKOotx Open-source at: https://t.co/1FnXTQ4h5Z

0

4

0

1

575

Copilot Arena @CopilotArena

about 1 year ago

New #1 Leaders of Code Edit Leaderboard: Strong performance from both Claude 3.7 Sonnet and Gemini-2.0-Pro! Congratulations to @AnthropicAI and @GoogleDeepMind 🥇 We also release new live leaderboard interface✨. You can now easily toggle between code completion and code edit.

CopilotArena's tweet photo. New #1 Leaders of Code Edit Leaderboard:
Strong performance from both Claude 3.7 Sonnet and Gemini-2.0-Pro! Congratulations to @AnthropicAI and @GoogleDeepMind 🥇

We also release new live leaderboard interface✨. You can now easily toggle between code completion and code edit. https://t.co/qmQ2iBIvIL

1

69

6

18

22K

Copilot Arena @CopilotArena

about 1 year ago

Curious about how code edits work in Copilot Arena? Check out this post: https://t.co/pZEKzXDwSC

Arena.ai

@arena

over 1 year ago

News from @CopilotArena: Code Editing Leaderboard is now LIVE! We have collected over 3.7k votes on 6 models. Congrats @AnthropicAI Claude 3.5 Sonnet on a 1st place rank!🥇 Blog analysis below👇

arena's tweet photo. News from @CopilotArena: Code Editing Leaderboard is now LIVE!

We have collected over 3.7k votes on 6 models. Congrats @AnthropicAI Claude 3.5 Sonnet on a 1st place rank!🥇

Blog analysis below👇 https://t.co/Nc7ntaIOJE

7

183

12

26

17K

1

6

0

1

1K

Copilot Arena @CopilotArena

about 1 year ago

Copilot Arena is now on Open VSX! Download here: https://t.co/rIqBZNti81

0

5

0

1

254

CopilotArena retweeted

Wayne Chi

@iamwaynechi

over 1 year ago

Interested in trying out Copilot Arena for yourself? Download at https://t.co/NCcFQBkM4Z. Follow us at @CopilotArena for upcoming updates!

0

6

1

0

835

Copilot Arena @CopilotArena

over 1 year ago

Our VS Code Extension is now open source. We're excited to see how you extend it! https://t.co/1FnXTQ4h5Z

0

29

2

10

5K

CopilotArena retweeted

Mikel @MikelEcheve

over 1 year ago

🏆 Mercury Coder’s performance: It’s tied for 2nd place on Copilot Arena, a platform for evaluating coding assistants in real-world settings. This is impressive for a new model based on emerging tech, competing with leaders like DeepSeek V2.5 and Claude Sonnet 3.5. #Coding #AI

MikelEcheve's tweet photo. 🏆 Mercury Coder’s performance: It’s tied for 2nd place on Copilot Arena, a platform for evaluating coding assistants in real-world settings. This is impressive for a new model based on emerging tech, competing with leaders like DeepSeek V2.5 and Claude Sonnet 3.5. #Coding #AI

1

6

1

4K

Copilot Arena

@CopilotArena

Last Seen Users on Sotwe

Trends for you

Most Popular Users