Jacob Jackson

19 days ago

Our new model is out. It stacks up nicely against the frontier!

19 days ago

Introducing Composer 2.5, our most powerful model yet. It's more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions. For the next week, we’re doubling the included usage of the model.

cursor_ai's tweet photo. Introducing Composer 2.5, our most powerful model yet.

It's more intelligent, better at sustained work on long-running tasks, and more reliable at following complex instructions.

For the next week, we’re doubling the included usage of the model. https://t.co/N87ojcXlOC

927

13K

1K

3K

20M

4

105

1

5

4K

jbfja retweeted

Co-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log

2 months ago

We’re introducing Cursor 3. It is simpler, more powerful, and built for a world where all code is written by agents, while keeping the depth of a development environment.

725

9K

905

3K

3M

Who to follow

Lilian Weng

@lilianweng

Chris Olah

@ch402

Reverse engineering neural networks at @AnthropicAI. Previously @distillpub, OpenAI Clarity Team, Google Brain. Personal account.

Cristóbal Valenzuela

@c_valenzuelab

@runwayml Co-Founder & Co-CEO

2 months ago

@_chenglou Really cool

0

4

0

2K

2 months ago

@vivekkalyansk On-policy = model that generated the response receiving feedback is the same as the model being trained with RL Implicit feedback = user feedback, but not something like thumbs up/thumbs down, which would be explicit

1

3

0

2

115

2 months ago

Excited to share our work on training Composer with on-policy implicit feedback!

2 months ago

Earlier this week, we published our technical report on Composer 2. We're sharing additional research on how we train new checkpoints. With real-time RL, we can ship improved versions of the model every five hours.

cursor_ai's tweet photo. Earlier this week, we published our technical report on Composer 2.

We're sharing additional research on how we train new checkpoints. With real-time RL, we can ship improved versions of the model every five hours. https://t.co/f75l7Qa4fr

102

2K

129

504

507K

5

78

2

7

9K

2 months ago

Composer 2 technical report - excited to share details about how we trained the model!

2 months ago

Read the full report: https://t.co/GLY24X0Gov

8

313

16

224

73K

2

61

1

3

5K

jbfja retweeted

2 months ago

We're releasing a technical report describing how Composer 2 was trained.

169

5K

482

4K

1M

jbfja retweeted

Vicent Martí

@vmg

2 months ago

This is my first post on the Cursor blog: an interactive survey on the state of the art for n-gram indexes.

18

321

19

51

35K

3 months ago

Composer 2 is frontier-level on coding benchmarks. You should try it!

3 months ago

Composer 2 is now available in Cursor.

647

10K

880

2K

5M

2

28

0

2K

jbfja retweeted

Sasha Rush

@srush_nlp

3 months ago

It was kind of amazing how many RL challenges in this run were bootstrapped by earlier Composers. Interesting times.

5

129

6

11

9K

jbfja retweeted

Sam Kottler @samkottler

3 months ago

a ton of work went into building composer 2 and it's a good model! try it and let us know what you think - https://t.co/zHWWHpgUf4

3

35

2

0

2K

jbfja retweeted

Federico Cassano

@ellev3n11

3 months ago

a lot went into this model. it was fun! i hope people enjoy it.

24

244

12

3

21K

jbfja retweeted

Ashvin Nair

@ashvinair

3 months ago

Very excited for the world to try this model! People like it a lot internally at Cursor - feels frontier-level smart and extremely fast

6

101

5

4

6K

jbfja retweeted

Naman Jain

@StringChaos

3 months ago

New post: how we do evals at @cursor_ai. Takeaways: 1. Online metrics from real Cursor requests provide construct validity 2. CursorBench: a dynamic offline suite distilled from online learnings 3. Multi-axes evals -- correctness, efficiency, agent interaction behavior

5

147

18

103

39K

jbfja retweeted

Michael Truell

@mntruell

7 months ago

After adopting Cursor, businesses merge ~40% more PRs each week. New economics research from the University of Chicago.

48

923

66

338

226K

jbfja retweeted