Shomil Jain

about 13 hours ago

Excited to train some very strong models!

69

1K

60

32

56K

Shomil_J retweeted

Elon Musk

@elonmusk

29 days ago

Try it out! (Partially trained on Colossus 2)

2K

28K

3K

12M

Shomil_J retweeted

think less, do more / prev: product @twitter, eng @ucberkeley

about 1 month ago

We use previous generations of Composer to train future ones. Our autoinstall system has earlier Composer models set up dev environments for RL training. That way, the next generation can focus on learning to solve harder problems. https://t.co/GbZILEfhAt

46

891

60

224

225K

Shomil_J retweeted

Michael Truell

@mntruell

about 2 months ago

Excited to partner with the SpaceX team to scale up Composer. A meaningful step on our path to build the best place to code with AI.

480

10K

1K

447

2M

Shomil_J retweeted

SpaceX

@SpaceX

about 2 months ago

SpaceXAI and @cursor_ai are now working closely together to create the world’s best coding and knowledge work AI. The combination of Cursor’s leading product and distribution to expert software engineers with SpaceX’s million H100 equivalent Colossus training supercomputer will allow us to build the world’s most useful models. Cursor has also given SpaceX the right to acquire Cursor later this year for $60 billion or pay $10 billion for our work together.

2K

40K

5K

23M

Shomil_J retweeted

3 months ago

Earlier this week, we published our technical report on Composer 2. We're sharing additional research on how we train new checkpoints. With real-time RL, we can ship improved versions of the model every five hours. https://t.co/f75l7Qa4fr

101

2K

129

503

507K

Shomil_J retweeted

Sasha Rush

@srush_nlp

3 months ago

Lots of juicy details from Composer 2 training. How we think about RL, how we set up envs, why we think it scales, why we make our own evals...

14

467

40

224

50K

Shomil_J retweeted

3 months ago

It's a good model

19

247

7

14

28K

Shomil_J retweeted

elie

@eliebakouch

3 months ago

cursor composer 2 tech report is VERY VERY nice

some of the things i found interesting:

> nice "scaling" study of how rl performance is impacted by continual pretraining (more flops => lower ppl => higher RL perf)
> they added mtp head to k2.5 for speculative decoding, they use self distillation objective which is not standard i think for mtp training
> they added length penalty RL to force the model to think more on long tasks and less on easy ones
> they use self summarization (introduced in a previous blog post). cursor mentions the model has a 200k context window but the tech report mentions ctx extension to 256k, so it means they reserve 50k for the compaction/self summarization? it's a bit higher than the 33k token in claude code.
> nice that they report improvement on both best of k and average perf
> very very nice infra section on kernels, parallelism, quantization and muchhh more (i need to read this more in depth!)

7

371

30

180

40K

Shomil_J retweeted

3 months ago

We're releasing a technical report describing how Composer 2 was trained.

169

5K

479

4K

1M

Shomil_J retweeted

Mike

@grabbou

3 months ago

We evaluated Composer 2 in our React Native evals, and I'll say this: the @cursor_ai team is cooking 🧑‍🍳

44

1K

60

189

109K

Shomil_J retweeted

Next.js

@nextjs

3 months ago

Cursor's Composer 2 just took second place on the Next.js evals leaderboard, beating both Opus and Gemini. See the full rankings ↓ https://t.co/9lEr5K7lUT

36

1K

71

165

245K

Shomil_J retweeted

3 months ago

Composer 2 marks the one-year anniversary of our large model training efforts. Since then, we've built an exceptionally talent-dense team of ~40 people with some of the best researchers and engineers from the labs, academia, industry, and more heterogeneous backgrounds. And we are exclusively focused on coding. We don't care about models that can respond to emails, do your tax returns, or be your friend. Every FLOP, token, parameter, and researcher is entirely dedicated to software engineering.

77

1K

51

88

133K

Shomil_J retweeted

Federico Cassano

@ellev3n11

3 months ago

a lot went into this model. it was fun! i hope people enjoy it.

24

244

12

3

21K

Shomil_J retweeted

3 months ago

Composer 2 is now available in Cursor.

646

10K

878

2K

5M

Shomil_J retweeted

Naman Jain

@StringChaos

3 months ago

New post: how we do evals at @cursor_ai. Takeaways: 1. Online metrics from real Cursor requests provide construct validity 2. CursorBench: a dynamic offline suite distilled from online learnings 3. Multi-axes evals -- correctness, efficiency, agent interaction behavior

5

147

18

103

39K

Shomil_J retweeted

6 months ago

Graphite is joining Cursor. https://t.co/bfj0447Q13

195

4K

255

760

1M

Shomil_J retweeted