Amrita

9 days ago

My favorite feature since Cursor Automations. Visual learners, this is for you!

are u feeling these electric vibes

10 days ago

With canvases, Cursor can create apps like dashboards, reports, and internal tools. Now you can publish a canvas and share it with your team via URL.

98

2K

100

577

159K

0

3

0

46

iamrita98 retweeted

Dwarkesh Patel

@dwarkesh_sp

10 days ago

Recently met @srush_nlp and he started giving me an impromptu lecture on how targeted on-policy self-distillation works. I asked him if I could record it on my iPhone. The basic idea is this: if the model made a mistake at some point in the rollout (for example, calling a tool that doesn't exist), we want to discourage this specific error, but we don't want to just learn from the final reward, because it's a very noisy signal spread out over the whole trajectory. So we have another model read this trajectory and figure where the error was made. It simply inserts some hint tokens to the part of the trajectory right above where the mistake was made. Now with these injected hint tokens, have the model run a forward pass. You're not having to regenerate a new rollout - aka no new decode required. The hint causes the model to assign lower probabilities to the error tokens. You then trains the original model to match these new probabilities, teaching it to downweight that specific mistake.

41

3K

173

3K

413K

iamrita98 retweeted

Artificial Analysis

@ArtificialAnlys

24 days ago

Cursor's new Composer 2.5 takes third on the Artificial Analysis Coding Agent Index and is ~10-60x lower cost than the higher-effort Opus 4.7 and GPT-5.5 variants above it. This release puts Composer among the leading coding agent models, something that wasn’t clear for past releases @cursor_ai has released Composer 2.5, the latest model in its Composer line. Composer 2.5 scored 62 on our Coding Agent Index, a 14 point gain over Composer 2 (48). This puts it in third place of our tested agents, behind only Claude Opus 4.7 (max) in Claude Code (66) and GPT-5.5 (xhigh reasoning) in Codex (65). These cost $4.10 and $4.82 per task respectively, ~10x the cost of Composer 2.5 Fast ($0.44) and ~60x the cost of Composer 2.5 standard ($0.07). Key results for Composer 2.5 in Cursor CLI: ➤ Cost-quality Pareto frontier: At $0.07 (standard) and $0.44 (Fast) per task, Composer 2.5 is cheaper than every other agent scoring above 60 on the Index. Medium-effort peers cost $1.24–$2.21 per task; higher-effort variants land 3-4 points above at $4.10–$4.82 ➤ Per-benchmark gains vs Composer 2: +35 points on SWE-Bench-Pro-Hard-AA (12% → 47%), +2 points on Terminal-Bench v2 (64% → 66%), and +3 points on SWE-Atlas-QnA (69% → 72%). At 47%, Composer 2.5's score on SWE-Bench-Pro-Hard-AA is comparable to Claude Opus 4.7 (max) in Claude Code ➤ Among the fastest coding agents: Composer 2.5 Fast runs at an average wall time of 6.7 minutes per task, the third-fastest agent on the Artificial Analysis Coding Agent Index, behind only Claude Opus 4.7 (medium) in Claude Code (5.8m) and GPT-5.5 (medium) in Cursor CLI (6.2m) ➤ Fast mode enables better responsiveness at 6x pricing: Fast runs 30% faster than standard Composer 2.5, but is ~6x the cost per task ($0.44 vs $0.07). Token pricing is 6x higher for Fast: $3.00/$15.00 vs $0.50/$2.50 per million input/output tokens Model details: ➤ Base model: Continued training on @Kimi_Moonshot's open weights Kimi K2.5 as with Composer 2, with Cursor reporting ~85% of total compute from its own additional training and reinforcement learning ➤ Pricing: $0.50/$2.50 per million input/output tokens for the standard variant; $3.00/$15.00 for the Fast variant (the default in Cursor) ➤ Available exclusively in Cursor: both Cursor IDE and Cursor CLI, an externally accessible API is not available Congratulations @cursor_ai and @mntruell on the impressive release!

ArtificialAnlys's tweet photo. Cursor's new Composer 2.5 takes third on the Artificial Analysis Coding Agent Index and is ~10-60x lower cost than the higher-effort Opus 4.7 and GPT-5.5 variants above it. This release puts Composer among the leading coding agent models, something that wasn’t clear for past releases

@cursor_ai has released Composer 2.5, the latest model in its Composer line. Composer 2.5 scored 62 on our Coding Agent Index, a 14 point gain over Composer 2 (48). This puts it in third place of our tested agents, behind only Claude Opus 4.7 (max) in Claude Code (66) and GPT-5.5 (xhigh reasoning) in Codex (65). These cost $4.10 and $4.82 per task respectively, ~10x the cost of Composer 2.5 Fast ($0.44) and ~60x the cost of Composer 2.5 standard ($0.07).

Key results for Composer 2.5 in Cursor CLI:

➤ Cost-quality Pareto frontier: At $0.07 (standard) and $0.44 (Fast) per task, Composer 2.5 is cheaper than every other agent scoring above 60 on the Index. Medium-effort peers cost $1.24–$2.21 per task; higher-effort variants land 3-4 points above at $4.10–$4.82

➤ Per-benchmark gains vs Composer 2: +35 points on SWE-Bench-Pro-Hard-AA (12% → 47%), +2 points on Terminal-Bench v2 (64% → 66%), and +3 points on SWE-Atlas-QnA (69% → 72%). At 47%, Composer 2.5's score on SWE-Bench-Pro-Hard-AA is comparable to Claude Opus 4.7 (max) in Claude Code

➤ Among the fastest coding agents: Composer 2.5 Fast runs at an average wall time of 6.7 minutes per task, the third-fastest agent on the Artificial Analysis Coding Agent Index, behind only Claude Opus 4.7 (medium) in Claude Code (5.8m) and GPT-5.5 (medium) in Cursor CLI (6.2m)

➤ Fast mode enables better responsiveness at 6x pricing: Fast runs 30% faster than standard Composer 2.5, but is ~6x the cost per task ($0.44 vs $0.07). Token pricing is 6x higher for Fast: $3.00/$15.00 vs $0.50/$2.50 per million input/output tokens

Model details:

➤ Base model: Continued training on @Kimi_Moonshot's open weights Kimi K2.5 as with Composer 2, with Cursor reporting ~85% of total compute from its own additional training and reinforcement learning

➤ Pricing: $0.50/$2.50 per million input/output tokens for the standard variant; $3.00/$15.00 for the Fast variant (the default in Cursor)

➤ Available exclusively in Cursor: both Cursor IDE and Cursor CLI, an externally accessible API is not available

Congratulations @cursor_ai and @mntruell on the impressive release!

60

1K

146

254

242K

Who to follow

matt

@matt_bern_

Anna Yang

@apeelingbananna

she/her | swe @amazon | symsys ai, nlp, & music @stanfordsymsys @stanfordhci | mildly chaotic thoughts may or may not be my own

eliza

@elizaminnelli_

@Stanford alum • law student & former philanthropoid • tweets my own!

iamrita98 retweeted

NVIDIA AI

@NVIDIAAI

26 days ago

New model loading 💪

44

3K

114

135

218K

iamrita98 retweeted

Danny Limanseta

@DannyLimanseta

26 days ago

Very impressed with Composer 2.5 after about a day of usage. I've almost moved over to it exclusively from GPT 5.5, even using it for planning now. It's like Opus 4.7 on steroids, crazy fast. Fast models really get me into the flow of building, which is exhilarating. Also, a sneak peek of a weekend mini-project I'm building: a racing game where you race your mouse cursor.

132

3K

116

1K

298K

iamrita98 retweeted

Michael Truell

@mntruell

27 days ago

Composer 2.5 is a significant step up from Composer 2. This is the very start of our work with SpaceXAI. Hope to have more improvements out soon.

369

5K

906

184

1M

about 1 month ago

!!!!!!

DiscussingFilm

@DiscussingFilm

about 1 month ago

First poster for ‘EAST OF EDEN’, starring Florence Pugh. The series follows the intertwined destinies of the Trask and Hamilton families in California's Salinas Valley. Releasing this Fall on Netflix.

DiscussingFilm's tweet photo. First poster for ‘EAST OF EDEN’, starring Florence Pugh.

The series follows the intertwined destinies of the Trask and Hamilton families in California's Salinas Valley.

Releasing this Fall on Netflix. https://t.co/Ed4IPRCnKj

151

6K

435

575

526K

0

1

0

102

@aye_aye_kaplan @ericzakariasson @drawitpoorly @drawitpoorly me

about 1 month ago

1

0

75

iamrita98 retweeted

about 1 month ago

Cursor is now available in Microsoft Teams. Mention @Cursor in any channel to delegate tasks to an agent or pull information from Cursor into Teams.

72

1K

104

242

181K

about 1 month ago

@JamesLeakos @cursor_ai this is awesome!

0

3

0

176

iamrita98 retweeted

Ben Lang

@benln

about 1 month ago

Cafe Cursor in Lagos

43

1K

66

46

41K

iamrita98 retweeted

eric zakariasson

@ericzakariasson

about 1 month ago

cursor sdk launched yesterday! people are already putting cursor agents in places they already work: gmail, chrome, ci, terminal, docs github issues here are 11 projects built in the first day ↓

40

944

53

891

93K

about 2 months ago

@Reshusaur with @cursor_ai cloud agents, you can close your laptop :)

0

46

about 2 months ago

my adhd brain using btw like there's no tomorrow

about 2 months ago

/btw allows you to ask a quick side question without derailing the agent's current run.

17

316

11

41

186K

0

2

0

171

2 months ago

Especially love working with the CLI across multiple repos and when I’m building things in XCode. Go give it a whirl!

Michael Feldstein

@msfeldstein

2 months ago

Great first week on Cursor CLI with @luis18 We shipped a lot Rough edges sanded down And here's a thread with a few new features we added Lots to still do, feedback always welcome

msfeldstein's tweet photo. Great first week on Cursor CLI with @luis18
We shipped a lot
Rough edges sanded down
And here's a thread with a few new features we added
Lots to still do, feedback always welcome https://t.co/Paq80BGcRl

22

221

19

37

55K

0

9

1

0

487

2 months ago

@msfeldstein I think you’d love this @jasonedamour

0

7

2 months ago

This still blows my mind every day!

2 months ago

Cursor can now attach demos and screenshots of its work to PRs it opens. Your team can review artifacts created by cloud agents directly in GitHub.

114

2K

135

581

323K

2

17

0

1

800

iamrita98 retweeted

Michael Feldstein

@msfeldstein

2 months ago

Glass / Agents Window is the big story of Cursor 3 but there's lots of improvements we made to the standard IDE, its not going anywhere. My favorite is using standard vscode tabs for our chats so you can easily pull them out side by side and use all the standard VSCode shortcuts to manage them. It makes managing parallel agents a lot easier.

7

156

6

45

25K