Cursor just crushed Claude Code — Summary & Key Points

Theo - t3․ggMay 24, 202637:3070K views

TL;DR

Cursor released Composer 2.5, a coding model that rivals GPT-5.5 and Claude Opus on internal benchmarks while costing significantly less and running faster, though it remains locked behind the Cursor IDE. The model uses a novel RL technique called targeted textual feedback and synthetic data generation to distill Kimi K25 into a top-tier coder, but its lack of an API makes external verification difficult.

Key Quotes

"With Colossus 2's million H100 equivalents... we expect this to be a major leap in model capability."
Cursor announcement

The argument

The value thesis

Composer 2.5 scores approximately 63 to 64 percent on Cursor's internal benchmark, matching GPT-5.5 and Opus, but costs 50 cents per million tokens in and 2.50 per million tokens out compared to GPT-5.5's 5 and 30. This makes it seven times cheaper than Composer 1.5 and significantly faster.

The technical mechanism

Cursor uses a novel RL technique called targeted RL with textual feedback to correct specific errors like bad tool calls during training without explicit instructions. They also generate synthetic data by asking agents to delete and re-implement features, using 25x more synthetic tasks than Composer 2.

The ecosystem strategy

Composer is only available inside Cursor, making it hard to benchmark externally. The speaker argues this is a strategic move to lock in users and compete with subsidized API access from Anthropic and OpenAI, which offer massive usage credits to developers.

Future trajectory

Cursor has a pending deal with SpaceX AI to train a significantly larger model using Colossus 2, which provides million H100 equivalents and 10x more total compute than the current base model. This could allow Cursor to leapfrog state-of-the-art coding models in the near future.

Use this with an agent

Copy or download either the structured summary or the full transcript.

Have your own business or tech recording?

Turn demos, webinars, strategy calls, and tech talks into transcripts, summaries, and agent-ready briefs.

Try Typist free
Related

Keep exploring

More summaries from the Typist library — picked for the same category