Model Selection ComparisonsComparison

Opus performance discrepancy in Cursor vs Claude Code

March 30, 2026Theo - t3.gg

Theo reports Opus scoring “20% higher in Cursor than in Claude Code,” implying tooling/harness differences can materially change perceived model quality even when the underlying model is the same.

Open in PulseSee the full expert discussion →

QUOTES

We don’t talk about this enough.

Opus scored 20% higher in Cursor than in Claude Code.

VOICES

Theo - t3.gg

RELATED TERMS

benchmarkstoolingcursorclaude codeclaude opus

OTHER FINDINGS IN MODEL SELECTION COMPARISONS

Mythos size-and-price expectations (multi-trillion parameter '10T' pricing)Codex vs Claude Code for catching errors and doing reviews Defaulting to OpenAI and Anthropic for consistency vs other LLMs breaking weirdly

AMYGDALA PULSE

See what experts are saying right now

This finding is one of many signals tracked across Artificial Intelligence. The live feed updates every few hours with new expert voices, debates, and emerging ideas.

Open Artificial Intelligence Pulse Browse all topics

← Back to Artificial Intelligence