Theo reports Opus scoring “20% higher in Cursor than in Claude Code,” which implies that differences in tooling or harness can materially change perceived model quality even when the underlying model is the same.
We don’t talk about this enough.
Opus scored 20% higher in Cursor than in Claude Code.