ℏεsam says ARC-AGI-3 is extremely hard, with GPT-5.4 High, Gemini 3.1 Pro Preview, and Anthropic Opus 4.6 Max all scoring around 0.2 to 0.3 percent and Grok 4.2 at 0.00 percent.
ARC-AGI-3 IS BRUTAL.
10 days after the release and Grok 4.2 has scored a glorious 0.00%.
🥇 GPT-5.4 (High): 0.3%
🥈 Gemini 3.1 Pro (Preview): 0.2%
🥉 Anthropic Opus 4.6 (Max): 0.2%