Flo Crivello and TWiST discuss open models narrowing the gap with proprietary models, including a claim that an OSS model beat Sonnet 4.6 on internal evals and a claim that open models have historically lagged by about six months.
Okay this one seems real. First time ever an OSS model beats Sonnet 4.6(!!) on our evals.
Now begins vibe testing, but this is promising.
“Open models have been about six months behind for the last few years.”