On a 2.0 terminal benchmark, OpenAI’s model scores about 10% higher, guiding users toward stronger results on long, complex ...
Add Decrypt as your preferred source to see more of our stories on Google. Anthropic released Claude Sonnet 4.5, calling it the best coding model yet. The model scored 77.2% on SWE-bench Verified, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results