Anthropic is leaning into agentic coding and heavily so. It makes sense to use s... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		jascha_eng 56 days ago \| parent \| context \| favorite \| on: Claude Opus 4.5 Anthropic is leaning into agentic coding and heavily so. It makes sense to use swe verified as their main benchmark. It is also the one benchmark Google did not get the top spot last week. Claude remains king that's all that matters here.

Mkengin 56 days ago [–]

I am eagerly awaiting swe-rebench results for November with all the new models: https://swe-rebench.com/

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact