News
The startup says it trained its new family of AI models — SWE-1, SWE-1-lite, and SWE-1-mini — to be optimized for the “entire software engineering process,” not just coding.
GPT-4.1 arrives as OpenAI rivals like Google and Anthropic ratchet up efforts to build sophisticated programming models. Google’s recently released Gemini 2.5 Pro, which also has a 1-million ...
Windsurf has introduced its first set of SWE-1 models, aimed at supporting the full range of software engineering tasks, not limited to code generation. The lineup consists of three models SWE-1, SWE- ...
Unlike general-purpose AI models that have been adapted for coding tasks, the SWE-1 family was built to address the full spectrum of software engineering activities.
In a new paper, OpenAI researchers detail how they developed an LLM benchmark called SWE-Lancer to test how much foundation models can earn from real-life freelance software engineering tasks.
Benchmarks drive many areas of research forward, and this is indeed the case for two areas of research that I engage with: ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results