OpenAI: Introducing the SWE-Lancer benchmark — evaluating frontier LLMs for freelance software engineering | SignalBreak | SignalBreak