Hugging Face: OpenAI releasing VAKRA benchmark for AI agent evaluation | SignalBreak