Hugging Face: Big Bench Audio evaluates audio reasoning gap in GPT-4o and Gemini 1.5 across 18 configurations | SignalBreak