Hugging Face: TimeScope benchmark assesses long-video understanding for vision-language models | SignalBreak | SignalBreak