Hugging Face: TextQuests benchmark shows LLMs struggle with long-context reasoning in text-based games | SignalBreak | SignalBreak