Hugging Face: TRL: Co-located vLLM for Efficient LLM Training | SignalBreak | SignalBreak