Hugging Face: Prefill/Decode batching optimization for vLLM with Llama-3.1-8B on H100 GPUs | SignalBreak | SignalBreak