Hugging Face: Optimizing LLM Inference: Lower Precision & Flash Attention | SignalBreak | SignalBreak