Hugging Face: Q8-Chat enables 8-bit quantized LLM inference on Xeon CPUs with Hugging Face Optimum Intel | SignalBreak