Hugging Face: Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator | SignalBreak