Hugging Face: Bloom inference optimization: 5x latency reduction and 50x throughput on Bloom model | SignalBreak | SignalBreak