Hugging Face: Hugging Face training efficiency through packing with Flash Attention 2 and DataCollatorWithFlattening | SignalBreak