Hugging Face: Differential Transformer V2: faster inference with no custom kernels and improved training stability | SignalBreak | SignalBreak