Hugging Face: Warm-starting encoder-decoder models with pre-trained checkpoints (BERT, GPT-2) in Transformers | SignalBreak | SignalBreak