Hugging Face: Sentence Transformers multimodal fine-tuning supports Qwen3-VL-Embedding-2B for Visual Document Retrieval | SignalBreak | SignalBreak