Hugging Face: Smolagents adds Vision Language Model (VLM) support | SignalBreak | SignalBreak