Hugging Face: Smolagents adds vision-language model (VLM) support for vision-enabled agents | SignalBreak | SignalBreak