Hugging Face: SmolVLM2 enables on-device video understanding with 256M/500M/2.2B models and MLX-ready APIs | SignalBreak | SignalBreak