IDEFICS open multimodal model reproduces Flamingo with 9B and 80B variants
AI Impact Summary
IDEFICS delivers an open-access Flamingo-style multimodal model at 9B and 80B parameters, built from laion/CLIP-ViT-H-14-laion2B-s32B-b79K and huggyllama/llama-65b, with claimed comparable performance to Flamingo across benchmarks. The release emphasizes transparency, dataset exploration via OBELICS, and safety testing through adversarial prompts, but mixes licenses (MIT for the CLIP component and a non-commercial license for llama-65B) that constrain production use. This enables rapid prototyping in Hugging Face workflows but requires governance around licensing, model weights distribution, and potential commercial deployment restrictions for downstream products.
Affected Systems
- Date
- Date not specified
- Change type
- capability
- Severity
- info