Google releases Gemma 3n — open-source multimodal model available
Action Required
Developers can now use an open-source multimodal model for applications such as AI-powered assistants and creative content generation.
AI Impact Summary
Google has officially released Gemma 3n, a new open-source multimodal model, with support across a wide range of open-source libraries and frameworks. The release gives developers a locally runnable model that handles text, image, audio, and video inputs, enabling applications such as image captioning, speech-to-text, and multimodal understanding. It ships in two sizes, E2B and E4B, with different memory requirements, so developers can choose the one that fits their hardware.
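Choosing between the two sizes based on available memory could look roughly like the sketch below. It is illustrative only: the memory thresholds are placeholder values rather than official requirements, the model IDs are assumed Hugging Face Hub names that should be verified before use, and `pick_variant` is a hypothetical helper, not part of any library.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Variant:
    name: str
    model_id: str         # assumed Hub ID; verify before use
    min_memory_gb: float  # placeholder threshold, not an official figure


# Ordered from largest to smallest so the first match is the biggest that fits.
VARIANTS = [
    Variant("E4B", "google/gemma-3n-E4B-it", 3.0),
    Variant("E2B", "google/gemma-3n-E2B-it", 2.0),
]


def pick_variant(available_gb: float) -> Optional[Variant]:
    """Return the largest variant whose assumed threshold fits, else None."""
    for variant in VARIANTS:
        if available_gb >= variant.min_memory_gb:
            return variant
    return None


if __name__ == "__main__":
    choice = pick_variant(2.5)
    print(choice.model_id if choice else "no variant fits")
```

Replace the thresholds with measured figures for the target runtime and quantization, since actual memory use depends on both.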
Affected Systems
- Date: not specified
- Change type: capability
- Severity: high