Google releases Gemma 3n — open-source multimodal model available

Action Required

Developers can now use Gemma 3n, an open multimodal model that runs locally, for applications such as AI-powered assistants and creative content generation.

AI Impact Summary

Google has released Gemma 3n, a new open-source multimodal model, with support across a wide range of open-source libraries and frameworks. The release gives developers a locally runnable model that accepts text, image, audio, and video inputs, which enables applications such as image captioning, speech-to-text, and multimodal understanding. The model comes in two sizes, E2B and E4B, with different memory requirements, so developers can pick the one that fits their hardware.
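As a rough illustration of choosing between the two sizes, the sketch below picks the largest variant that fits a given memory budget. The memory figures are assumptions made for this example and are not stated in this alert; check the model cards for actual requirements. The helper name `pick_variant` is hypothetical.

```python
from typing import Optional

# Assumed minimum memory per variant, in GB. Illustrative values only:
# this alert does not state the real requirements.
ASSUMED_MIN_MEMORY_GB = {
    "E2B": 2.0,
    "E4B": 3.0,
}


def pick_variant(available_gb: float) -> Optional[str]:
    """Return the largest variant whose assumed memory need fits, else None."""
    fitting = [
        (need, name)
        for name, need in ASSUMED_MIN_MEMORY_GB.items()
        if need <= available_gb
    ]
    if not fitting:
        return None
    # Tuples compare by memory need first, so max() yields the largest fit.
    return max(fitting)[1]
```

Calling `pick_variant` with the memory available on the target device returns a variant name, or `None` when neither assumed requirement fits.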

Affected Systems

Gemma 3n

Date: not specified
Change type: capability
Severity: high
