Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →
Gemma 3 4B
Model family: gemma-3
The 4B Gemma 3 — laptop-feasible and multimodalA model that can handle more than one type of input or output — typically text plus images, sometimes plus audio or video. "GPT-4 Vision" and "Llama 3.2 11B Vision" are multimodal models that accept both text and images. A text-only model is called "unimodal" but nobody uses that term; text-only is the assumed default., under the custom Gemma Terms of Use; superseded by Gemma 4.
Identity
- Creator
- Model family
- gemma-3
- Release date
- 2025-03-11
Technical specs
- Parameter count
- 4B
- Context window
- 131K tokens
- Modalities
- Image Input
- Text
- Primary capabilities
- Chat
- Instruction Following
- Multilingual
- Reasoning
- Vision
License
- License
- Gemma Terms of Use
- Commercial use
- Allowed
- Terms
- Modification ✓
- Redistribution ✓
- Attribution ✓
Access
- Openness
- Open Weight
- Access methods
- Local Runtime Llama Cpp
- Local Runtime Lm Studio
- Local Runtime Ollama
- Local Runtime Vllm
- Weights Download Hf
- Cost tier
- Mixed
Sources
- llm
- open-weight
- commercial-friendly
- small
- on-device
- multimodal
- us-based
- gemma-terms