Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →
Ministral 3 3B Instruct
Model family: ministral-3
Smallest Ministral 3 instruct variant — 3B parameters, multimodalA model that can handle more than one type of input or output — typically text plus images, sometimes plus audio or video. "GPT-4 Vision" and "Llama 3.2 11B Vision" are multimodal models that accept both text and images. A text-only model is called "unimodal" but nobody uses that term; text-only is the assumed default., fits in 8GB VRAMThe memory built into a GPU. VRAM size determines what models you can load and run — a model's weights must fit in VRAM (or be cleverly swapped in and out). A 7B model in 4-bit quantization needs about 6GB of VRAM; a 70B model in 4-bit needs about 40GB; full-precision frontier models need multiple high-end GPUs. When people talk about a model "fitting" on a GPU, they mean VRAM. in FP8. Apache 2.0, edge- and smartphone-class deployment.
Identity
- Creator
- Mistral AI
- Model family
- ministral-3
- Release date
- 2025-12-01
Technical specs
- Parameter count
- 3B
- Context window
- 262K tokens
- Modalities
- Image Input
- Text
- Primary capabilities
- Chat
- Function Calling
- Instruction Following
- Long Context
- Multilingual
- Tool Use
- Vision
License
- License
- Apache 2.0
- Commercial use
- Allowed
- Terms
- Modification ✓
- Redistribution ✓
- Attribution ✓
Access
- Openness
- Open Weight
- Access methods
- Api First Party
- Local Runtime Llama Cpp
- Local Runtime Lm Studio
- Local Runtime Ollama
- Local Runtime Vllm
- Weights Download Direct
- Weights Download Hf
- Cost tier
- Mixed
- llm
- open-weight
- commercial-friendly
- small
- long-context
- multimodal
- multilingual
- edge
- on-device
- laptop-friendly
- apache-licensed
- eu-based
- vision