Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Voxtral Mini 3B

Model family: voxtral

Edge-deployable Voxtral — 3B sibling of Voxtral Small 24B with the same speech-understanding architecture at a smaller scale. Apache 2.0.

Listing Notes

This is the edge-deployment member of the Voxtral family — a 3B-parameter speech-understanding model that shares Voxtral Small 24B's architectural approach at a smaller scale. Unlike the transcription-optimized Voxtral Mini Transcribe V2 and the streaming-optimized Voxtral Mini 4B Realtime, this model preserves the full speech-understanding capability (audio Q&A, summarization, audio-grounded reasoning) in a footprint small enough for edge deployment. Use this when you need on-device speech understanding — laptop-class devices, private cloud deployments with limited GPU budget — rather than pure transcription. As with Voxtral Small 24B, the model can act as a drop-in replacement for its corresponding text-only Mistral base (Ministral 3B in this case) if you need a text-only deployment without maintaining separate checkpoints.

Identity

Creator: Mistral AI
Model family: voxtral
Release date: 2025-07-14

Technical specs

Parameter count

Context window

33K tokens

Modalities

Audio Input
Text

Primary capabilities

Multilingual
Speech To Text
Summarization

License

License

Apache 2.0

Commercial use

Allowed

Terms

Modification ✓
Redistribution ✓
Attribution ✓

Access

Openness

Open Weight

Access methods

Local Runtime Vllm
Weights Download Hf

Cost tier

Self Hosted Only

Sources

Full model card →

audio
speech-to-text
speech-understanding
audio-qa
multilingual
open-weight
commercial-friendly
edge
laptop-friendly
apache-licensed
eu-based