Verify critical details (pricing, licensing, availability) with the model's source before making business decisions.

Catalog entry last reviewed 91 days ago.

Mistral Medium 3.1

Model family: mistral-medium

Context

131,072 tokens

Released

2025-08-12

Openness

closed-api

License

Mistral AI Terms of Service (Proprietary API) · commercial: yes

Cost tier

paid-api

Rating

3.5 ★ — Competitive capability and pricing for a hosted API, but the absence of independent benchmarks at the same depth as GPT or Claude coverage, combined with no open weights to inspect, makes due diligence harder than for Mistral's open-weight tier. Good for teams already in the Mistral ecosystem; less compelling as a first choice.

Modalities

text

Capabilities

chat, coding, instruction-following, long-context, multilingual, reasoning, tool-use

Access

api-first-party, api-third-party

Tags

llm, proprietary, closed-api, mid, long-context, multilingual, eu-based, coding, stem

Quick Take

Mistral's closed-API enterprise tier. It sits between Small 4 and Large 3 on capability and cost, with hybrid and on-prem deployment available but no published weights.

Plain-English Description

Mistral Medium 3 departs from the open-weight releases Mistral is best known for. Released May 7, 2025 and updated to Medium 3.1 on August 12, 2025, the Medium tier exists for enterprise customers who want Mistral-quality models through a managed API without the operational lift of self-hosting and without needing the headline capability of Mistral Large. It's the middle of what Mistral calls a three-tier enterprise strategy: Small (self-host or cheap API), Medium (managed API only, enterprise features), Large (frontier capability, either API or demanding self-host).

Mistral discloses little beyond the basics: a 128K-token (131,072) context window and text-only processing with no multimodal input. It has not published parameter counts or architectural details. The model is proprietary and closed-weight, and no open-source release is planned. At $0.40 per million input tokens and $2 per million output tokens, it is priced well below comparable capability tiers from U.S. competitors. Mistral's own pitch at launch was that Medium 3 delivers "frontier performance at or above 90% of Claude Sonnet 3.7", though as of April 2026 independent verification of that specific claim is thin. The Artificial Analysis Intelligence Index assigns Medium 3 a composite score in line with Mistral's positioning, but the granular breakdowns developers typically use for model selection (MMLU-Pro, HumanEval, SWE-bench) are sparse or missing.

The model's real differentiator is deployment flexibility at the enterprise tier. Medium 3.1 is available through Mistral's own API, AWS Bedrock, Azure AI Foundry, IBM WatsonX, and Google Cloud Vertex, which is the standard enterprise cloud distribution. Beyond that, Mistral offers hybrid and on-prem deployment arrangements for enterprise customers who need the model running in their own data center or VPC. The weights stay under Mistral's control, but inference can run on customer infrastructure. This posture is similar to Anthropic's AWS Bedrock and Google Cloud Vertex enterprise arrangements, and it is mostly about procurement and data residency rather than technical flexibility.

Best For

Teams already using Mistral's API who want more capability than Small 4 without jumping to Large 3's cost profile. Medium 3.1 is the natural upgrade path within the Mistral API ecosystem.
Enterprise workloads in financial services, energy, and healthcare where Mistral has existing beta-customer traction. Mistral has explicitly targeted and onboarded customers in these sectors; the model has been shaped by that feedback.
Hybrid and on-prem deployments that need Mistral-quality models behind a firewall. This is the tier where Mistral will negotiate bespoke deployment arrangements, including continuous pretraining on private data.
EU-compliant closed-API workloads. For European teams that want a closed API with French jurisdiction rather than a U.S.-based vendor.

Not For

Teams that value open weights for inspection, audit, or self-hosting flexibility. Mistral Medium 3.1 is closed. If openness is a requirement, reach for Mistral Small 4 or Large 3 instead.
Workloads needing multimodal input. Text-only. For vision tasks, Mistral Small 4 or Large 3 are the options.
Teams that rely heavily on independent benchmark coverage to select models. Medium 3.1 has much thinner third-party benchmark coverage than GPT-4-class or Claude-class models. Verifying performance for your specific workload requires running your own evals.
Applications that would benefit more from reasoning-mode or extended-thinking capability. Medium 3.1 doesn't expose a configurable reasoning parameter the way Mistral Small 4 does, nor does it have a dedicated reasoning variant. For reasoning-heavy work, Small 4 with reasoning_effort: high is often the smarter choice even at the lower capability tier.
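
Since third-party benchmark coverage is thin, a small in-house eval is one way to check the model against your own workload. The sketch below is a minimal exact-match harness, not a recommended methodology: `exact_match_accuracy`, `fake_model`, and the sample cases are hypothetical names and data introduced here for illustration, and `fake_model` is a stand-in for whatever client wrapper you use to call the API.

```python
from typing import Callable, Iterable, Tuple


def exact_match_accuracy(
    model: Callable[[str], str],
    cases: Iterable[Tuple[str, str]],
) -> float:
    """Fraction of cases whose normalized reply equals the expected answer."""
    total = 0
    correct = 0
    for prompt, expected in cases:
        total += 1
        reply = model(prompt)
        # Normalize whitespace and case so trivial differences don't count as misses.
        if reply.strip().lower() == expected.strip().lower():
            correct += 1
    if total == 0:
        raise ValueError("no eval cases supplied")
    return correct / total


# Hypothetical stand-in for a real API call; swap in your own client wrapper.
def fake_model(prompt: str) -> str:
    canned = {"2 + 2 = ?": "4", "Capital of France?": "Paris"}
    return canned.get(prompt, "")


# Hypothetical eval cases: (prompt, expected answer).
cases = [
    ("2 + 2 = ?", "4"),
    ("Capital of France?", "paris"),
    ("Largest planet?", "Jupiter"),
]

# The stand-in answers two of the three cases correctly.
print(exact_match_accuracy(fake_model, cases))
```

Exact match only suits tasks with short, unambiguous answers. Open-ended outputs need a different scorer, but the same loop structure applies.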

License — Plain-English Summary

Mistral Medium 3.1 is a proprietary closed-API model. You pay per token to call Mistral's hosted API (or one of the cloud partner deployments) and you get the right to use the model's outputs in your applications. You don't get the weights, you can't fine-tune the base model outside of Mistral's enterprise program, and you can't redistribute the model. For most API consumers this is a normal arrangement — the same terms govern GPT, Claude, and Gemini API usage.

How It Compares

vs. Mistral Small 4 — Small 4 is open-weight under Apache 2.0 and cheaper ($0.15 vs $0.40 input). Medium 3.1 is proprietary but is the tier where Mistral offers hybrid deployment and enterprise fine-tuning. For most teams, Small 4 is where you should start; Medium 3.1 is where you end up if Small 4's ceiling doesn't hold for your workload and you want Mistral-grade models without Large 3's self-hosting footprint.
vs. Mistral Large 3 Instruct — Large 3 is the capability tier and is open-weight. Medium 3.1 is less capable but has meaningfully more enterprise flexibility (hybrid, VPC, negotiated fine-tuning) and is easier to adopt for teams without GPU infrastructure.
vs. Claude Sonnet or GPT-4-class proprietary models — Similar tier, but Medium 3.1 is priced aggressively against U.S. competitors and carries EU-jurisdiction benefits that matter for some regulated deployments. Claude and GPT have meaningfully better independent benchmark coverage and mature tooling ecosystems.

Under the Hood

Mistral has not published detailed architectural or training specifications for the Medium tier. What's known from official documentation: a 128K-token (131,072) context window, text-only, API-only access, and a knowledge cutoff of June 30, 2025 for Medium 3.1. The composite Intelligence Index score from Artificial Analysis suggests a model competitive with Llama 4 Maverick and Cohere Command A in capability tier, which aligns roughly with Mistral's "90% of Claude Sonnet 3.7" positioning.
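
For planning long-context requests, the 131,072-token window listed in this entry can be turned into a simple budget check. This is a rough sketch under one assumption not confirmed by the entry: that the prompt and the generated reply share the same window. The function names are hypothetical, and token counts must come from the provider's own tokenizer, which is not reproduced here.

```python
CONTEXT_WINDOW_TOKENS = 131_072  # 128 * 1024, as listed in this entry


def max_prompt_tokens(
    reserved_output_tokens: int,
    context_window: int = CONTEXT_WINDOW_TOKENS,
) -> int:
    """Tokens left for the prompt after reserving room for the reply.

    Assumes prompt and reply share one window (an assumption, see lead-in).
    """
    if not 0 <= reserved_output_tokens <= context_window:
        raise ValueError("reserved_output_tokens must fit inside the window")
    return context_window - reserved_output_tokens


def fits_in_context(prompt_tokens: int, reserved_output_tokens: int) -> bool:
    """True if the prompt plus the reserved reply budget stays within the window."""
    return prompt_tokens <= max_prompt_tokens(reserved_output_tokens)


# Reserving 4,096 tokens for the reply leaves 126,976 for the prompt.
print(max_prompt_tokens(4_096))
print(fits_in_context(120_000, 4_096))
```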

No public version history or changelog is maintained beyond the 3.0 → 3.1 generation bump. This is unusual for an actively marketed enterprise model in 2026, given that OpenAI and Anthropic both publish detailed version notes, and it is a visibility gap that matters in enterprise procurement, where version governance is a line-item concern.

Cost

API input (per 1M tokens): $0.40
API output (per 1M tokens): $2.00
API providers: mistral, openrouter, bedrock, azure, watsonx
Notes: Mistral's hosted API is the primary access path. Also available on AWS Bedrock, Azure AI Foundry, IBM WatsonX, and Google Cloud Vertex. Enterprise customers can negotiate fine-tuning agreements and hybrid on-prem deployments with Mistral directly, but the model weights are never released.
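
To translate the per-token rates above into a budget figure, the arithmetic can be sketched as follows. The rates are the ones quoted in this entry and may be stale. The function name and the monthly workload are hypothetical, and the estimate ignores any volume discounts, cloud-partner markups, or negotiated enterprise pricing.

```python
from decimal import Decimal

# List prices quoted in this entry, in USD per 1M tokens (verify before relying on them).
INPUT_USD_PER_M = Decimal("0.40")
OUTPUT_USD_PER_M = Decimal("2.00")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated API charge in USD for the given token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) / TOKENS_PER_M * INPUT_USD_PER_M
    output_cost = Decimal(output_tokens) / TOKENS_PER_M * OUTPUT_USD_PER_M
    return input_cost + output_cost


# Hypothetical monthly workload: 50M input tokens and 10M output tokens.
# 50 * 0.40 + 10 * 2.00 = 40.00
monthly = estimate_cost_usd(50_000_000, 10_000_000)
print(f"${monthly:.2f}")
```

Output tokens cost five times as much as input tokens at these rates, so workloads with long generated replies are dominated by the output side of the bill.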

Pricing data is 91 days old. Verify with the source before relying on it.