Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →
Mistral OCR
Model family: mistral-other
Mistral's document-understanding API. Extracts markdown + HTML tables from PDFs, images, and handwriting at $2/1,000 pages ($1 batch). 74% win rate over OCR 2 as of December 2025, undercuts AWS/Google/Azure on price.
Listing Notes
Current version is Mistral OCR 3 (mistral-ocr-2512), released December 18, 2025 — the first version of this product that genuinely competes on pricing with the hyperscaler document-AI services. Where AWS Textract charges ~$65/1,000 pages for forms-and-tables extraction and Google Document AI charges $30–45/1,000, Mistral OCR 3 is $2/1,000 pages standard or $1/1,000 via the Batch API. The trade-off is that this is a proprietary closed API without SOC 2 / HIPAA / FedRAMP compliance attestations published, and without custom model fine-tuning — so for regulated industries that need a compliance paper trail, AWS/Google/Azure still win on procurement posture even when Mistral wins on price.
What Mistral OCR 3 does well: handwriting recognition (88.9% accuracy per independent benchmarks, versus Azure's 78.2% and DeepSeek's 57.2%), complex table reconstruction with HTML output preserving row/column spans, mathematical expressions in LaTeX, and multilingual document parsing across 100+ languages. Independent reviews note it's weaker on extremely complex multi-column layouts (magazine-style) where it sometimes imposes table structure on columnar text. For financial data extraction specifically, human-in-the-loop review is still recommended — structural fidelity can mask individual-digit OCR errors that look correct but aren't.
Self-hosting: Mistral offers enterprise self-hosting for organizations with classified or highly sensitive data needs, but this is a separate commercial track — not an open-weightA model where the trained weights are freely downloadable — you can run it yourself without contacting the creator. Llama, Mistral, Qwen, and Gemma are open-weight. Open-weight does not mean open-source: the training data and code often stay private. The license still governs what you can do with the weights, including whether you can use them commercially. release. Contact Mistral sales if that's your deployment shape.
Identity
- Creator
- Mistral AI
- Model family
- mistral-other
- Release date
- 2025-03-05
- Version history
- 2503 · 2025-03-06T00:00:00.000Z — Initial Mistral OCR release. Document understanding model with OCR, layout parsing, and structured output extraction.
- 2505 · 2025-05-27T00:00:00.000Z — Mistral OCR 2 (mistral-ocr-2505). Added annotations and bbox extraction capabilities.
- 2512 · 2025-12-18T00:00:00.000Z — Mistral OCR 3 (mistral-ocr-2512). Current version. 74% win rate over OCR 2 on forms, scanned documents, complex tables, and handwriting. Introduced batch-API pricing at $1 per 1,000 pages (50% discount from $2 standard). Backward-compatible with OCR 2.
Technical specs
- Parameter count
- Mistral has not published parameter counts for the OCR line, but describes OCR 3 as "significantly smaller" than peer document-AI models. The architecture is a vision-language pipeline that produces structured markdown and HTML tables.
- Context window
- 8.2K tokens
- Modalities
- Image Input
- Text
- Primary capabilities
- Multilingual
- Vision
License
- License
- Proprietary (Mistral API Terms)
- Commercial use
- Allowed
- Terms
- Modification ✗
- Redistribution ✗
- Attribution ✗
Access
- Openness
- Closed Api
- Access methods
- Api First Party
- Api Third Party
- Cost tier
- Paid Api
- Cost details
- $2 per 1,000 pages for standard API, or $1 per 1,000 pages via the Batch API (50% discount). For context, this undercuts AWS Textract's forms-and- tables pricing by roughly 97%, Google Document AI by 93%, and Azure Form Recognizer by 50–75%. For organizations with strict data-residency or classification requirements, self-hosting is available on request — not a general open-weight offering.
- ocr
- document-ai
- vision
- multilingual
- proprietary
- api-only
- eu-based
- specialist