Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →
DeepSeek-R1-Zero
Model family: deepseek-r1
The pure-reinforcement-learning reasoning modelA model trained to "think through" problems step by step before answering, often by producing internal reasoning that's either shown or hidden from the user. Reasoning models trade speed for accuracy on hard problems — they're slower and more expensive per answer, but markedly better at math, logic, and complex analysis. OpenAI's o1 series and Mistral's Magistral are reasoning models. behind R1 — MIT-licensed and fascinating for research, but rougher than R1 for production use.
Identity
- Creator
- DeepSeek
- Model family
- deepseek-r1
- Release date
- 2025-01-19
Technical specs
- Parameter count
- 685B
- Context window
- 131K tokens
- Modalities
- Text
- Primary capabilities
- Coding
- Math
- Reasoning
License
- License
- MIT License
- Commercial use
- Allowed
- Terms
- Modification ✓
- Redistribution ✓
- Attribution ✓
Access
- Openness
- Open Weight
- Access methods
- Local Runtime Vllm
- Weights Download Hf
- Cost tier
- Mixed
- llm
- open-weight
- commercial-friendly
- frontier
- reasoning
- china-based