Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →
DeepSeek-V3.2-Exp
Model family: deepseek-v3-2
The experimental V3.2 build that pioneered sparse attentionThe mechanism inside a Transformer that lets the model weigh which parts of the input matter most when processing each word. When you read "the cat sat on the mat," attention is how the model knows that "it" in a later sentence refers back to the cat, not the mat. Attention is what made modern language models possible. for cheaper long context; MIT-licensed and now superseded by stable V3.2 and V4.
Identity
- Creator
- DeepSeek
- Model family
- deepseek-v3-2
- Release date
- 2025-09-28
Technical specs
- Parameter count
- 671B
- Context window
- 131K tokens
- Modalities
- Text
- Primary capabilities
- Chat
- Long Context
- Reasoning
License
- License
- MIT License
- Commercial use
- Allowed
- Terms
- Modification ✓
- Redistribution ✓
- Attribution ✓
Access
- Openness
- Open Weight
- Access methods
- Local Runtime Vllm
- Weights Download Hf
- Cost tier
- Mixed
- llm
- open-weight
- commercial-friendly
- frontier
- long-context
- china-based