Verify critical details (pricing, licensing, availability) with the model's source before making business decisions.

Feature-frozen. The creator has frozen feature development on this model (critical fixes only).

Qwen2.5-Math-1.5B

Model family: qwen2-5-math

Qwen2.5's 1.5B math model, Apache 2.0 — notable as the base for DeepSeek-R1-Distill-Qwen-1.5B.

Listing Notes

Primarily cataloged as the base model for DeepSeek-R1-Distill-Qwen-1.5B. A small, math-focused Qwen2.5 checkpoint under Apache 2.0.

Base model: A model straight out of pretraining, before any fine-tuning for chat or specific tasks. Base models predict the next token but don't follow instructions well; they'll continue your prompt rather than respond to it. Most people never use base models directly; they use the instruct-tuned or chat versions built on top. Useful mostly for researchers and people doing their own fine-tuning.

Checkpoint: A specific saved version of a model at a particular point in training. When a creator releases "Llama 3.1 8B Instruct," they're releasing a checkpoint: a frozen snapshot of the model as it existed at the end of training. Most models ship only a single public checkpoint; some creators release multiple (base, instruct, reasoning variants of the same underlying model).
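Because a base model continues a prompt rather than answering it, a question is usually framed as text to be completed. The following is a minimal sketch of that idea using the Hugging Face `transformers` library, not an official usage recipe. The repo id `Qwen/Qwen2.5-Math-1.5B`, the exact window of 4,096 tokens, the "Problem/Solution" framing, and the helper names are all assumptions made for illustration; check them against the model's source.

```python
# Sketch only: names and constants below are illustrative assumptions.
MODEL_ID = "Qwen/Qwen2.5-Math-1.5B"  # assumed Hugging Face repo id
CONTEXT_WINDOW = 4096  # assumed exact value behind the listed ~4.1K tokens


def build_completion_prompt(question: str) -> str:
    """Frame a question as text to continue, not as an instruction."""
    return f"Problem: {question.strip()}\nSolution:"


def max_new_tokens_for(prompt_token_count: int, window: int = CONTEXT_WINDOW) -> int:
    """Tokens left for generation once the prompt is counted against the window."""
    return max(0, window - prompt_token_count)


def complete(question: str, max_new_tokens: int = 256) -> str:
    """Generate a continuation; downloads the weights on first use."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(build_completion_prompt(question), return_tensors="pt")
    prompt_len = inputs["input_ids"].shape[1]

    # Never ask for more tokens than the context window can still hold.
    budget = min(max_new_tokens, max_new_tokens_for(prompt_len))
    if budget <= 0:
        raise ValueError("Prompt already fills the context window.")

    output_ids = model.generate(**inputs, max_new_tokens=budget)
    # Drop the prompt tokens so only the continuation is returned.
    return tokenizer.decode(output_ids[0][prompt_len:], skip_special_tokens=True)


if __name__ == "__main__":
    print(complete("What is 12 * 7?"))
```

The generation budget is capped against the context window because the window is small; a long prompt leaves little room for a worked solution.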

Identity

Creator
Qwen
Model family
qwen2-5-math
Release date
2024-09-18

Technical specs

Parameter count
1.5B
Context window
4,096 tokens (about 4.1K)
Modalities
  • Text
Primary capabilities
  • Math
  • Reasoning

License

License
Apache License 2.0
Commercial use
  • Allowed
Terms
  • Modification
  • Redistribution
  • Attribution

Access

Openness
  • Open Weight
Access methods
  • Local runtime: llama.cpp
  • Local runtime: Ollama
  • Local runtime: vLLM
  • Weights download: Hugging Face
Cost tier
  • Mixed
