Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · DeepSeek

Feature-frozen. The creator has frozen feature development on this model (critical fixes only).

DeepSeek-R1

Model family: deepseek-r1

Size

frontier (685.0B params)

Context

131,072 tokens

Released

2025-01-19

Openness

open-weight

License

MIT License · commercial: yes

Cost tier

mixed

Rating

4.0 ★ — A landmark open reasoning model — MIT, distillable, and the release that reset expectations for what cheap open weights could do. Now overtaken by V4 for frontier work, but still strong and unusually accessible via its distills.

Modalities

text

Capabilities

chat, coding, instruction-following, math, reasoning

Access

api-third-party, local-runtime-llama-cpp, local-runtime-vllm, weights-download-hf

llm
open-weight
commercial-friendly
frontier
reasoning
math
china-based
mixture-of-experts
distillable

Quick Take

The MIT-licensed reasoning model that put DeepSeek on the map — matched OpenAI's best at a fraction of the cost, and ships in small distilled versions you can run on a laptop.

Plain-English Description

DeepSeek-R1, released in January 2025, is the model that made DeepSeek a household name in AI. It's a "reasoning" model — one trained to think step by step before answering, which makes it strong at math, code, and multi-step logic. What stunned the industry wasn't only that R1 matched OpenAI's o1 on hard reasoning tasks; it was that DeepSeek built it on comparatively modest computing resources and then gave the weights away for free. The release triggered a sharp market reaction and set off a wave of open-weight releases across the Chinese AI sector.

R1 was trained with an unusual recipe: instead of the standard approach of teaching the model with large labeled datasets first, DeepSeek applied reinforcement learning directly and let reasoning behaviors emerge. The current version, R1-0528, deepened that reasoning further — its score on the AIME 2025 math exam jumped from 70% to 87.5% between versions.

The full R1 is a 685-billion-parameter model that needs serious hardware, but DeepSeek also released six "distilled" versions — smaller models (built on Llama and Qwen) trained to imitate R1's reasoning. Distilling is like having a brilliant professor train a sharp student: the student is far smaller and cheaper to run but inherits much of the reasoning skill. The 7B and 8B distills run on a single consumer GPU, which is how most people actually use R1 today.

Best For

Math, logic, and step-by-step reasoning tasks, especially where you want to inspect the model's chain of thought.
Running a capable reasoning model locally and cheaply via the small distilled variants.
Fine-tuning and distillation projects — DeepSeek explicitly permits both, and R1's reasoning traces are valuable training material.
Educational and research use where the open RL-trained recipe is itself the point of interest.

Not For

Frontier production work today — DeepSeek-V4-Pro and DeepSeek-V4-Flash have overtaken R1 on capability and context length.
Multimodal tasks — R1 is text-only.
Anyone relying on DeepSeek's hosted service in a regulated or privacy-sensitive setting. R1 is the specific model whose hosted app drew bans and restrictions from multiple governments over data routing to China; if that's a concern, run the open weights yourself rather than using the hosted API.
General-purpose chat where a non-reasoning model would be faster and cheaper.

License — Plain-English Summary

R1's weights are MIT-licensed, and DeepSeek goes out of its way to spell out that commercial use and distillation of the R1 series are explicitly allowed — including the base and chat variants. That makes R1 one of the most legally unencumbered reasoning models available; you can build on it, ship products with it, and even train your own models from its outputs. The recurring DeepSeek point applies with extra force here: R1's hosted app is the one governments restricted over data-sovereignty concerns, but those concerns attach to the hosted service, not to the open weights you run yourself.

How It Compares

Against the V4 family, R1 is the previous generation — V4 is smarter, handles far longer context, and is the model to choose for new frontier work. Against DeepSeek-V3.2, R1 is the dedicated reasoning specialist where V3.2 is more general-purpose (though V3.2-Speciale blurred that line). Against OpenAI's o1/o3 reasoning models, R1 reached comparable quality on math and coding at a tiny fraction of the cost and as open weights — the headline contrast that made it famous. And uniquely in DeepSeek's lineup, R1's distilled variants give it a genuine accessibility story: there's a version small enough for almost any hardware budget.

Cost

Self-hosted cost: $0.00 beyond compute
Notes: R1 was served first-party behind the deepseek-reasoner endpoint at roughly $0.55 per million input and $2.19 per million output tokens; that endpoint is now a compatibility alias that points to V4 Flash's thinking mode and retires 2026-07-24. R1's open weights remain freely downloadable under MIT and are served by third-party hosts. Treat it now mainly as a self-host or third-party option.

Hardware requirements

Min VRAM: 16 GB
Recommended VRAM: 384 GB
Runs on laptop: Yes
Notes: The full 685B model needs a multi-GPU cluster. But the distilled variants span the range — the Qwen-7B and Llama-8B distills run on a single consumer GPU or a capable laptop, which is how most people actually run "R1."