Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · OpenAI

gpt-oss-20b

Model family: gpt-oss

Size

mid (21.0B params)

Context

131,072 tokens

Released

2025-08-04

Openness

open-weight

License

Apache License 2.0 (+ gpt-oss usage policy) · commercial: yes

Cost tier

mixed

Rating

4.0 ★ — An excellent on-device reasoning model — o3-mini-class quality on 16GB of memory, clean Apache 2.0, with tool use and adjustable reasoning. Text-only and naturally limited by its size, hence 4.0.

Modalities

text

Capabilities

chat, coding, function-calling, instruction-following, long-context, reasoning, tool-use

Access

local-runtime-llama-cpp, local-runtime-lm-studio, local-runtime-mlx, local-runtime-ollama, local-runtime-vllm, weights-download-hf

llm
open-weight
commercial-friendly
small
reasoning
on-device
self-hostable
us-based
apache-2-0
mixture-of-experts

Quick Take

OpenAI's on-device open model: o3-mini-class reasoning that runs locally on 16GB of memory, under clean Apache 2.0 — download it and run it on a good laptop.

Plain-English Description

gpt-oss-20b is the smaller of OpenAI's two open-weight models, built for local and on-device use. Where the 120b needs a datacenter GPU, the 20b runs on 16GB of memory — a high-end laptop or desktop — while delivering reasoning quality OpenAI compares to its o3-mini model. It's a mixture-of-experts design (21B total, ~3.6B active per token), which is how it stays light.

Like its larger sibling it's text-only and built for reasoning and agentic tasks: adjustable reasoning effort, full chain-of-thought, and native tool use (function calling, browsing, Python). The point is to put capable reasoning on hardware people already have, with no API, no per-token cost, and no data leaving the device.

For privacy-sensitive local applications, rapid prototyping, or embedding reasoning into a product without infrastructure spend, it's one of the stronger small open models — and the Apache 2.0 license makes it free to build on commercially.

Best For

On-device and local reasoning where data stays on the machine.
Privacy-first or offline applications with no API dependency.
Rapid prototyping and iteration without inference costs.
Embedding reasoning and tool use into products on consumer hardware.

Not For

The strongest reasoning — step up to gpt-oss-120b or a closed flagship.
Multimodal tasks — it's text-only.
Workloads needing the largest context or deepest knowledge.
Anyone wanting frontier quality from an on-device model.

License — Plain-English Summary

Apache 2.0 — unrestricted commercial use, modification, fine-tuning, and redistribution, no royalties or carve-outs; keep the notices. OpenAI's short "gpt-oss usage policy" covers acceptable use without restricting commercial deployment. Running locally, it keeps all data on-device — ideal for privacy-sensitive products. Among the cleanest licenses available for an on-device model.

How It Compares

Against gpt-oss-120b, the 20b trades capability for portability — laptop-class versus datacenter-GPU. Against Google's on-device open models like Gemma 4 E4B, gpt-oss-20b competes on reasoning and tool use under the same Apache 2.0 license, though Gemma adds multimodality. Against any cloud API, the difference isn't raw capability — it's that gpt-oss-20b runs entirely on your own device, free of per-token cost and data-routing concerns.

Cost

Self-hosted cost: $0.00 beyond compute
Notes: Free to self-host under Apache 2.0; runs locally via Ollama, LM Studio, llama.cpp, and similar. Adjustable reasoning effort (low / medium / high).

Hardware requirements

Min VRAM: 16 GB
Recommended VRAM: 24 GB
Runs on laptop: Yes
Notes: Runs on 16GB of memory — high-end laptops and desktops — making it a practical on-device reasoning model.

Comparable models

Commercial-use conditions

Apache 2.0 permits unrestricted commercial use, modification, fine-tuning, and redistribution. OpenAI attaches a short "gpt-oss usage policy" covering acceptable use; it doesn't restrict commercial deployment.