Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · Meta · Llama 3.2 3B

Feature-frozen. The creator has frozen feature development on this model (critical fixes only).

Hermes 3 3B

fine-tune derivative of Llama 3.2 3B by Nous Research

Nous Research's post-training of Llama 3.2 3B into Hermes 3 — steerable instruction following in an edge-deployable size.

Size

small (3.0B params)

Context

131,072 tokens

Released

2024-09-24

Openness

open-weight

License

Llama 3.2 Community License (Nous fine-tune) · commercial: conditional

Cost tier

mixed

Rating

3.5 ★ — Useful Hermes steerability in an on-device size, but small and a generation old — 3.5.

Modalities

text

Capabilities

chat, instruction-following, long-context, multilingual, tool-use

Access

local-runtime-llama-cpp, local-runtime-ollama, local-runtime-vllm, weights-download-hf

llm
open-weight
small
on-device
fine-tune
us-based
llama-derivative

Quick Take

The smallest Hermes: a 3B fine-tune of Llama 3.2 3B for on-device and edge use, with Hermes's steerable instruction following.

Plain-English Description

Hermes 3 3B brings the Hermes tuning — steerable, low-refusal, system-prompt-responsive instruction following — to an edge-deployable 3-billion-parameter size built on Llama 3.2 3B. It's the model to reach for when you want Hermes behavior on a phone, a laptop, or constrained hardware.

At 3B it's a small model: useful for on-device assistants, classification, and simple agentic tasks, but not for hard reasoning. Note it's built on Llama 3.2 (not 3.1), so its license is the Llama 3.2 Community License.

License details below.

Best For

On-device and edge assistants wanting Hermes's steerability in a tiny footprint.
Simple instruction-following, classification, and routing at low cost.
Privacy-first local deployments.

Not For

Hard reasoning or complex tasks — step up to a larger Hermes.
Products near 700M MAU (Llama carve-out).
Multimodal tasks — text only.

License — Plain-English Summary

Two layers. Nous's open weights sit on Meta's Llama 3.2 3B, so the Llama 3.2 Community License governs (note: 3.2, not 3.1) — commercial use with "Built with Llama" attribution and the 700M-MAU carve-out. Confirm you're reading the Llama 3.2 terms specifically.

How It Compares

Against the existing Hermes 3 — Llama 3.1 8B, the 3B is smaller and lighter — the 8B is more capable where hardware allows. Against its base Llama 3.2 3B, it's Nous's steerable tuning. Against DeepHermes 3 8B, that DeepHermes variant adds toggleable reasoning at a larger size.

Cost

Self-hosted cost: $0.00 beyond compute
Notes: Free to self-host; the base model's license governs commercial use (see License).

Comparable models

Commercial-use conditions

Nous releases the Hermes weights openly, but the base is Meta's Llama 3.2, so Meta's Llama 3.2 Community License governs the model — including the clause requiring a separate Meta license if your product exceeds 700 million monthly active users.