Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Nous Research

4.0 ★ — Among the most respected names in open-source fine-tuning, with real research output (YaRN, DeMo, the Hermes family) and $65M in funding behind them. The half-point ding is that their work is all derivative — valuable, but dependent on upstream base models for foundational capability.

Type

ai-native-company

Country

Founded

2023

License posture

predominantly-open-weight

Website

https://nousresearch.com

derivative-author
us-based
ai-native-company
open-source
fine-tuning

Quick Take

A respected open-source AI research lab that specializes in high-quality fine-tunes of other people's foundation models — most notably the Hermes series built on Llama.

Who They Are

Nous Research is an open-source AI lab founded in 2023 by Jeffrey Quesnelle, Karan Malhotra, Teknium, and Shivani Mitra, headquartered in New York. Unlike Meta, Google, or OpenAI, they don't train foundation models from scratch. What they do is take existing open-weight foundations — historically the Llama family, and more recently Mistral and ByteDance's Seed models too — and fine-tune them into models that often outperform the original creator's own instruction-tuned versions on specific capabilities. Their Hermes series — now on version 4, with a recent 4.3 release built on a non-Llama base — is their flagship line, downloaded over 33 million times.

The lab has real financial and research credibility behind it. They've raised $65 million in funding, including a $50M Series A led by Paradigm in April 2025, with participation from Together AI, Distributed Global, North Island Ventures, and others. Their published research isn't limited to fine-tuning recipes — they've contributed the YaRN context-extension method (used by Meta and DeepSeek among others), the DeMo optimization paper co-authored with OpenAI's Diederik Kingma, and several technical reports on their Hermes training process. This isn't a hobbyist Discord server releasing weekend projects; it's a funded, researched, and published operation.

Their specific niche is post-training — the process of taking a pretrained model and tuning it for specific behaviors like instruction-following, function calling, roleplay, and structured output. Meta, Mistral, and other foundation model makers release their own instruction-tuned variants, but Nous's Hermes versions often trade blows with, or beat, the originals on specific tasks. For developers building on open-weight Llama models, Hermes is frequently the starting point instead of Meta's own Instruct release.

Model Philosophy

Nous leans hard into the "user steerability" direction of open-source AI. Their public positioning, visible in their model cards and technical reports, emphasizes that end users should have meaningful control over the models they run — guiding rules, roles, stylistic choices, and system-level behavior. In practice, this means Hermes models tend to be more willing to adopt strong personas, less heavy-handed with refusals, and more responsive to detailed system prompts than Meta's own Instruct versions.

They've also been early and active on decentralized training — their Psyche Network is an attempt to coordinate distributed GPU compute for model training across contributor hardware, using their DisTrO technology to reduce inter-GPU communication overhead. That bet has started to pay off: Hermes 4.3 (December 2025) was the first Hermes model trained on Psyche rather than a centralized cluster — a notable milestone, even if decentralized training at scale is still proving itself.

Pseudonymity is part of the culture. "Teknium," the Head of Post-Training, is publicly known only by that handle. That's worth flagging for businesses evaluating where their AI stack comes from — not as a red flag (the research is published, the funding is documented, the models are widely adopted and verified) but as a cultural fact that distinguishes Nous from a traditional corporate AI lab.

What To Know Before You Commit

Three practical considerations for a business considering a Nous Research model.

License inheritance matters. Nous doesn't train their own foundation models — they fine-tune someone else's. That means every Nous model's license is inherited from its base. A Hermes-3 Llama fine-tune is governed by Meta's Llama Community License, not by some Nous-specific license. Before using any Nous model commercially, read the base model's license. This isn't a gotcha — it's how the open-source ecosystem works — but it's a question people sometimes skip.

Their niche is fine-tune quality, not foundation capability. A Nous fine-tune of Llama will not outperform a Llama model at tasks Llama fundamentally can't do. If Llama 3.1 8B is too small for your use case, Hermes-3 Llama 3.1 8B is also too small for your use case. Nous's value-add is in how the model responds, follows instructions, and handles edge cases — not in raw capability ceiling.

Steerability cuts both ways. Hermes models being more responsive to system prompts is genuinely useful for many applications and genuinely riskier for others. If you're deploying a consumer-facing chatbot in a regulated industry, the more tightly aligned behavior of a first-party Instruct model (Meta's own Llama-3.1-8B-Instruct, for example) may be the safer default. If you're building something where you want more control over the model's voice and behavior, Nous's approach is a feature.

Original Models

This creator has no original models in the catalog yet.

Derivatives Authored

Hermes 4

Hermes 4.3 36B — full entry

Hermes 4 405B — full entry

Hermes 4 70B — full entry

Deephermes 3

DeepHermes 3 Mistral 24B — full entry

DeepHermes 3 8B — full entry

Hermes 3

Hermes 3 3B — full entry

Hermes 3 405B — full entry

Hermes 3 70B — full entry

Hermes 3 — Llama 3.1 8B — full entry