← Back to hard AIs

Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →

Models · Meta

Llama Guard 4 12B

Model family: safeguards

Meta's multimodalA model that can handle more than one type of input or output — typically text plus images, sometimes plus audio or video. "GPT-4 Vision" and "Llama 3.2 11B Vision" are multimodal models that accept both text and images. A text-only model is called "unimodal" but nobody uses that term; text-only is the assumed default. safety classifier for LLM applications. Screens prompts and responses (text and images) against the MLCommons hazards taxonomy. Replaces Llama Guard 3 8B and Llama Guard 3 11B-Vision with a single model.

Identity

Creator
Meta
Model family
safeguards
Release date
2025-04-28

Technical specs

Parameter count
12B
Context window
131K tokens
Modalities
  • Image Input
  • Text
Primary capabilities
  • Classification
  • Multilingual

License

License
Llama 4 Community License
Commercial use
  • Conditional

Free for commercial use unless the licensee's product has 700 million monthly active users at the Llama 4 release date (April 5, 2025), in which case a separate Meta license is required. EU-based companies and individuals do NOT receive grant-of-rights for the multimodal components of Llama 4 models under Section 1(a) of the license.

Terms
  • Modification
  • Redistribution
  • Attribution

Access

Openness
  • Open Weight
Access methods
  • Local Runtime Vllm
  • Weights Download Direct
  • Weights Download Hf
Cost tier
  • Mixed

Full model card →