Verify critical details — pricing, licensing, availability — with the model's source before business decisions. Full methodology →
Llama Guard 4 12B
Model family: safeguards
Meta's multimodalA model that can handle more than one type of input or output — typically text plus images, sometimes plus audio or video. "GPT-4 Vision" and "Llama 3.2 11B Vision" are multimodal models that accept both text and images. A text-only model is called "unimodal" but nobody uses that term; text-only is the assumed default. safety classifier for LLM applications. Screens prompts and responses (text and images) against the MLCommons hazards taxonomy. Replaces Llama Guard 3 8B and Llama Guard 3 11B-Vision with a single model.
Identity
- Creator
- Meta
- Model family
- safeguards
- Release date
- 2025-04-28
Technical specs
- Parameter count
- 12B
- Context window
- 131K tokens
- Modalities
- Image Input
- Text
- Primary capabilities
- Classification
- Multilingual
License
- License
- Llama 4 Community License
- Commercial use
- Conditional
Free for commercial use unless the licensee's product has 700 million monthly active users at the Llama 4 release date (April 5, 2025), in which case a separate Meta license is required. EU-based companies and individuals do NOT receive grant-of-rights for the multimodal components of Llama 4 models under Section 1(a) of the license.
- Terms
- Modification ✓
- Redistribution ✓
- Attribution ✓
Access
- Openness
- Open Weight
- Access methods
- Local Runtime Vllm
- Weights Download Direct
- Weights Download Hf
- Cost tier
- Mixed
- classifier
- safety
- open-weight
- commercial-friendly
- mid
- multimodal
- multilingual
- us-based