Llama Guard 4 12B
12Bby Meta
Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM inputs (prompt classification) and in LLM responses (response classification). It acts as an LLM—generating text in its output that indicates whether a given prompt or response is safe or unsafe, and if unsafe, it also lists the content categories violated. Llama Guard 4 was aligned to safeguard against the standardized MLCommons hazards taxonomy and designed to support multimodal Llama 4 capabilities. Specifically, it combines features from previous Llama Guard models, providing content moderation for English and multiple supported languages, along with enhanced capabilities to handle mixed text-and-image prompts, including multiple images. Additionally, Llama Guard 4 is integrated into the Llama Moderations API, extending robust safety classification to text and images.
Specifications
Technical details and pricing.
Frequently Asked Questions
What is Llama Guard 4 12B good for?
Use Llama Guard 4 12B for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does Llama Guard 4 12B cost?
Pricing is based on usage. Current rates are $0.18/1M tokens for input and $0.18/1M tokens for output.
Can I try Llama Guard 4 12B for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does Llama Guard 4 12B support images or audio?
Llama Guard 4 12B can understand images.
Similar Models
Other models you might want to explore.
Pricing, context, and capability data are sourced from OpenRouter.