AllenAI Models
AllenAI logo

Molmo2 8B

8B

by AllenAI

Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) as part of the Molmo2 family, supporting image, video, and multi-image understanding and grounding. It is based on Qwen3-8B and uses SigLIP 2 as its vision backbone, outperforming other open-weight, open-data models on short videos, counting, and captioning, while remaining competitive on long-video tasks.

Input Price$0.20/1M tokens
Output Price$0.20/1M tokens
Context Window36,864 tokens
Modalitiestext, image, video

Specifications

Technical details and pricing.

ProviderAllenAI
Context Window36,864 tokens
Release DateJan 9, 2026
ModalitiesText, Image, Video β†’ Text
CapabilitiesVision

Frequently Asked Questions

What is Molmo2 8B good for?

Use Molmo2 8B for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Molmo2 8B cost?

Pricing is based on usage. Current rates are $0.20/1M tokens for input and $0.20/1M tokens for output.

Can I try Molmo2 8B for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Molmo2 8B support images or audio?

Molmo2 8B can understand images.

Pricing, context, and capability data are sourced from OpenRouter.