Molmo2 8B
8Bby AllenAI
Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) as part of the Molmo2 family, supporting image, video, and multi-image understanding and grounding. It is based on Qwen3-8B and uses SigLIP 2 as its vision backbone, outperforming other open-weight, open-data models on short videos, counting, and captioning, while remaining competitive on long-video tasks.
Specifications
Technical details and pricing.
Frequently Asked Questions
What is Molmo2 8B good for?
Use Molmo2 8B for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does Molmo2 8B cost?
Pricing is based on usage. Current rates are $0.20/1M tokens for input and $0.20/1M tokens for output.
Can I try Molmo2 8B for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does Molmo2 8B support images or audio?
Molmo2 8B can understand images.
Similar Models
Other models you might want to explore.
Pricing, context, and capability data are sourced from OpenRouter.