All Models
olmOCR-2-7B
7Bby AllenAI
Allen AI's 7B OCR model fine-tuned from Qwen2.5-VL-7B on curated academic papers and technical documentation. Supports 128K context and extracts structured text from PDFs and scanned documents with high fidelity.
Context Window128,000 tokens
Parameters7B
LicenseQwen 2.5
ModalitiesText, Image
Specifications
Technical details and pricing.
ProviderAllenAI
Context Window128,000 tokens
Release DateOct 1, 2025
ModalitiesText, Image → Text
CapabilitiesOCR, Document Parsing, Vision
LicenseQwen 2.5
Frequently Asked Questions
What is olmOCR-2-7B?
Allen AI's 7B OCR model fine-tuned from Qwen2.5-VL-7B on curated academic papers and technical documentation. Supports 128K context and extracts structured text from PDFs and scanned documents with high fidelity.
What input formats does olmOCR-2-7B support?
olmOCR-2-7B accepts text, image as input and produces text output.
What is the context length of olmOCR-2-7B?
olmOCR-2-7B supports up to 128,000 tokens of context.
Is olmOCR-2-7B open source?
olmOCR-2-7B is available under the Qwen 2.5 license.
Specifications are based on publicly available model documentation.