Gemini 2.5 Flash Lite

by Google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e.

Input Price: $0.10/1M tokens
Output Price: $0.40/1M tokens
Intelligence: 17.4
Coding: 9.5

Specifications

Technical details and pricing.

Provider: Google
Context Window: 1,048,576 tokens
Release Date: Jun 17, 2025
Modalities: Text, Image, File, Audio, Video → Text
Capabilities: Vision, Audio Input

Benchmarks

12 benchmark scores from Artificial Analysis.

GPQA: 62.5%
MMLU Pro: 75.9%
HLE: 6.4%
LiveCodeBench: 59.3%
MATH 500: 96.9%
AIME 2025: 53.3%
AIME: 70.3%
SciCode: 19.3%
LCR: 51.3%
IFBench: 49.9%
Tau2: 18.4%
TerminalBench Hard: 4.5%

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Frequently Asked Questions

What is Gemini 2.5 Flash Lite good for?

Use Gemini 2.5 Flash Lite for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Gemini 2.5 Flash Lite cost?

Pricing is based on usage. Current rates are $0.10/1M tokens for input and $0.40/1M tokens for output.
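
As a rough illustration of how these per-token rates translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost_usd` is hypothetical, and the calculation assumes simple linear billing at the two rates quoted above; it does not account for any provider-specific adjustments such as discounts or minimum charges.

```python
from decimal import Decimal

# Rates quoted on this page, in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("0.10")
OUTPUT_USD_PER_MILLION = Decimal("0.40")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimate the USD cost of a single request.

    Assumes plain linear pricing: tokens / 1M * rate, summed over
    input and output.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) / TOKENS_PER_MILLION * INPUT_USD_PER_MILLION
    output_cost = Decimal(output_tokens) / TOKENS_PER_MILLION * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a long prompt with a short answer.
    print(estimate_cost_usd(200_000, 5_000))
```

Under these assumptions, a request with 200,000 input tokens and 5,000 output tokens comes to about $0.022, with the input side accounting for most of it.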

Can I try Gemini 2.5 Flash Lite for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Gemini 2.5 Flash Lite support images or audio?

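Yes. According to the specifications above, Gemini 2.5 Flash Lite accepts image and audio input, as well as text, file, and video input, and produces text output.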

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.