Google: Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite

Released Jul 22, 20251,048,576 context

$0.10/M input tokens$0.40/M output tokens$0.30/M audio tokens

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

OpenRouter

Product

Chat
Rankings
Apps
Models
Providers
Pricing
Enterprise
Labs

Company

About
Announcements
CareersHiring
Privacy
Terms of Service
Support
State of AI
Works With OR

Developer

Documentation
API Reference
SDK
Status

Connect

Discord
GitHub
LinkedIn
X
YouTube

Google: Gemini 2.5 Flash Lite