Back to all LLMs

Google
Gemini 2.0 Flash-Lite
Overview
Gemini 2.0 Flash-Lite is a streamlined, cost-effective large language model optimized for speed and efficiency. It supports multimodal inputs—including text, images, audio, and video—and offers a 1 million token context window, making it suitable for tasks like summarization, content generation, and basic reasoning. Compared to its predecessor, Gemini 1.5 Flash, Flash-Lite delivers better quality at the same speed and cost, making it ideal for large-scale, latency-sensitive applications where affordability and responsiveness are key.
Model Information
Max Input
1M
Tokens
Input Price
-
per 1M Tokens
Output price
-
per 1M Tokens
Size
Billion Parameters
Release date
2025-02-25
Licence
Proprietary
Parameters
Unknown
Input Context Length
1,048,576
Tokens
Output Context Length
8,192
Tokens
Features & Capabilities
Web Access
Yes
Real-time access to current web information
Multimodal
Yes
Ability to process multiple data types (text, images, etc...)
API
Sample code and API for GPT-4o Search Preview