Back to all LLMs
Google

Gemini 2.0 Flash-Lite

Overview

Gemini 2.0 Flash-Lite is a streamlined, cost-effective large language model optimized for speed and efficiency. It supports multimodal inputs—including text, images, audio, and video—and offers a 1 million token context window, making it suitable for tasks like summarization, content generation, and basic reasoning. Compared to its predecessor, Gemini 1.5 Flash, Flash-Lite delivers better quality at the same speed and cost, making it ideal for large-scale, latency-sensitive applications where affordability and responsiveness are key.

Model Information

Max Input
1M
Tokens
Input Price
-
per 1M Tokens
Output price
-
per 1M Tokens
Size
Billion Parameters
Release date
2025-02-25
Licence
Proprietary
Parameters
Unknown
Input Context Length
1,048,576
Tokens
Output Context Length
8,192
Tokens

Features & Capabilities

Web Access

Yes
Real-time access to current web information

Multimodal

Yes
Ability to process multiple data types (text, images, etc...)

API

Sample code and API for GPT-4o Search Preview
Create API Key