Google

Gemini 2.0 Flash-Lite

Chat

Overview

Gemini 2.0 Flash-Lite is a streamlined, cost-effective large language model optimized for speed and efficiency. It supports multimodal inputs—including text, images, audio, and video—and offers a 1 million token context window, making it suitable for tasks like summarization, content generation, and basic reasoning. Compared to its predecessor, Gemini 1.5 Flash, Flash-Lite delivers better quality at the same speed and cost, making it ideal for large-scale, latency-sensitive applications where affordability and responsiveness are key.

Model Information

Max Input

Tokens

Input Price

per 1M Tokens

Output price

per 1M Tokens

Size

Billion Parameters

Release date

2025-02-25

Licence

Proprietary

Parameters

Unknown

Input Context Length

1,048,576

Tokens

Output Context Length

8,192

Tokens

Features & Capabilities

Web Access

Yes

Real-time access to current web information

Multimodal

Yes

Ability to process multiple data types (text, images, etc...)

API

Sample code and API for GPT-4o Search Preview

Create API Key