Overview

GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English languages, vision, and audio understanding.

Model Information

Max Input
128K
Tokens
Input Price
$2.50
per 1M Tokens
Output price
$10.00
per 1M Tokens
Size
-
Billion Parameters
Release date
2024-08-06
Licence
Proprietary
Parameters
Unknown
Input Context Length
128,000
Tokens
Output Context Length
16,384
Tokens

Features & Capabilities

Web Access

Yes
Real-time access to current web information

Multimodal

Yes
Ability to process multiple data types (text, images, etc...)

API

Sample code and API for GPT-4o Search Preview
Create API Key