Back to all LLMs
xAI

Grok-2 Vision

Overview

Grok-2 Vision is xAI’s state-of-the-art multimodal AI model, designed to process and understand both text and images. It excels in tasks such as object recognition, style analysis, and complex visual reasoning. With enhanced instruction-following and multilingual support, Grok-2 Vision is well-suited for applications requiring sophisticated visual analysis and creative design.

Model Information

Max Input
32,768
Tokens
Input Price
$2.00
per 1M Tokens
Output price
$10.00
per 1M Tokens
Size
-
Billion Parameters
Release date
2024-12-15
Licence
Proprietary
Parameters
Unknown
Input Context Length
32,768
Tokens
Output Context Length
32,768
Tokens

Features & Capabilities

Web Access

Yes
Real-time access to current web information

Multimodal

Yes
Ability to process multiple data types (text, images, etc...)

API

Sample code and API for GPT-4o Search Preview
Create API Key