Google: Gemini 2.0 Pro Vision

Model Overview

Property    Value
Model ID    google/gemini-2.0-pro-vision
Name        Gemini 2.0 Pro Vision
Status      Stable
Released    2024-12-19

Description

A vision-optimized variant of Google's Gemini 2.0 Pro that handles text, image, audio, and video.

Specifications

Spec              Value
Context Window    1,000,000 tokens
Max Output        8,000 tokens
Modalities        text, image, audio, video

Pricing

Type      Price
Input     $1.25/1M tokens
Output    $5.00/1M tokens
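
As a rough illustration of how the per-token rates above translate into a per-request cost, the following sketch multiplies token counts by the listed prices. The function name `estimate_cost` is hypothetical, and the calculation assumes billing is strictly linear in tokens with no minimums, tiers, or per-image surcharges, which the source does not confirm.

```shell
#!/bin/sh
# Rates copied from the Pricing table, in USD per 1M tokens.
INPUT_PRICE_PER_M=1.25
OUTPUT_PRICE_PER_M=5.00

# estimate_cost INPUT_TOKENS OUTPUT_TOKENS   (hypothetical helper)
# Prints the estimated request cost in USD with six decimal places.
# Assumes purely linear per-token billing.
estimate_cost() {
  awk -v input_tokens="$1" -v output_tokens="$2" \
      -v input_rate="$INPUT_PRICE_PER_M" -v output_rate="$OUTPUT_PRICE_PER_M" \
      'BEGIN {
         printf "%.6f\n", (input_tokens * input_rate + output_tokens * output_rate) / 1000000
       }'
}

# Example: a 200,000-token prompt with a full 8,000-token response.
estimate_cost 200000 8000   # prints 0.290000
```

Under these assumptions, filling the entire 1,000,000-token context window costs $1.25 in input alone, so input usually dominates the cost for long-context requests.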

Capabilities

  • Text: Yes
  • Image: Yes
  • Audio: Yes
  • Video: Yes
  • Tool Use: Yes
  • JSON Mode: Yes

Key Features

  1. Multimodal Support - Text, images, audio, and video
  2. Large Context - Up to 1,000,000 tokens
  3. Tool Use - Supported
  4. JSON Mode - Supported
  5. Streaming - Output delivered as it is generated
  6. Cost Effective - $1.25 input and $5.00 output per 1M tokens
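
The JSON Mode and Streaming features listed above might be requested as in the sketch below. This page does not document the request fields for either feature, so `response_format` and `stream` are assumptions borrowed from the OpenAI-style chat completions convention that the `/v1/chat/completions` path suggests. The helper name `build_json_mode_payload` is hypothetical.

```shell
#!/bin/sh
# build_json_mode_payload PROMPT [STREAM]   (hypothetical helper)
# Prints a JSON request body. STREAM is "true" or "false" (default "false").
# ASSUMPTION: "response_format" and "stream" follow the OpenAI-style
# convention; they are not confirmed by this page.
# PROMPT must not contain double quotes or backslashes, because this
# sketch performs no JSON escaping.
build_json_mode_payload() {
  stream="${2:-false}"
  cat <<EOF
{
  "model": "google/gemini-2.0-pro-vision",
  "messages": [{"role": "user", "content": "$1"}],
  "response_format": {"type": "json_object"},
  "stream": $stream,
  "max_tokens": 512
}
EOF
}

# Print a non-streaming JSON-mode payload for inspection.
build_json_mode_payload "List three primary colors as a JSON array."
```

To send the payload, pipe it into curl with `-d @-` and the same headers as in the API usage example; for a streaming request, curl's `-N` option disables output buffering so chunks appear as they arrive.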

Best For

  • Advanced vision
  • Image analysis
  • Video understanding
  • Multimodal analysis

Data & Usage Policies

Policy              Status
Training Data       Not used for training
Prompt Retention    Does not retain prompts
Data Processing     Google Cloud privacy compliant

Status & Availability

  • Status: Stable
  • Free Tier: No
  • Provider: Google

API Usage Example

curl https://api.langmart.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/gemini-2.0-pro-vision",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 8000
  }'
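
Since this is a vision model, a request that includes an image is probably more representative than a text-only one. The sketch below assumes the endpoint accepts OpenAI-style content parts (`text` and `image_url`); the source shows only a plain string message, so that format is an assumption. The helper names and the `LANGMART_API_KEY` variable are hypothetical.

```shell
#!/bin/sh
API_URL="https://api.langmart.ai/v1/chat/completions"

# build_vision_payload IMAGE_URL PROMPT   (hypothetical helper)
# Prints a JSON request body with one text part and one image part.
# ASSUMPTION: OpenAI-style "image_url" content parts are accepted.
# Arguments must not contain double quotes or backslashes, because this
# sketch performs no JSON escaping.
build_vision_payload() {
  cat <<EOF
{
  "model": "google/gemini-2.0-pro-vision",
  "messages": [{
    "role": "user",
    "content": [
      {"type": "text", "text": "$2"},
      {"type": "image_url", "image_url": {"url": "$1"}}
    ]
  }],
  "max_tokens": 1024
}
EOF
}

# send_vision_request IMAGE_URL PROMPT   (hypothetical helper)
# Posts the payload; expects LANGMART_API_KEY (assumed name) in the environment.
send_vision_request() {
  build_vision_payload "$1" "$2" |
    curl -sS "$API_URL" \
      -H "Content-Type: application/json" \
      -H "Authorization: Bearer $LANGMART_API_KEY" \
      -d @-
}
```

A call would look like `send_vision_request "https://example.com/chart.png" "Summarize this chart."`, where the image URL is a placeholder.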

Related Models

  • google/gemini-3-pro-preview - Latest flagship
  • google/gemini-2.5-pro - Advanced 2.5 model
  • google/gemini-2.0-flash - Fast multimodal
  • google/gemma-3-27b-it - Open-source alternative

Source

Generated for LangMart AI Platform on 2025-12-28