Google: Gemini Multimodal Live

Model Overview

Property	Value
Model ID	`google/gemini-multimodal-live`
Name	Gemini Multimodal Live
Status	Experimental
Released	2025-11-15

Description

Real-time streaming multimodal model.

Description

Google: Gemini Multimodal Live is a language model provided by the provider. This model offers advanced capabilities for natural language processing tasks.

Specifications

Spec	Value
Context Window	100,000 tokens
Max Output	8,000 tokens
Modalities	text, image, audio, video, stream

Pricing

Type	Price
Input	$0.5/1M tokens
Output	$1.5/1M tokens

Capabilities

Text: Yes
Image: Yes
Audio: Yes
Video: Yes
Tool Use: Yes
JSON Mode: Yes

Key Features

Multimodal Support - Text, images, audio, and video
Large Context - Up to 100,000 tokens
Tool Use - Supported
JSON Mode - Supported
Streaming - Real-time generation
Cost Effective - Optimized pricing

Best For

Live streaming
Real-time analysis
Interactive applications
Live transcription

Data & Usage Policies

Policy	Status
Training Data	Not used for training
Prompt Retention	Does not retain prompts
Data Processing	Google Cloud privacy compliant

Status & Availability

Status: EXPERIMENTAL
Free Tier: No
Provider: Google

API Usage Example

curl https://api.langmart.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/gemini-multimodal-live",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 8000
  }'

google/gemini-3-pro-preview - Latest flagship
google/gemini-2.5-pro - Advanced 2.5 model
google/gemini-2.0-flash - Fast multimodal
google/gemma-3-27b-it - Open-source alternative

Source

Generated for LangMart AI Platform on 2025-12-28

Google: Gemini Multimodal Live

Google: Gemini Multimodal Live

Model Overview

Description

Description

Specifications

Pricing

Capabilities

Key Features

Best For

Data & Usage Policies

Status & Availability

API Usage Example

Related Models

Source