Google: Gemini 2.5 Flash-Lite
Model Overview
| Property | Value |
|---|---|
| Model ID | google/gemini-2.5-flash-lite |
| Name | Gemini 2.5 Flash-Lite |
| Provider | |
| Released | 2025 |
Description
Built for massive scale, Gemini 2.5 Flash-Lite balances cost and performance for high-throughput tasks. Optimized for efficiency without sacrificing multimodal capabilities.
Specifications
| Spec | Value |
|---|---|
| Context Window | 1,000,000 tokens |
| Max Completion | 8,192 tokens |
| Modalities | Text, Image |
Pricing
| Type | Price |
|---|---|
| Input | $0.10 per 1M tokens |
| Output | $0.40 per 1M tokens |
Capabilities
- Vision: Yes
- Tool Use: Yes
- JSON Mode: Yes
- Streaming: Yes
- High Throughput: Yes
Use Cases
High-volume batch processing, cost-sensitive applications, simple classification and extraction tasks.
Integration with LangMart
Gateway Support: Type 2 (Cloud), Type 3 (Self-hosted)
API Usage:
curl -X POST https://api.langmart.ai/v1/chat/completions \
-H "Authorization: Bearer sk-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "google/gemini-2.5-flash-lite",
"messages": [{"role": "user", "content": "Hello"}],
"max_tokens": 2048
}'
Related Models
- google/gemini-2.5-flash - Full-featured Flash
- google/gemini-2.0-flash-lite - Previous generation lite
Last Updated: December 28, 2025