Groq: Claude 3.5 Sonnet
Model Overview
| Property | Value |
|---|---|
| Model ID | groq/claude-3-5-sonnet-20241022 |
| Name | Claude 3.5 Sonnet |
| Parameters | Unknown |
Description
Model Overview
| Property | Value |
|---|---|
| Model ID | groq/claude-3-5-sonnet-20241022 |
| Name | Claude 3. This model supports multimodal capabilities including vision and image understanding. It features advanced reasoning capabilities for complex problem-solving tasks. |
Specifications
| Spec | Value |
|---|---|
| Context Window | 200K tokens |
| Max Completion | 4K tokens |
| Inference Speed | 200 tokens/second |
Pricing
| Type | Price |
|---|---|
| Input | $3.00 per 1M tokens |
| Output | $15.00 per 1M tokens |
Capabilities
- Fast inference engine (Groq LPU)
- Cost-effective token processing
- Reliable production performance
- Streaming support
Limitations
- 200K token context window
- Maximum completion tokens: 4K
- No image generation (inference only)
Performance
Groq specializes in rapid inference with industry-leading token throughput. Typical use cases include:
- Real-time chat applications
- Batch processing with guaranteed latency
- High-volume inference workloads
- Cost-sensitive deployments
Best Practices
- Token Optimization: Craft prompts to minimize token usage while maintaining quality
- Streaming: Use streaming responses for real-time applications
- Batch Processing: Leverage high TPM limits for batch inference
- Context Management: Utilize full context window for complex tasks
Rate Limits
- 30000 TPM (Tokens Per Minute)
- Optimized for high-throughput inference
Features
- High-speed token generation (200 tokens/sec)
- 200K token context window
- Suitable for: Advanced reasoning, complex tasks, creative writing
Integration
Use the standard OpenAI-compatible API endpoint:
curl -X POST https://api.langmart.ai/v1/chat/completions \
-H "Authorization: Bearer $GROQ_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "groq/claude-3-5-sonnet-20241022",
"messages": [
{"role": "user", "content": "Hello!"}
]
}'
Resources
Last updated: December 2025 Source: Groq Official Documentation