Groq: Qwen3 32B
Model Overview
| Property | Value |
|---|---|
| Model ID | groq/qwen/qwen3-32b |
| Name | Qwen3 32B |
| Provider | Groq / Qwen |
| Parameters | 32B |
Description
Alibaba's Qwen3 32 billion parameter model hosted on Groq infrastructure. Offers strong multilingual capabilities with fast inference. The model is optimized for code generation and programming tasks.
Specifications
| Spec | Value |
|---|---|
| Context Window | 131,072 tokens |
| Max Completion | 8,192 tokens |
| Inference Speed | ~400 tokens/sec |
Pricing
| Type | Price |
|---|---|
| Input | $0.05 per 1M tokens |
| Output | $0.20 per 1M tokens |
Capabilities
- Multilingual: Yes
- Fast Inference: Yes
- Tool Use: Yes
- Streaming: Yes
Use Cases
Multilingual applications, general-purpose chat, code generation.
Integration with LangMart
Gateway Support: Type 2 (Cloud), Type 3 (Self-hosted)
API Usage:
curl -X POST https://api.langmart.ai/v1/chat/completions \
-H "Authorization: Bearer sk-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "groq/qwen/qwen3-32b",
"messages": [{"role": "user", "content": "Hello"}],
"max_tokens": 4096
}'
Related Models
- groq/llama-3.3-70b-versatile - Alternative large model
Last Updated: December 28, 2025