G

Groq: Qwen3 32B

Groq
Tools Streaming
131K
Context
$0.0500
Input /1M
$0.2000
Output /1M
8K
Max Output

Groq: Qwen3 32B

Model Overview

Property Value
Model ID groq/qwen/qwen3-32b
Name Qwen3 32B
Provider Groq / Qwen
Parameters 32B

Description

Alibaba's Qwen3 32 billion parameter model hosted on Groq infrastructure. Offers strong multilingual capabilities with fast inference. The model is optimized for code generation and programming tasks.

Specifications

Spec Value
Context Window 131,072 tokens
Max Completion 8,192 tokens
Inference Speed ~400 tokens/sec

Pricing

Type Price
Input $0.05 per 1M tokens
Output $0.20 per 1M tokens

Capabilities

  • Multilingual: Yes
  • Fast Inference: Yes
  • Tool Use: Yes
  • Streaming: Yes

Use Cases

Multilingual applications, general-purpose chat, code generation.

Integration with LangMart

Gateway Support: Type 2 (Cloud), Type 3 (Self-hosted)

API Usage:

curl -X POST https://api.langmart.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "groq/qwen/qwen3-32b",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 4096
  }'
  • groq/llama-3.3-70b-versatile - Alternative large model

Last Updated: December 28, 2025