Groq: Qwen3 32B

Model Overview

Property	Value
Model ID	`groq/qwen/qwen3-32b`
Name	Qwen3 32B
Provider	Groq / Qwen
Parameters	32B

Description

Alibaba's Qwen3 32-billion-parameter model, hosted on Groq infrastructure. It offers strong multilingual capabilities and fast inference, and is also well suited to code generation and programming tasks.

Specifications

Spec	Value
Context Window	131,072 tokens
Max Completion	8,192 tokens
Inference Speed	~400 tokens/sec
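The limits above can be turned into a simple pre-flight check. The following is a minimal sketch, not part of any official SDK: the helper names are hypothetical, and it assumes that prompt and completion tokens share the 131,072-token context window.

```python
# Values taken from the Specifications table.
CONTEXT_WINDOW = 131_072   # tokens
MAX_COMPLETION = 8_192     # tokens
TOKENS_PER_SEC = 400       # approximate inference speed


def allowed_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Clamp a requested max_tokens to the completion cap and remaining context.

    Assumption: prompt and completion tokens count against the same window.
    """
    if prompt_tokens < 0 or requested < 0:
        raise ValueError("token counts must be non-negative")
    remaining = max(CONTEXT_WINDOW - prompt_tokens, 0)
    return min(requested, MAX_COMPLETION, remaining)


def estimated_generation_seconds(completion_tokens: int) -> float:
    """Rough generation time at the quoted ~400 tokens/sec (excludes network latency)."""
    return completion_tokens / TOKENS_PER_SEC
```

For example, a request with a 130,000-token prompt leaves room for only 1,072 completion tokens, and a full 8,192-token completion would take roughly 20 seconds at the quoted speed.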

Pricing

Type	Price
Input	$0.05 per 1M tokens
Output	$0.20 per 1M tokens
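As a worked example of the pricing arithmetic, the sketch below estimates the cost of a single request from the two rates listed above. The function name is hypothetical, and the calculation assumes billing is strictly linear in token count with no minimums or rounding.

```python
# Rates from the Pricing table, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.05
OUTPUT_PRICE_PER_M = 0.20


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD, assuming linear per-token billing."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens costs about $0.0009: $0.0005 for input plus $0.0004 for output.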

Capabilities

Multilingual: Yes
Fast Inference: Yes
Tool Use: Yes
Streaming: Yes

Use Cases

Multilingual applications, general-purpose chat, code generation.

Integration with LangMart

Gateway Support: Type 2 (Cloud), Type 3 (Self-hosted)

API Usage:

curl -X POST https://api.langmart.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "groq/qwen/qwen3-32b",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 4096
  }'
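The same request can be sketched in Python using only the standard library. The URL, headers, and body mirror the curl command above. The helper names are hypothetical, and the response handling assumes an OpenAI-style response shape (`choices[0].message.content`), which this page does not confirm.

```python
import json
import urllib.request

API_URL = "https://api.langmart.ai/v1/chat/completions"
MODEL_ID = "groq/qwen/qwen3-32b"


def build_request(api_key: str, prompt: str, max_tokens: int = 4096) -> urllib.request.Request:
    """Build the POST request shown in the curl example, without sending it."""
    body = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> str:
    """Send the request and return the reply text.

    Assumption: the response follows the OpenAI-style chat completion shape.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    return payload["choices"][0]["message"]["content"]
```

Calling `send(build_request("sk-your-api-key", "Hello"))` would issue the request. Keeping construction separate from sending makes the payload easy to inspect or test without network access.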

Related Models

`groq/llama-3.3-70b-versatile`: Alternative large model

Last Updated: December 28, 2025
