Qwen2.5-Coder-32B
Open-weight code specialist, frequently the top open option for self-hosted code completion.
Updated
Quick facts
- Released
- Nov 2024
- Context
- 128K tokens
- Output / 1M
- Pricing not published
- License
- Apache-2.0
About Qwen2.5-Coder-32B
Qwen2.5-Coder-32B has been the consensus best open-weight code model since late 2024. Strong on FIM, broad language coverage, fast at the 32B size class, and Apache-2.0 licensed for clean commercial use.
Most serious self-hosted Cursor / Continue.dev / Aider stacks use Qwen Coder somewhere, typically as the inline-completion model paired with a closed flagship for chat steps.
Benchmarks
Published scores from Alibaba's model card or independent leaderboards. We do not publish numbers we cannot source, see methodology.
- HumanEval92.7
Capabilities
Strengths
- Top open-weight coder
- Apache-2.0
- FIM-native
Tracked weaknesses
- Coding-focused, not a general chat model
Pricing
Per-million-token rates as published by Alibaba.
Call Qwen2.5-Coder-32B from your code
Drop-in snippet for the Alibaba SDK. Set your API key in the environment and run.
import OpenAI from "openai";
const client = new OpenAI({
// Use OPENAI_API_KEY for OpenAI, or your provider's key + baseURL.
apiKey: process.env.OPENAI_API_KEY,
});
const completion = await client.chat.completions.create({
model: "qwen-2-5-coder-32b",
messages: [
{ role: "user", content: "What's the time complexity of quicksort?" },
],
});
console.log(completion.choices[0].message.content);Best for
Tasks where Qwen2.5-Coder-32B ranks among LLMDex's top picks.
- Best LLM for CodingDevelopers searching for the most capable model to write, edit, or refactor code in real codebases.See ranking
- Best LLM for Code CompletionInline autocomplete inside an IDE, latency, accuracy, and cost matter equally.See ranking
- Best LLM for PythonPython-specific coding tasks: scripts, data work, ML pipelines, scientific computing.See ranking
Compare Qwen2.5-Coder-32B with…
Frequently asked
How much does Qwen2.5-Coder-32B cost?
Alibaba has not published per-token API pricing for Qwen2.5-Coder-32B at the time of writing. Check the official site for current pricing tiers, or compare against alternative models on LLMDex.What is Qwen2.5-Coder-32B's context window?
Qwen2.5-Coder-32B supports a context window of 128K tokens.Is Qwen2.5-Coder-32B open source?
Qwen2.5-Coder-32B ships with open weights under the Apache-2.0 license. You can self-host it, fine-tune it, and (subject to the license terms) deploy it commercially.When was Qwen2.5-Coder-32B released?
Qwen2.5-Coder-32B was released on Nov 12, 2024 by Alibaba.
Intelligence, distilled weekly.
One short email every Friday, new model launches, leaderboard moves, and pricing drops. Curated by hand. Free, no spam.