LLM·Dex
Open weightsAlibabaopentext

Qwen2.5-Coder-32B

Open-weight code specialist, frequently the top open option for self-hosted code completion.

Updated


Quick facts

Released
Nov 2024
Context
128K tokens
Output / 1M
Pricing not published
License
Apache-2.0

About Qwen2.5-Coder-32B

Qwen2.5-Coder-32B has been the consensus best open-weight code model since late 2024. Strong on FIM, broad language coverage, fast at the 32B size class, and Apache-2.0 licensed for clean commercial use.

Most serious self-hosted Cursor / Continue.dev / Aider stacks use Qwen Coder somewhere, typically as the inline-completion model paired with a closed flagship for chat steps.

Benchmarks

Published scores from Alibaba's model card or independent leaderboards. We do not publish numbers we cannot source, see methodology.

HumanEval
92.7
Python coding pass@1
MMLU
,
Broad academic knowledge
GPQA
,
Graduate-level reasoning
SWE-bench
,
Real software-engineering tasks
  • HumanEval92.7

Capabilities

Strengths

  • Top open-weight coder
  • Apache-2.0
  • FIM-native

Tracked weaknesses

  • Coding-focused, not a general chat model

Pricing

Per-million-token rates as published by Alibaba.

Per-token pricing not yet published for Qwen2.5-Coder-32B. Check the official provider site for current tiers.

Call Qwen2.5-Coder-32B from your code

Drop-in snippet for the Alibaba SDK. Set your API key in the environment and run.

typescript
import OpenAI from "openai";

const client = new OpenAI({
  // Use OPENAI_API_KEY for OpenAI, or your provider's key + baseURL.
  apiKey: process.env.OPENAI_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "qwen-2-5-coder-32b",
  messages: [
    { role: "user", content: "What's the time complexity of quicksort?" },
  ],
});

console.log(completion.choices[0].message.content);

Best for

Tasks where Qwen2.5-Coder-32B ranks among LLMDex's top picks.

Compare Qwen2.5-Coder-32B with…

Frequently asked

  • How much does Qwen2.5-Coder-32B cost?
    Alibaba has not published per-token API pricing for Qwen2.5-Coder-32B at the time of writing. Check the official site for current pricing tiers, or compare against alternative models on LLMDex.
  • What is Qwen2.5-Coder-32B's context window?
    Qwen2.5-Coder-32B supports a context window of 128K tokens.
  • Is Qwen2.5-Coder-32B open source?
    Qwen2.5-Coder-32B ships with open weights under the Apache-2.0 license. You can self-host it, fine-tune it, and (subject to the license terms) deploy it commercially.
  • When was Qwen2.5-Coder-32B released?
    Qwen2.5-Coder-32B was released on Nov 12, 2024 by Alibaba.
Friday digest

Intelligence, distilled weekly.

One short email every Friday, new model launches, leaderboard moves, and pricing drops. Curated by hand. Free, no spam.