LLM·Dex
Open weightsAlibabasmalltext

Qwen2.5-7B

Small Qwen, practical default for laptop and edge inference.

Updated


Quick facts

Released
Sep 2024
Context
128K tokens
Output / 1M
Pricing not published
License
Apache-2.0

About Qwen2.5-7B

Qwen2.5-7B is Alibaba's small-model entry, frequently used for on-device chat and edge deployments. Apache-2.0 licensed, broad tooling support across Ollama, llama.cpp, MLX, and vLLM.

For multilingual on-device chat, particularly Chinese, Japanese, and Korean, Qwen2.5-7B is the strongest pick in its size class and a popular default in mobile-AI products outside North America.

Benchmarks

Published scores from Alibaba's model card or independent leaderboards. We do not publish numbers we cannot source, see methodology.

HumanEval
,
Python coding pass@1
MMLU
,
Broad academic knowledge
GPQA
,
Graduate-level reasoning
SWE-bench
,
Real software-engineering tasks
Benchmark scores not yet available. We only publish numbers we can source from official model cards or independent leaderboards, see methodology.

Capabilities

Strengths

  • Apache-2.0
  • Runs on laptops
  • Strong multilingual

Tracked weaknesses

  • Quality limited by size

Pricing

Per-million-token rates as published by Alibaba.

Per-token pricing not yet published for Qwen2.5-7B. Check the official provider site for current tiers.

Call Qwen2.5-7B from your code

Drop-in snippet for the Alibaba SDK. Set your API key in the environment and run.

typescript
import OpenAI from "openai";

const client = new OpenAI({
  // Use OPENAI_API_KEY for OpenAI, or your provider's key + baseURL.
  apiKey: process.env.OPENAI_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "qwen-2-5-7b",
  messages: [
    { role: "user", content: "What's the time complexity of quicksort?" },
  ],
});

console.log(completion.choices[0].message.content);

Best for

Tasks where Qwen2.5-7B ranks among LLMDex's top picks.

Compare Qwen2.5-7B with…

Frequently asked

  • How much does Qwen2.5-7B cost?
    Alibaba has not published per-token API pricing for Qwen2.5-7B at the time of writing. Check the official site for current pricing tiers, or compare against alternative models on LLMDex.
  • What is Qwen2.5-7B's context window?
    Qwen2.5-7B supports a context window of 128K tokens.
  • Is Qwen2.5-7B open source?
    Qwen2.5-7B ships with open weights under the Apache-2.0 license. You can self-host it, fine-tune it, and (subject to the license terms) deploy it commercially.
  • When was Qwen2.5-7B released?
    Qwen2.5-7B was released on Sep 19, 2024 by Alibaba.
Friday digest

Intelligence, distilled weekly.

One short email every Friday, new model launches, leaderboard moves, and pricing drops. Curated by hand. Free, no spam.