LLM·Dex
ProprietaryGooglemidtextvisionaudio

Gemini 2.5 Flash

Mid-2025 fast tier, set the bar for cost-efficient long-context generation.

Updated


Quick facts

Released
Apr 2025
Context
1.0M tokens
Output / 1M
$0.30 / 1M tokens
License
Proprietary

About Gemini 2.5 Flash

Gemini 2.5 Flash hit a sweet spot for high-throughput long-context workloads in 2025: 1M-token window, very fast inference, and prices low enough that streaming entire docs into the model became routine.

Benchmarks

Published scores from Google's model card or independent leaderboards. We do not publish numbers we cannot source, see methodology.

HumanEval
,
Python coding pass@1
MMLU
,
Broad academic knowledge
GPQA
,
Graduate-level reasoning
SWE-bench
,
Real software-engineering tasks
Benchmark scores not yet available. We only publish numbers we can source from official model cards or independent leaderboards, see methodology.

Capabilities

Strengths

  • Cheap and fast
  • Long context
  • Solid vision

Tracked weaknesses

  • Reasoning depth below Pro tier

Pricing

Per-million-token rates as published by Google.

TierPriceNotes
Input$0.075 / 1M tokensTokens you send to the model
Output$0.30 / 1M tokensTokens the model generates
Context1.0M tokensMax combined input + output

Call Gemini 2.5 Flash from your code

Drop-in snippet for the Google SDK. Set your API key in the environment and run.

typescript
import { GoogleGenerativeAI } from "@google/generative-ai";

const genAI = new GoogleGenerativeAI(process.env.GEMINI_API_KEY);
const model = genAI.getGenerativeModel({ model: "gemini-2-5-flash" });

const result = await model.generateContent(
  "What's the time complexity of quicksort?",
);

console.log(result.response.text());

Compare Gemini 2.5 Flash with…

Frequently asked

  • How much does Gemini 2.5 Flash cost per million tokens?
    Gemini 2.5 Flash is priced at $0.075 / 1M tokens for input tokens and $0.30 / 1M tokens for output tokens via the official Google API at the time of writing.
  • What is Gemini 2.5 Flash's context window?
    Gemini 2.5 Flash supports a context window of 1.0M tokens.
  • Is Gemini 2.5 Flash open source?
    No. Gemini 2.5 Flash is a closed-weight model, you can use it via Google's API but the model weights are not publicly downloadable.
  • When was Gemini 2.5 Flash released?
    Gemini 2.5 Flash was released on Apr 9, 2025 by Google.
  • What is Gemini 2.5 Flash's knowledge cutoff?
    Gemini 2.5 Flash's training data has a knowledge cutoff of Jan 2025. For information after that date you'll need a tool-use or web-search wrapper.
Friday digest

Intelligence, distilled weekly.

One short email every Friday, new model launches, leaderboard moves, and pricing drops. Curated by hand. Free, no spam.