ProprietaryGooglemidtextvisionaudio
Gemini 2.5 Flash
Mid-2025 fast tier, set the bar for cost-efficient long-context generation.
Updated
Quick facts
- Released
- Apr 2025
- Context
- 1.0M tokens
- Output / 1M
- $0.30 / 1M tokens
- License
- Proprietary
About Gemini 2.5 Flash
Gemini 2.5 Flash hit a sweet spot for high-throughput long-context workloads in 2025: 1M-token window, very fast inference, and prices low enough that streaming entire docs into the model became routine.
Benchmarks
Published scores from Google's model card or independent leaderboards. We do not publish numbers we cannot source, see methodology.
HumanEval
,
Python coding pass@1
MMLU
,
Broad academic knowledge
GPQA
,
Graduate-level reasoning
SWE-bench
,
Real software-engineering tasks
Benchmark scores not yet available. We only publish numbers we can source from official model cards or independent leaderboards, see methodology.
Capabilities
Strengths
- Cheap and fast
- Long context
- Solid vision
Tracked weaknesses
- Reasoning depth below Pro tier
Pricing
Per-million-token rates as published by Google.
| Tier | Price | Notes |
|---|---|---|
| Input | $0.075 / 1M tokens | Tokens you send to the model |
| Output | $0.30 / 1M tokens | Tokens the model generates |
| Context | 1.0M tokens | Max combined input + output |
Call Gemini 2.5 Flash from your code
Drop-in snippet for the Google SDK. Set your API key in the environment and run.
typescript
import { GoogleGenerativeAI } from "@google/generative-ai";
const genAI = new GoogleGenerativeAI(process.env.GEMINI_API_KEY);
const model = genAI.getGenerativeModel({ model: "gemini-2-5-flash" });
const result = await model.generateContent(
"What's the time complexity of quicksort?",
);
console.log(result.response.text());Compare Gemini 2.5 Flash with…
Frequently asked
How much does Gemini 2.5 Flash cost per million tokens?
Gemini 2.5 Flash is priced at $0.075 / 1M tokens for input tokens and $0.30 / 1M tokens for output tokens via the official Google API at the time of writing.What is Gemini 2.5 Flash's context window?
Gemini 2.5 Flash supports a context window of 1.0M tokens.Is Gemini 2.5 Flash open source?
No. Gemini 2.5 Flash is a closed-weight model, you can use it via Google's API but the model weights are not publicly downloadable.When was Gemini 2.5 Flash released?
Gemini 2.5 Flash was released on Apr 9, 2025 by Google.What is Gemini 2.5 Flash's knowledge cutoff?
Gemini 2.5 Flash's training data has a knowledge cutoff of Jan 2025. For information after that date you'll need a tool-use or web-search wrapper.
Friday digest
Intelligence, distilled weekly.
One short email every Friday, new model launches, leaderboard moves, and pricing drops. Curated by hand. Free, no spam.