How much does Gemini 3 Flash cost?

Google has not published per-token API pricing for Gemini 3 Flash at the time of writing. Check the official site for current pricing tiers, or compare against alternative models on LLMDex.

What is Gemini 3 Flash's context window?

Gemini 3 Flash supports a context window of 1.0M tokens with up to 65.5K output tokens per response.

Is Gemini 3 Flash open source?

No. Gemini 3 Flash is a closed-weight model, you can use it via Google's API but the model weights are not publicly downloadable.

When was Gemini 3 Flash released?

Gemini 3 Flash was released on Dec 9, 2025 by Google.

What is Gemini 3 Flash's knowledge cutoff?

Gemini 3 Flash's training data has a knowledge cutoff of Jan 2025. For information after that date you'll need a tool-use or web-search wrapper.

How much does Gemini 3 Flash cost?

Google has not published per-token API pricing for Gemini 3 Flash at the time of writing. Check the official site for current pricing tiers, or compare against alternative models on LLMDex.

What is Gemini 3 Flash's context window?

Gemini 3 Flash supports a context window of 1.0M tokens with up to 65.5K output tokens per response.

Is Gemini 3 Flash open source?

No. Gemini 3 Flash is a closed-weight model, you can use it via Google's API but the model weights are not publicly downloadable.

When was Gemini 3 Flash released?

Gemini 3 Flash was released on Dec 9, 2025 by Google.

What is Gemini 3 Flash's knowledge cutoff?

Gemini 3 Flash's training data has a knowledge cutoff of Jan 2025. For information after that date you'll need a tool-use or web-search wrapper.

ProprietaryGooglemidtextvisionaudiovideo

Gemini 3 Flash

Google's high-speed, low-cost mid-tier with the same massive context window, popular for high-volume RAG.

UpdatedApr 30, 2026

Quick facts

Released: Dec 2025
Context: 1.0M tokens
Output / 1M: Pricing not published
License: Proprietary

About Gemini 3 Flash

Gemini 3 Flash is the workhorse most production Gemini deployments use. It keeps the 1M-token window of the Pro tier while running noticeably faster and at a fraction of the cost, a unique combination in the market.

The trade-off shows up on hard reasoning and creative tasks where Pro pulls ahead. For RAG, summarization, and customer support, Flash often wins on total cost-to-quality.

Benchmarks

Published scores from Google's model card or independent leaderboards. We do not publish numbers we cannot source, see methodology.

HumanEval

Python coding pass@1

MMLU

Broad academic knowledge

GPQA

Graduate-level reasoning

SWE-bench

Real software-engineering tasks

Benchmark scores not yet available. We only publish numbers we can source from official model cards or independent leaderboards, see methodology.

Capabilities

Strengths

1M-token context at mid-tier price
Very fast, good for interactive UX
Strong vision support
Cheap on input-heavy workloads

Tracked weaknesses

Reasoning quality below Pro
Creative writing trails Claude / GPT mid-tiers

Pricing

Per-million-token rates as published by Google.

Per-token pricing not yet published for Gemini 3 Flash. Check the official provider site for current tiers.

Call Gemini 3 Flash from your code

Drop-in snippet for the Google SDK. Set your API key in the environment and run.

typescript

import { GoogleGenerativeAI } from "@google/generative-ai";

const genAI = new GoogleGenerativeAI(process.env.GEMINI_API_KEY);
const model = genAI.getGenerativeModel({ model: "gemini-3-flash" });

const result = await model.generateContent(
  "What's the time complexity of quicksort?",
);

console.log(result.response.text());

Best for

Tasks where Gemini 3 Flash ranks among LLMDex's top picks.

Compare Gemini 3 Flash with…

Frequently asked

How much does Gemini 3 Flash cost?
Google has not published per-token API pricing for Gemini 3 Flash at the time of writing. Check the official site for current pricing tiers, or compare against alternative models on LLMDex.
What is Gemini 3 Flash's context window?
Gemini 3 Flash supports a context window of 1.0M tokens with up to 65.5K output tokens per response.
Is Gemini 3 Flash open source?
No. Gemini 3 Flash is a closed-weight model, you can use it via Google's API but the model weights are not publicly downloadable.
When was Gemini 3 Flash released?
Gemini 3 Flash was released on Dec 9, 2025 by Google.
What is Gemini 3 Flash's knowledge cutoff?
Gemini 3 Flash's training data has a knowledge cutoff of Jan 2025. For information after that date you'll need a tool-use or web-search wrapper.

Friday digest

One short email every Friday, new model launches, leaderboard moves, and pricing drops. Curated by hand. Free, no spam.