LLM·Dex
ProprietaryGooglemidtextvisionaudiovideo

Gemini 3 Flash

Google's high-speed, low-cost mid-tier with the same massive context window, popular for high-volume RAG.

Updated


Quick facts

Released
Dec 2025
Context
1.0M tokens
Output / 1M
Pricing not published
License
Proprietary

About Gemini 3 Flash

Gemini 3 Flash is the workhorse most production Gemini deployments use. It keeps the 1M-token window of the Pro tier while running noticeably faster and at a fraction of the cost, a unique combination in the market.

The trade-off shows up on hard reasoning and creative tasks where Pro pulls ahead. For RAG, summarization, and customer support, Flash often wins on total cost-to-quality.

Benchmarks

Published scores from Google's model card or independent leaderboards. We do not publish numbers we cannot source, see methodology.

HumanEval
,
Python coding pass@1
MMLU
,
Broad academic knowledge
GPQA
,
Graduate-level reasoning
SWE-bench
,
Real software-engineering tasks
Benchmark scores not yet available. We only publish numbers we can source from official model cards or independent leaderboards, see methodology.

Capabilities

Strengths

  • 1M-token context at mid-tier price
  • Very fast, good for interactive UX
  • Strong vision support
  • Cheap on input-heavy workloads

Tracked weaknesses

  • Reasoning quality below Pro
  • Creative writing trails Claude / GPT mid-tiers

Pricing

Per-million-token rates as published by Google.

Per-token pricing not yet published for Gemini 3 Flash. Check the official provider site for current tiers.

Call Gemini 3 Flash from your code

Drop-in snippet for the Google SDK. Set your API key in the environment and run.

typescript
import { GoogleGenerativeAI } from "@google/generative-ai";

const genAI = new GoogleGenerativeAI(process.env.GEMINI_API_KEY);
const model = genAI.getGenerativeModel({ model: "gemini-3-flash" });

const result = await model.generateContent(
  "What's the time complexity of quicksort?",
);

console.log(result.response.text());

Best for

Tasks where Gemini 3 Flash ranks among LLMDex's top picks.

Compare Gemini 3 Flash with…

Frequently asked

  • How much does Gemini 3 Flash cost?
    Google has not published per-token API pricing for Gemini 3 Flash at the time of writing. Check the official site for current pricing tiers, or compare against alternative models on LLMDex.
  • What is Gemini 3 Flash's context window?
    Gemini 3 Flash supports a context window of 1.0M tokens with up to 65.5K output tokens per response.
  • Is Gemini 3 Flash open source?
    No. Gemini 3 Flash is a closed-weight model, you can use it via Google's API but the model weights are not publicly downloadable.
  • When was Gemini 3 Flash released?
    Gemini 3 Flash was released on Dec 9, 2025 by Google.
  • What is Gemini 3 Flash's knowledge cutoff?
    Gemini 3 Flash's training data has a knowledge cutoff of Jan 2025. For information after that date you'll need a tool-use or web-search wrapper.
Friday digest

Intelligence, distilled weekly.

One short email every Friday, new model launches, leaderboard moves, and pricing drops. Curated by hand. Free, no spam.