Gemini 3 Flash
Google's high-speed, low-cost mid-tier with the same massive context window, popular for high-volume RAG.
Updated
Quick facts
- Released
- Dec 2025
- Context
- 1.0M tokens
- Output / 1M
- Pricing not published
- License
- Proprietary
About Gemini 3 Flash
Gemini 3 Flash is the workhorse most production Gemini deployments use. It keeps the 1M-token window of the Pro tier while running noticeably faster and at a fraction of the cost, a unique combination in the market.
The trade-off shows up on hard reasoning and creative tasks where Pro pulls ahead. For RAG, summarization, and customer support, Flash often wins on total cost-to-quality.
Benchmarks
Published scores from Google's model card or independent leaderboards. We do not publish numbers we cannot source, see methodology.
Capabilities
Strengths
- 1M-token context at mid-tier price
- Very fast, good for interactive UX
- Strong vision support
- Cheap on input-heavy workloads
Tracked weaknesses
- Reasoning quality below Pro
- Creative writing trails Claude / GPT mid-tiers
Pricing
Per-million-token rates as published by Google.
Call Gemini 3 Flash from your code
Drop-in snippet for the Google SDK. Set your API key in the environment and run.
import { GoogleGenerativeAI } from "@google/generative-ai";
const genAI = new GoogleGenerativeAI(process.env.GEMINI_API_KEY);
const model = genAI.getGenerativeModel({ model: "gemini-3-flash" });
const result = await model.generateContent(
"What's the time complexity of quicksort?",
);
console.log(result.response.text());Best for
Tasks where Gemini 3 Flash ranks among LLMDex's top picks.
Compare Gemini 3 Flash with…
Frequently asked
How much does Gemini 3 Flash cost?
Google has not published per-token API pricing for Gemini 3 Flash at the time of writing. Check the official site for current pricing tiers, or compare against alternative models on LLMDex.What is Gemini 3 Flash's context window?
Gemini 3 Flash supports a context window of 1.0M tokens with up to 65.5K output tokens per response.Is Gemini 3 Flash open source?
No. Gemini 3 Flash is a closed-weight model, you can use it via Google's API but the model weights are not publicly downloadable.When was Gemini 3 Flash released?
Gemini 3 Flash was released on Dec 9, 2025 by Google.What is Gemini 3 Flash's knowledge cutoff?
Gemini 3 Flash's training data has a knowledge cutoff of Jan 2025. For information after that date you'll need a tool-use or web-search wrapper.
Intelligence, distilled weekly.
One short email every Friday, new model launches, leaderboard moves, and pricing drops. Curated by hand. Free, no spam.