o3
OpenAI's flagship reasoning model, set the bar for hard math, GPQA, and agent benchmarks in 2025.
Updated
Quick facts
- Released
- Apr 2025
- Context
- 200K tokens
- Output / 1M
- $8.00 / 1M tokens
- License
- Proprietary
About o3
o3 was the public release that established the reasoning-model paradigm as production-ready. Its scores on GPQA Diamond and ARC-AGI were the first that suggested LLMs could genuinely tackle graduate-level science questions and abstract reasoning puzzles in a controlled setting.
Despite GPT-5's unified-model approach absorbing much of o3's mindshare, o3 remains the model of choice for the most demanding reasoning workloads where wall-clock time isn't the constraint.
Benchmarks
Published scores from OpenAI's model card or independent leaderboards. We do not publish numbers we cannot source, see methodology.
- HumanEval95.0
- GPQA87.7
Capabilities
Strengths
- Industry-leading reasoning depth at launch
- Strong on math, science, and abstract puzzles
- Tool-use during reasoning loops
Tracked weaknesses
- Slow first-token, unpredictable total latency
- Expensive when reasoning runs long
Pricing
Per-million-token rates as published by OpenAI.
| Tier | Price | Notes |
|---|---|---|
| Input | $2.00 / 1M tokens | Tokens you send to the model |
| Output | $8.00 / 1M tokens | Tokens the model generates |
| Context | 200K tokens | Max combined input + output |
Call o3 from your code
Drop-in snippet for the OpenAI SDK. Set your API key in the environment and run.
import OpenAI from "openai";
const client = new OpenAI({
// Use OPENAI_API_KEY for OpenAI, or your provider's key + baseURL.
apiKey: process.env.OPENAI_API_KEY,
});
const completion = await client.chat.completions.create({
model: "o3",
messages: [
{ role: "user", content: "What's the time complexity of quicksort?" },
],
});
console.log(completion.choices[0].message.content);Best for
Tasks where o3 ranks among LLMDex's top picks.
- Best LLM for CodingDevelopers searching for the most capable model to write, edit, or refactor code in real codebases.See ranking
- Best LLM for Code ReviewEngineers automating PR review, catching bugs, security issues, or style regressions before merge.See ranking
- Best LLM for Essay WritingLong-form analytical or argumentative essays, blog posts, op-eds.See ranking
Compare o3 with…
Frequently asked
How much does o3 cost per million tokens?
o3 is priced at $2.00 / 1M tokens for input tokens and $8.00 / 1M tokens for output tokens via the official OpenAI API at the time of writing.What is o3's context window?
o3 supports a context window of 200K tokens.Is o3 open source?
No. o3 is a closed-weight model, you can use it via OpenAI's API but the model weights are not publicly downloadable.When was o3 released?
o3 was released on Apr 16, 2025 by OpenAI.What is o3's knowledge cutoff?
o3's training data has a knowledge cutoff of Jun 2024. For information after that date you'll need a tool-use or web-search wrapper.
Intelligence, distilled weekly.
One short email every Friday, new model launches, leaderboard moves, and pricing drops. Curated by hand. Free, no spam.