Google
Gemini 2.5 Flash
standard
Google's workhorse model: fast, cheap, and surprisingly capable. Handles 1M-token contexts at a fraction of Pro's cost, with an optional thinking mode.
fast, affordable, long-context
Context: 1,048,576 tokens
Input / 1M tokens: $0.30
Output / 1M tokens: $2.50
Knowledge cutoff: 2025-01
Capabilities
Code: 4/5
Reasoning: 4/5
Multilingual: 5/5
Safety: 4/5
Speed: 5/5
Modalities & Features
Vision (image input): Yes
Audio: Yes
Video: Yes
Function / tool calling: Yes
JSON mode: Yes
Extended thinking: Yes
Context & Output
Context window: 1,048,576 tokens
Max output tokens: 65,536
Knowledge cutoff: 2025-01
Release date: 2025-05
API & Access
Open source: No
Streaming: Yes
Batch API: Yes
Prompt caching: Yes
Fine-tuning: No
Benchmarks
MMLU (General knowledge): 85.8%
GPQA Diamond (Hard science Q&A): N/A
MATH (Math problem solving): 74.1%
AIME 2025 (Advanced math): N/A
GSM8K (Grade-school math): 93.8%
HumanEval (Code generation): 83.2%
LiveCodeBench (Live coding problems): N/A
SWE-bench Verified (Real-world software): N/A
HellaSwag (Commonsense inference): N/A
Source: provider technical reports & independent evaluations. N/A = not yet published for this model.
Pricing (per 1M tokens)
Input: $0.30
Output: $2.50
Consumer plan: Gemini Free (Free)
Prices shown are list prices and may vary by provider plan. Always check official documentation for the latest pricing.
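As a rough illustration of how the per-1M-token list prices translate into a per-request cost, here is a minimal Python sketch. The constants are copied from this page; the function name `estimate_cost` and the limit checks are hypothetical and simplified. It does not model batch or prompt-caching discounts, and it assumes the context window bounds the input and the max output limit bounds the output independently.

```python
# Figures copied from this page; verify against official pricing before relying on them.
INPUT_PRICE_PER_M = 0.30      # USD per 1M input tokens (list price)
OUTPUT_PRICE_PER_M = 2.50     # USD per 1M output tokens (list price)
CONTEXT_WINDOW = 1_048_576    # tokens
MAX_OUTPUT_TOKENS = 65_536    # tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the list-price cost in USD of one request (hypothetical helper).

    Assumption: limits are checked separately for input and output;
    discounts (batch, caching) are ignored.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens > CONTEXT_WINDOW:
        raise ValueError("input exceeds the context window listed on this page")
    if output_tokens > MAX_OUTPUT_TOKENS:
        raise ValueError("output exceeds the max output tokens listed on this page")

    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # 100,000 input tokens and 2,000 output tokens
    print(f"${estimate_cost(100_000, 2_000):.4f}")
```

Because output is priced at more than eight times the input rate, long generated responses can dominate the bill even when the prompt is much larger than the reply.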