Gemini 2.5 Pro

Google

Frontier Model, Reasoning, Vision Model, Proprietary

Context

Knowledge Cutoff
Jan 01, 2025
Window
1M

Pricing (per 1M tokens)

Input
$1.25
Output
$10
Blended 3:1
$3.4375
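
The blended figure is consistent with weighting the two listed prices at three input tokens per output token. A minimal sketch of that arithmetic, assuming this is what "3:1" means here (the function name `blended_price` is illustrative and not part of any API):

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output token mix."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Listed prices per 1M tokens: $1.25 input, $10 output.
print(blended_price(1.25, 10.0))  # 3.4375
```

A workload with a different mix, for example long prompts with short answers, can pass its own `input_parts` and `output_parts` to get a more representative figure.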

Capabilities

Speed
143 t/s
Input
Output
Reasoning tokens

Latency

TTFT
36.54 ms
500 token response
40.02 s

Benchmarks

Reasoning
●●●●
Math
●●●●●
Coding
●●●○○
MMLU Pro
86.2%
GPQA
86.4%
HLE
21.6%
SciCode
42.8%
AIME
88.0%
MATH 500
93.0%
LiveCodeBench
69.0%

Gemini 2.5 Pro is Google’s latest sparse Mixture-of-Experts transformer that natively handles text, images, audio, and video in contexts of up to one million tokens. It scores strongly on public reasoning, math, and coding benchmarks (e.g., GPQA 86.4%, AIME 88.0%, LiveCodeBench 69.0%) while delivering 143 tokens/s with sub-40 ms time-to-first-token on TPUv5p hardware.

For developers, this means you can send entire codebases, PDFs, or hours of video to a single endpoint and get answers or generated code without building separate retrieval logic. Built-in tool use (function calling, search grounding, and code execution) lets the model browse the web, run code, and chain actions, which makes it straightforward to build agentic workflows such as chatbots, data pipelines, or autonomous apps on the Gemini API, with keys available through Google AI Studio.
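
On the application side, function calling usually comes down to declaring tools, receiving a structured call (a name plus arguments) from the model, and routing it to local code. The sketch below shows only that routing step and makes no network request. The tool `get_weather`, the declaration shape, and the `example_call` payload are all hypothetical stand-ins, and the exact wire format depends on the SDK in use.

```python
import json
from typing import Any, Callable, Dict


# Hypothetical local tool; a real app would query a weather service here.
def get_weather(city: str) -> Dict[str, Any]:
    return {"city": city, "forecast": "sunny"}


# JSON-Schema-style description of the tool that would be sent to the model.
# Assumption: the real declaration format is defined by the SDK, not by this sketch.
TOOL_DECLARATIONS = [{
    "name": "get_weather",
    "description": "Return a short forecast for a city.",
    "parameters": {
        "type": "object",
        "properties": {"city": {"type": "string"}},
        "required": ["city"],
    },
}]

# Registry mapping declared tool names to the Python callables that implement them.
TOOLS: Dict[str, Callable[..., Any]] = {"get_weather": get_weather}


def dispatch(function_call: Dict[str, Any]) -> str:
    """Run the tool the model asked for and return its result as JSON text."""
    name = function_call["name"]
    if name not in TOOLS:
        raise KeyError(f"unknown tool: {name}")
    result = TOOLS[name](**function_call.get("args", {}))
    return json.dumps(result)


# Stand-in for a function call the model might emit (not real model output).
example_call = {"name": "get_weather", "args": {"city": "Zurich"}}
print(dispatch(example_call))
```

In an agentic loop, the JSON string returned by `dispatch` would be sent back to the model as the tool result so it can continue the conversation or request another call.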