
Gemini 3 Pro Preview

google/gemini-3-pro-preview

Gemini 3 Pro Preview is the flagship reasoning model in the Gemini 3 generation for demanding agentic and analytical tasks, with improvements in multi-step function calling, complex image reasoning, long-document analysis, and instruction following over Gemini 2.5 Pro.

File Input · Tool Use · Reasoning · Vision (Image) · Web Search · Tiered Cost · Implicit Caching
index.ts

  import { streamText } from 'ai'

  const result = streamText({
    model: 'google/gemini-3-pro-preview',
    prompt: 'Why is the sky blue?',
  })

  for await (const text of result.textStream) {
    process.stdout.write(text)
  }

What To Consider When Choosing a Provider

  • Zero Data Retention

    AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.

  • Authentication

    AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
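For example, a direct request needs only the gateway credential. This sketch assumes the gateway exposes an OpenAI-compatible endpoint at ai-gateway.vercel.sh/v1 and that the key lives in an AI_GATEWAY_API_KEY environment variable; confirm both against the AI Gateway docs:

```shell
# Call the model through the gateway directly. No Google credentials are
# involved; the gateway key alone authenticates the request.
curl https://ai-gateway.vercel.sh/v1/chat/completions \
  -H "Authorization: Bearer $AI_GATEWAY_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemini-3-pro-preview",
    "messages": [{"role": "user", "content": "Why is the sky blue?"}]
  }'
```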

Gemini 3 Pro Preview is a reasoning model: enable includeThoughts via providerOptions.google.thinkingConfig to surface the model's reasoning trace, which is particularly useful when auditing complex multi-step outputs.
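Concretely, that might look like the sketch below. The thinkingConfig shape follows the @ai-sdk/google provider options, and the stream-part type names ('reasoning-delta', 'text-delta') differ between AI SDK major versions, so verify both against your SDK's documentation:

```typescript
import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-3-pro-preview',
  prompt: 'Plan a zero-downtime database migration and justify each step.',
  providerOptions: {
    // Assumed provider-option shape; check the @ai-sdk/google docs.
    google: { thinkingConfig: { includeThoughts: true } },
  },
})

// Reasoning arrives as separate parts on fullStream. Part type names
// ('reasoning' vs 'reasoning-delta') depend on your AI SDK version.
for await (const part of result.fullStream) {
  if (part.type === 'reasoning-delta') {
    console.error('[thinking]', part.text) // reasoning trace, kept off stdout
  } else if (part.type === 'text-delta') {
    process.stdout.write(part.text) // the final answer
  }
}
```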

When to Use Gemini 3 Pro Preview

Best For

  • Multi-step agentic workflows:

    Sequential function calls that require reliable planning and execution

  • Deep document analysis:

    Combining long text with embedded charts, diagrams, and images

  • Instruction-following tasks:

    Precision and completeness are critical to downstream correctness

  • Reasoning-intensive applications:

    Surfacing the model's thought process aids auditability

  • Complex technical research:

    Tasks requiring synthesis across disparate sources and formats

Consider Alternatives When

  • Latency and cost are primary:

    Per-token cost and response speed dominate (consider google/gemini-3-flash for pro-grade quality at flash speed)

  • Updated agentic quality needed:

    For the latest improvements on software engineering tasks (consider google/gemini-3.1-pro-preview)

  • Native image generation output:

    Your workflow requires image output (consider google/gemini-3-pro-image)

  • High-volume straightforward tasks:

    Extraction or translation at scale (consider google/gemini-3.1-flash-lite-preview)

Conclusion

Gemini 3 Pro Preview targets tasks where getting every step right matters more than getting the answer quickly. That means complex agentic pipelines, technical document analysis, and multimodal reasoning that spans images and long text. For teams building the highest-stakes AI features, this is the Gemini 3 model designed for reasoning depth rather than maximum throughput.

FAQ

What does Gemini 3 Pro Preview improve over Gemini 2.5 Pro?

Four specific improvements: multi-step function calling, planning, reasoning over complex images and long documents, and instruction following. These directly address the reliability gaps that affect agentic workflows at scale.

How do I surface the model's reasoning trace?

Set includeThoughts to true under providerOptions.google.thinkingConfig in the AI SDK. With streamText, the model emits reasoning tokens alongside the generated response.

Is Gemini 3 Pro Preview a good fit for latency-sensitive applications?

It can be, but it is a reasoning model with higher latency than the Flash tier. For interactive applications where sub-second responses are required, google/gemini-3-flash provides pro-grade reasoning at significantly lower latency.

Can it analyze documents that mix long text with charts and images?

Yes. The model handles long documents with embedded charts, diagrams, and images. Improved reasoning over complex images and long documents is one of its headline capabilities over Gemini 2.5 Pro.

How does Gemini 3 Pro Preview differ from Gemini 3.1 Pro?

Gemini 3.1 Pro introduces additional quality improvements for software engineering and agentic tasks, enhanced usability for finance and spreadsheet applications, and more efficient thinking that reduces token consumption. Gemini 3 Pro Preview was the initial release; 3.1 Pro builds on that foundation.

Do I need my own Google API key to use this model through AI Gateway?

No. AI Gateway manages all underlying provider credentials. You authenticate once using a Vercel API key or OIDC token.

What does improved multi-step function calling mean in practice?

The model more reliably executes sequences of tool calls: choosing the right tool, interpreting its output, deciding whether to call another tool, and knowing when the task is complete. This reduces the need for human intervention to correct routing errors mid-workflow.
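That loop can be sketched with the AI SDK's tool-calling support. Everything below the model id is illustrative: getWeather and its stubbed result are hypothetical, and the step-limit API (stopWhen/stepCountIs here, maxSteps in older AI SDK versions) varies by SDK version:

```typescript
import { generateText, tool, stepCountIs } from 'ai'
import { z } from 'zod'

const { text, steps } = await generateText({
  model: 'google/gemini-3-pro-preview',
  tools: {
    // Hypothetical tool: in a real app, execute would call a weather API.
    getWeather: tool({
      description: 'Get the current weather for a city',
      inputSchema: z.object({ city: z.string() }),
      execute: async ({ city }) => ({ city, tempC: 21 }), // stubbed result
    }),
  },
  stopWhen: stepCountIs(5), // allow up to five model/tool round trips
  prompt: 'Compare the current weather in Paris and Tokyo.',
})

// `steps` records each tool call and result, which is useful for auditing
// whether the model planned, executed, and terminated the loop correctly.
console.log(steps.length, text)
```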

Does the model accept image inputs alongside text?

Yes. You can pass image inputs alongside text prompts to enable cross-modal analysis within a single request.
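In the AI SDK message format, an image is just another content part next to the text. A minimal sketch (the chart URL is a placeholder, not a real asset):

```typescript
// A single user message mixing a text part and an image part, in the
// AI SDK's multimodal message shape.
const messages = [
  {
    role: 'user' as const,
    content: [
      { type: 'text' as const, text: 'Summarize the trend in this chart.' },
      {
        type: 'image' as const,
        image: new URL('https://example.com/q3-revenue.png'), // placeholder
      },
    ],
  },
]

// Passed alongside the model in a single request, e.g.:
// const { text } = await generateText({ model: 'google/gemini-3-pro-preview', messages })
console.log(messages[0].content.map((part) => part.type).join(','))
```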