
MiMo V2 Pro

xiaomi/mimo-v2-pro

MiMo V2 Pro is the Pro variant in Xiaomi's MiMo v2 family with over 1T total parameters and 42B active, built for math, code, and multi-step reasoning within a context window of 1M tokens. It uses a hybrid attention architecture for long-context processing.

Reasoning, Tool Use
index.ts
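import { streamText } from 'ai'

const result = streamText({
  model: 'xiaomi/mimo-v2-pro',
  prompt: 'Why is the sky blue?'
})
// Print the response text as it streams in
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}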

What To Consider When Choosing a Provider

  • Zero Data Retention

    AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.

  • Authentication

    AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

MiMo V2 Pro sits at the Pro end of the MiMo v2 lineup. It costs more per token than Flash, but it fits when accuracy matters more than price. Use AI Gateway's cost tracking and model fallback to route easy work to Flash and harder work to Pro.
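The routing idea above can be sketched as a small model picker. This is a minimal illustration, not AI Gateway's own routing logic: the `Task` shape, the `LONG_PROMPT_CHARS` threshold, and the Flash model ID `xiaomi/mimo-v2-flash` are assumptions made for the example, so check the model catalog for the actual Flash ID.

```typescript
// The Pro ID comes from this page; the Flash ID is an assumption.
const PRO_MODEL = 'xiaomi/mimo-v2-pro';
const FLASH_MODEL = 'xiaomi/mimo-v2-flash'; // assumed ID, verify before use

// Hypothetical task descriptor used only for this illustration.
interface Task {
  kind: 'classification' | 'extraction' | 'chat' | 'code' | 'math' | 'analysis';
  promptChars: number;
}

// Hypothetical threshold: longer prompts are treated as harder work.
const LONG_PROMPT_CHARS = 20000;

// Send reasoning-heavy or very long tasks to Pro, everything else to Flash.
function pickModel(task: Task): string {
  const hardKinds: Task['kind'][] = ['code', 'math', 'analysis'];
  const isHard =
    hardKinds.indexOf(task.kind) !== -1 || task.promptChars > LONG_PROMPT_CHARS;
  return isHard ? PRO_MODEL : FLASH_MODEL;
}
```

The returned string can be passed as the `model` value in a call like the `streamText` example earlier on this page.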

When to Use MiMo V2 Pro

Best For

  • Multi-step reasoning:

    Hard problems where accuracy matters more than raw throughput

  • Code generation:

    Architecture design, debugging, and multi-file refactors

  • Math and proofs:

    Problems that require long logical chains where intermediate reasoning steps matter

  • Long-context work:

    A window of 1M tokens fits big documents or repos

  • Pro-tier MoE:

    Over 1T total parameters with 42B active per forward pass
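For the long-context case, a rough pre-flight check can tell you whether a set of documents is likely to fit in the 1M-token window. The sketch below assumes about 4 characters per token, which is only a heuristic (real tokenization varies by language and content), and it assumes that output tokens count against the same window. The function names are illustrative.

```typescript
const CONTEXT_WINDOW_TOKENS = 1000000; // window size stated on this page
const CHARS_PER_TOKEN = 4; // rough assumption, not the model's real tokenizer

// Estimate the token count of a string from its character length.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// True when the estimated input plus the reserved output budget fits the window.
function fitsInContext(docs: string[], reservedOutputTokens: number): boolean {
  const inputTokens = docs.reduce((sum, doc) => sum + estimateTokens(doc), 0);
  return inputTokens + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS;
}
```

If the check fails, splitting the input or summarizing parts of it first is usually safer than relying on the estimate being too high.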

Consider Alternatives When

  • Speed-first simple tasks:

    The Flash variant fits better when cost and throughput drive the choice

  • Multimodal input required:

    MiMo V2 Pro is text-in, text-out only

  • Simple classification jobs:

    A smaller, cheaper model handles classification and extraction at lower cost

Conclusion

MiMo V2 Pro is the Pro pick in Xiaomi's MiMo v2 lineup. Use it for multi-step math and code problems and for analytical work. Pair it with Flash through AI Gateway routing so you can balance cost and quality.

FAQ

MiMo V2 Pro is the Pro tier of the MiMo v2 family. It targets harder reasoning, math, and code than Flash, at a higher per-token cost and somewhat lower throughput.

MiMo V2 Pro uses a Mixture-of-Experts (MoE) architecture: each forward pass activates only a subset of the parameters (42B of over 1T), which keeps inference cost manageable while the full parameter count holds broader knowledge.
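To make the "subset of parameters" idea concrete, here is a toy top-k expert router. It is a generic MoE illustration and not MiMo's actual routing scheme, which this page does not describe. The experts are plain functions on a number and the gate logits are supplied by the caller, both simplifications for the example.

```typescript
type Expert = (x: number) => number;

// Numerically stable softmax over the gate logits.
function softmax(logits: number[]): number[] {
  const max = Math.max(...logits);
  const exps = logits.map((v) => Math.exp(v - max));
  const sum = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / sum);
}

// Run only the k highest-weighted experts and mix their outputs.
function moeForward(x: number, gateLogits: number[], experts: Expert[], k: number): number {
  const ranked = softmax(gateLogits)
    .map((weight, index) => ({ weight, index }))
    .sort((a, b) => b.weight - a.weight)
    .slice(0, k);
  // Renormalize so the selected weights sum to 1.
  const norm = ranked.reduce((s, r) => s + r.weight, 0);
  let output = 0;
  for (const { weight, index } of ranked) {
    const expert = experts[index];
    if (expert === undefined) continue; // gate and expert lists should match in length
    output += (weight / norm) * expert(x);
  }
  return output;
}
```

Experts outside the top k are never called, which is where the saving comes from. By the figures on this page (42B active of over 1T total), at most about 4.2% of the weights take part in a given forward pass.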

The context window is 1M tokens. Hybrid sliding-window attention reduces KV-cache use so long-context runs stay practical.
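The KV-cache saving can be approximated with simple arithmetic. In a hybrid layout, layers with full attention cache every position, while sliding-window layers cache at most one window of positions. The sketch below counts cached positions under that assumption. The layer counts and window size are hypothetical inputs, since this page does not give MiMo V2 Pro's actual configuration.

```typescript
// Hypothetical layer layout; the real MiMo V2 Pro values are not given on this page.
interface HybridConfig {
  fullLayers: number; // layers using full (global) attention
  slidingLayers: number; // layers using sliding-window attention
  windowTokens: number; // sliding-window size in tokens
}

// Cached token positions summed over all layers.
function cachedPositions(seqLen: number, cfg: HybridConfig): number {
  const full = cfg.fullLayers * seqLen;
  const sliding = cfg.slidingLayers * Math.min(seqLen, cfg.windowTokens);
  return full + sliding;
}

// Cache size relative to a model where every layer uses full attention.
function cacheRatio(seqLen: number, cfg: HybridConfig): number {
  const allFull = (cfg.fullLayers + cfg.slidingLayers) * seqLen;
  return cachedPositions(seqLen, cfg) / allFull;
}
```

With an illustrative layout of 1 full layer per 3 sliding layers and a 1,000-token window, a 1M-token sequence caches roughly a quarter of the positions that an all-full-attention layout would. For sequences shorter than the window, the ratio is 1 and there is no saving.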

Add your API key in AI Gateway project settings. Use xiaomi/mimo-v2-pro in API calls. AI Gateway routes, retries, and fails over across the providers listed for this model (xiaomi).

See the pricing section on this page for today's rates. AI Gateway exposes each provider's pricing for MiMo V2 Pro.

Yes. AI Gateway supports fallback and routing. You can send hard requests to MiMo V2 Pro and fall back to Flash for simpler tasks to control cost.
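AI Gateway provides fallback itself. The sketch below only illustrates the pattern on the client side and does not use the Gateway's fallback configuration. The `call` parameter is a hypothetical function you supply, for example a wrapper around an AI SDK request, and the Flash model ID in the usage note is an assumption.

```typescript
// Hypothetical caller-supplied function that runs one request against a model ID.
type ModelCall<T> = (model: string) => Promise<T>;

// Try each model in order and return the first successful result.
async function withFallback<T>(
  models: string[],
  call: ModelCall<T>
): Promise<{ model: string; result: T }> {
  let lastError: unknown = new Error('no models provided');
  for (const model of models) {
    try {
      return { model, result: await call(model) };
    } catch (err) {
      lastError = err; // remember the failure and try the next model
    }
  }
  throw lastError;
}
```

A call such as `withFallback(['xiaomi/mimo-v2-pro', 'xiaomi/mimo-v2-flash'], runPrompt)` would try Pro first and use Flash only if the Pro request throws. Here `runPrompt` stands in for your own request function.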

Multi-step reasoning, code generation, math, and long-context analysis. For short or simple jobs, Flash is usually cheaper.

Yes. The MiMo v2 line is under the MIT license, which allows commercial use, modification, and redistribution.