Updated 2026-07-05

Use V4 Flash thinking mode as the first replacement for `deepseek-reasoner`, then promote only the hardest traffic to Pro

A common migration mistake is assuming every old `deepseek-reasoner` route should jump straight to `deepseek-v4-pro`. DeepSeek's official docs suggest a more careful baseline: during the compatibility window, `deepseek-reasoner` already maps to the thinking mode of V4 Flash. That means many old reasoner workloads should test V4 Flash with thinking enabled first, then reserve Pro for the smaller slice that truly needs a stronger route.

1. What DeepSeek officially says about `deepseek-reasoner` today

DeepSeek's current change log says `deepseek-reasoner` points to the thinking mode of `deepseek-v4-flash` during the transition period before retirement on July 24, 2026.

That is the key migration clue. It tells you the legacy reasoner route is already closer to Flash thinking mode than to a separate premium model family.

Sources checked

DeepSeek official change log - Primary source for the current alias mapping and retirement date.

2. When V4 Flash thinking mode is usually enough

Use V4 Flash thinking mode first when the route mainly needs better step-by-step reasoning than a plain non-thinking answer, but still lives in a throughput-sensitive environment such as coding agents, support triage, or batch automation.

This path is especially attractive when cost and concurrency matter. DeepSeek's public pricing and routing surfaces keep Flash as the cheaper, higher-volume lane, so you do not want to upgrade every old reasoner call to Pro without evidence.

Migration split for old `deepseek-reasoner` traffic
Workload shape	Start with V4 Flash thinking?	When to escalate to Pro
Repeated coding-agent loops	Yes	Escalate only if accepted output quality stays weak after prompt and tool fixes
Batch summarization or triage	Yes	Escalate only for a narrow high-trust slice
Architecture review or harder reasoning	Maybe	Use Pro when the review depth itself is the product requirement

Sources checked

DeepSeek official Models & Pricing page - Use pricing and concurrency differences to decide whether Pro is worth the promotion.

3. Update OpenAI-compatible clients with explicit thinking settings

For OpenAI-compatible clients, the first migration step is explicit model naming. Stop using `deepseek-reasoner`, move to `deepseek-v4-flash`, and keep reasoning-related settings visible in code or config so the route stays debuggable.

Do not rely on an invisible alias bridge. The point of this migration is to make the chosen quality lane obvious and testable.

const response = await client.chat.completions.create({
  model: "deepseek-v4-flash",
  messages,
  thinking: { type: "enabled" },
  reasoning_effort: "high",
});

Sources checked

DeepSeek official Thinking Mode guide - Reference for current thinking-mode behavior and parameter framing.
DeepSeek official Create Chat Completion reference - Use the current API reference to confirm request-shape details.

4. Update Anthropic-compatible and Claude-style routes carefully

If the route uses DeepSeek's Anthropic-compatible base URL, keep the same migration principle: explicit V4 model names, not legacy aliases. That matters because unsupported-model fallbacks can hide routing mistakes if you do not verify the provider-side result.

For Claude-style setups, keep the main and subagent model split intentional instead of assuming every old reasoner path deserves Pro. Use Flash where you want cheaper repeated reasoning and Pro where the main lane truly benefits from it.

export ANTHROPIC_BASE_URL="https://api.deepseek.com/anthropic"
export ANTHROPIC_MODEL="deepseek-v4-pro[1m]"
export CLAUDE_CODE_SUBAGENT_MODEL="deepseek-v4-flash"

Sources checked

DeepSeek official Anthropic API guide - Use explicit DeepSeek model names on the Anthropic-compatible route.
DeepSeek official Claude Code integration guide - Useful for model-split examples across main and subagent lanes.

5. Decide with measurement, not with nostalgia for `deepseek-reasoner`

Log accepted output quality, latency, retries, and spend on a small evaluation set before you promote traffic to Pro. Many teams treat the old reasoner label as a quality badge instead of as a routing alias, which makes them over-upgrade.

If you still need the bigger migration picture after this page, continue with `/guides/deepseek-v4-pro-vs-flash`, `/guides/deepseek-thinking-mode-parameter-rules`, and `/guides/deepseek-chat-reasoner-404-fix`.

FAQ

Does `deepseek-reasoner` map to V4 Pro today?

No. DeepSeek's official change log says it currently maps to the thinking mode of `deepseek-v4-flash` during the transition period.

Should every old reasoner workflow move to Pro?

No. Many routes should test V4 Flash thinking mode first and only escalate the hardest traffic to Pro after measuring quality and spend.

What is the safest first replacement for `deepseek-reasoner` in OpenAI-format code?

Use `deepseek-v4-flash` explicitly and keep thinking-related settings visible so the route is easy to validate.

Does this migration page mean Flash or Pro is guaranteed in stock on `/pricing`?

No. Guide content explains routing. Purchasable Coding Plans still depend on actual stocked inventory.

The disciplined migration path is to treat `deepseek-reasoner` as a retiring compatibility alias, start with explicit V4 Flash thinking mode for the majority of those routes, and reserve V4 Pro for the smaller slice where measured quality justifies the extra cost.

Related model comparisons

Continue from this guide into structured DeepSeek-first comparison pages with model tables, routing advice, and pricing context.

DeepSeek V4 API pricing comparison: Pro, Flash, GPT 5.4, Claude, Gemini, Qwen, and more Best AI model for coding: DeepSeek V4-first comparison DeepSeek V4 vs Claude: 1M context, coding, review quality, and API cost

Compare current DeepSeek access paths