Official2026-05-01

DeepSeek V4 Pro price cut goes live: about $0.90 headline pricing, cache hits even cheaper

DeepSeek has turned pricing into the top story: V4 Pro now sits around $0.90 equivalent in headline pricing, while cache-hit input drops to RMB 0.025 per 1M tokens. For repeated coding and agent workloads, the effective bill can fall much lower than the headline rate.

Official pricing update

DeepSeek V4 Pro is no longer only a quality and long-context story. It is now the pricing headline. The official pricing page shows a 2.5-discount promotion running through 2026-05-31 23:59 (Beijing time).

What is officially confirmed

  • DeepSeek V4 Pro cache-hit input: RMB 0.025 / 1M tokens
  • DeepSeek V4 Pro cache-miss input: RMB 3 / 1M tokens
  • DeepSeek V4 Pro output: RMB 6 / 1M tokens
  • DeepSeek V4 Flash remains the high-volume route with lower non-Pro rates

Why this matters

For teams with stable system prompts, repeated retrieval blocks, or heavy prompt-prefix reuse, the cache-hit line matters as much as the headline price cut. The practical result is that DeepSeek V4 Pro can stay in the serving path for harder coding and reasoning tasks without blowing up the budget.

Editorial takeaway

This is the new DeepSeek-first lead story for the homepage: lower V4 Pro pricing, much cheaper cache-hit traffic, and a stronger economic argument against defaulting to premium alternatives for every request.