DeepSeek V4 Pro price cut goes live: about $0.90 headline pricing, cache hits even cheaper
DeepSeek has turned pricing into the top story: V4 Pro now sits around $0.90 equivalent in headline pricing, while cache-hit input drops to RMB 0.025 per 1M tokens. For repeated coding and agent workloads, the effective bill can fall much lower than the headline rate.
Official pricing update
DeepSeek V4 Pro is no longer only a quality and long-context story. It is now the pricing headline. The current official pricing page shows the same low Pro rates and adds a stronger forward-looking note: after the 75% discount window ends at 2026-05-31 23:59 (Beijing time), V4-Pro API pricing will be officially adjusted to 1/4 of the original price.
What is officially confirmed
- DeepSeek V4 Pro cache-hit input: RMB 0.025 / 1M tokens
- DeepSeek V4 Pro cache-miss input: RMB 3 / 1M tokens
- DeepSeek V4 Pro output: RMB 6 / 1M tokens
- DeepSeek V4 Flash remains the high-volume route with lower non-Pro rates
- DeepSeek still reserves the right to adjust product pricing, so this should be described as an official quarter-price reset, not a forever-price promise
Why this matters
For teams with stable system prompts, repeated retrieval blocks, or heavy prompt-prefix reuse, the cache-hit line matters as much as the headline price cut. The practical result is that DeepSeek V4 Pro can stay in the serving path for harder coding and reasoning tasks without blowing up the budget.
Editorial takeaway
This is the new DeepSeek-first lead story for the homepage: lower V4 Pro pricing, much cheaper cache-hit traffic, and a stronger economic argument against defaulting to premium alternatives for every request.