DeepSeek 官方限流文档现已明确写出账户级并发、user_id 隔离与 keep-alive 行为
截至 2026 年 6 月 11 日,DeepSeek 官方文档已给出更清晰的生产运行基线:V4 Pro 账户级并发 500、V4 Flash 2500,并明确说明 `user_id` 隔离和请求 keep-alive 的处理方式。
中文摘要
截至 2026 年 6 月 11 日,DeepSeek 官方文档已给出更清晰的生产运行基线:V4 Pro 账户级并发 500、V4 Flash 2500,并明确说明 user_id 隔离和请求 keep-alive 的处理方式。
阅读提示
这篇中文稿保留原始来源链接,并把 DeepSeek 官方发布、报道和市场传闻分开标注。购买相关判断仍以 /zh/pricing 的真实库存卡片为准;出现在新闻或基准中的模型不代表可购买。
英文原文
Accepted official-source monitoring note
Today's accepted item stays DeepSeek-first and uses current official DeepSeek documentation because the public X surface remains login-friction-heavy and did not expose a newer safely verifiable post than the already-published homepage anchor.
What we verified on June 11, 2026
- DeepSeek's official Rate Limit & Isolation page sets account-level concurrency at 500 for
deepseek-v4-proand 2500 fordeepseek-v4-flash. - The same official page now explains what
user_iddoes: content-safety isolation, KV-cache isolation, and scheduling isolation under one account. - DeepSeek also documents protocol-specific
user_idplacement:extra_body.user_idfor OpenAI-format calls andmetadata.user_idfor Anthropic-format calls. - The same page documents the request keep-alive behavior: non-streaming responses can emit empty lines, streaming responses can emit SSE keep-alive comments, and the server closes the connection if inference has not started after 10 minutes.
- DeepSeek's English homepage still points its public social anchor to
@deepseek_ai, which remains the safest official X confirmation even though the X page itself is not safely crawlable here.
Why this is publishable
This is not framed as a new product launch. It is a current official operations signal that matters for real DeepSeek deployments:
- It gives teams a trustworthy concurrency baseline instead of forcing them to infer limits from scattered SDK examples.
- It turns
user_idfrom an obscure field into a documented production control for isolation and privacy-sensitive workloads. - It gives the site a fresh official topic that is not a duplicate of the June 9 homepage-X anchor check or the June 10 Claude Code Web Search check.
Why this matters for DeepSeek-first SEO pages
- API operations pages should explain account-level concurrency separately from per-user application logic.
- Anthropic-format and OpenAI-format guides should show different
user_idwiring, because the field lands in different request shapes. - Error-handling and timeout guides should mention keep-alive lines and the 10-minute pre-inference cutoff so teams do not misclassify healthy long polls as broken connections.
Rejected candidates today
- The same April 24 homepage X anchor: still official, but already published on June 9 and therefore a duplicate-content risk.
- The same Claude Code Web Search page: still official, but already published on June 10.
- Status page uptime alone: official, but weaker and less actionable than the current Rate Limit & Isolation page.
- Community posts about DeepSeek concurrency or gateway tuning: helpful discovery leads only, but weaker than the official docs page that now states the limits directly.
Editorial takeaway
The safest official DeepSeek story today is an API operations documentation check: DeepSeek's own docs now say exactly how much concurrency V4 Pro and V4 Flash support per account, how user_id should be used for isolation, and what keep-alive behavior client code should tolerate while a request is waiting to run.