
DeepSeek announced a 75% price reduction for its flagship V4-Pro AI model, effective after a promotion period ending 2026/05/31 15:59 UTC. Before the change, usage costs ranged from $0.0145 per million tokens for cache hits to $3.48 per million output tokens. After the revision, V4-Pro pricing starts at $0.003625 per million tokens and rises to $0.87 per million tokens. The company framed the cut as a permanent efficiency gain rather than a temporary discount. V4-Pro was engineered to reduce long-context inference compute and memory footprint. DeepSeek also released V4 generation models, including V4 Pro and V4 Flash, and positioned V4 as open source for local use and modification, optimized for agent tool integrations.
"DeepSeek has reduced pricing for the model by 75%, just a month after unveiling the V4 generation, which includes V4 Pro and V4 Flash. Earlier, usage costs ranged from $0.0145 for one million tokens (cache hit) to $3.48 for one million output tokens. Following the revision, the V4 Pro will now cost starting at $0.003625 per million tokens and going up to $0.87 per million tokens, respectively. The Deepseek V4 Pro model API pricing will be officially adjusted to 1/4 of the original price after the 75% discount promotion ends on 2026/05/31 15:59 UTC, said the company."
""V4-Pro was engineered to cut the cost of long-context inference, reportedly running at roughly a quarter of the single-token compute and a tenth of the memory footprint of its predecessor at very long context. This is why the price cut is permanent rather than promotional. It is not a discount. It is an efficiency gain being passed through," said Sanchit Vir Gogia, chief analyst and CEO at Greyhound Research."
"Almost a year after introducing its R1 reasoning model offering performance and cost efficiency, DeepSeek released the preview of V4 LLM. Similar to the earlier models, even V4 is open source, which allows developers to download the code to run it locally and even modify it. The new models were optimized for use with popular agent tools such as Anthropic's Claude Code and OpenClaw."
""From a pure capabilities perspective, DeepSeek V4-Pro has effectively closed the performance gap on critical tasks like complex math and reasoning, while aggressively leading the market on openness and inference costs. Its specialized reasoning modes and architectural enhancements make it a formidable alternative to Western""
#ai-model-pricing #long-context-inference-efficiency #open-source-llms #reasoning-models #agent-tool-integrations
Read at Computerworld
Unable to calculate read time
Collection
[
|
...
]