#inference-cost-reduction

[ follow ]
Artificial intelligence
fromZDNET
1 week ago

DeepSeek claims its new AI model can cut the cost of predictions by 75% - here's how

DeepSeek's DeepSeek-V3.2-Exp reduces inference compute using sparsity-based attention retraining, cutting predicted inference cost by about 75%.
[ Load more ]