Artificial intelligencefromIT Pro1 week agoDeepSeek's R1 model training costs pour cold water on big tech's massive AI spendingDeepSeek trained its R1 reasoning model for about $294,000 using 512 Nvidia H800 chips, plus ~$6M for its base LLM.
Artificial intelligencefromTechzine Global1 week agoDeepSeek breaks through cost barrier in AI raceDeepSeek trained its R1 reasoning model for $294,000 using 512 Nvidia H800 chips in 80 hours, employing distillation and some A100s.