Canalys: Companies limit genAI use due to unclear costs
Briefly

A report from Canalys highlights that as companies transition generative AI tools from testing to real-world implementation, they encounter significant challenges in forecasting cloud costs associated with inference. Unlike the one-time costs of training, inference incurs ongoing operational fees that are critical for commercial viability. Canalys experts suggest that companies are increasingly focused on cost-effective solutions, exploring various AI models, cloud services, and hardware options like GPUs and customized accelerators. The unpredictable nature of usage-based pricing models complicates scaling AI services.
"Unlike training, which is a one-time investment, inference represents a recurring operational cost, making it a crucial constraint on the path to commercializing AI."
"As AI moves from research to large-scale deployment, companies are increasingly focusing on cost-effectiveness in inference, comparing models, cloud platforms, and hardware architectures such as GPUs versus custom accelerators."
Read at Computerworld
[
|
]