In 2024, the majority of AI compute spend shifted to inference, Thomas Sohmers, CEO of chip startup Positron AI, told BI. He said this spending will "continue to grow on what looks like an exponential curve."
Sohmers and others are excited about the growth in computing needs in 2025, as OpenAI's o1 and o3, Google's Gemini 2.0 Flash Thinking, and other models use more compute-intensive strategies that improve results after training.