fromBusiness Insider2 months agoArtificial intelligenceAMD's CTO says AI inference will move out of data centers and increasingly to phones and laptops
fromTechzine Global1 month agoArtificial intelligenceAMD makes third acquisition in eight days as feeding frenzy continues
fromBusiness Insider2 months agoArtificial intelligenceAMD's CTO says AI inference will move out of data centers and increasingly to phones and laptops
fromTechzine Global1 month agoArtificial intelligenceAMD makes third acquisition in eight days as feeding frenzy continues
Artificial intelligencefromInfoQ1 month agoGoogle Enhances LiteRT for Faster On-Device InferenceLiteRT simplifies on-device ML inference with enhanced GPU and NPU support for faster performance and lower power consumption.
Artificial intelligencefromIT Pro2 months ago'TPUs just work': Why Google Cloud is betting big on its custom chipsGoogle's seventh generation TPU, 'Ironwood', aims to lead in AI workload efficiency and cost-effectiveness.TPUs were developed with a cohesive hardware-software synergy, enhancing their utility for AI applications.
fromTheregister3 months agoArtificial intelligenceA closer look at Dynamo, Nvidia's 'operating system' for AI
Artificial intelligencefromInfoQ1 month agoGoogle Enhances LiteRT for Faster On-Device InferenceLiteRT simplifies on-device ML inference with enhanced GPU and NPU support for faster performance and lower power consumption.
Artificial intelligencefromIT Pro2 months ago'TPUs just work': Why Google Cloud is betting big on its custom chipsGoogle's seventh generation TPU, 'Ironwood', aims to lead in AI workload efficiency and cost-effectiveness.TPUs were developed with a cohesive hardware-software synergy, enhancing their utility for AI applications.
fromTheregister3 months agoArtificial intelligenceA closer look at Dynamo, Nvidia's 'operating system' for AI
Artificial intelligencefromTechzine Global1 month agoRed Hat lays foundation for AI inferencing: Server and llm-d projectAI inferencing is pivotal in making artificial intelligence operational and optimizing performance for various applications.
Silicon ValleyfromBusiness Insider3 months ago2 reasons why Nvidia's Jensen Huang isn't worriedNvidia CEO Jensen Huang is confident in sustained demand for Nvidia chips due to new powerful GPUs and an industry shift towards AI inference.
Silicon ValleyfromBusiness Insider3 months ago2 reasons why Nvidia's Jensen Huang isn't worriedNvidia CEO Jensen Huang is confident in sustained demand for Nvidia chips due to new powerful GPUs and an industry shift towards AI inference.
fromTheregister9 months agoSupermicro crams 18 GPUs into a 3U boxSupermicro's SYS-322GB-NR manages to fit 18 GPUs in just 3U rack space, catering to machine learning and AI inference workloads with low latency requirements.Miscellaneous