AMD says its $4K Ryzen AI Halo workstation practically pays for itself

Ryzen AI Halo, an AMD AI workstation, will be available for pre-order for $3,999. The device targets local model development and agentic AI frameworks with a curated developer environment. It is positioned as cost-effective compared with cloud APIs, with a claim that eight hours per day of coding could save $750 per month. The system is built around a 120W Ryzen AI Max+ 395 APU, codenamed Strix Halo, paired with 128GB LPDDR5x 8000 MT/s memory. It includes 16 Zen 5 cores and 40 RDNA 3.5 compute units, delivering up to 256GB/s bandwidth. It can run models up to 200B parameters at 4-bit precision and provides about 56 teraFLOPS at 16-bit precision, though it is slower than the DGX Spark.

"AMD's answer to Nvidia's DGX Spark AI workstations, codenamed the Ryzen AI Halo, will be available for pre-order later next month for anyone with $3,999 burning a hole in their pocket. That might sound like a lot for an AI mini PC, but don't worry. Compared to cloud APIs, it practically pays for itself. Or, well, that's AMD's sales pitch. The House of Zen argues that if you spend eight hours a day vibe coding, the system could save you $750 a month."

"Much like the DGX Spark, which now retails for $4,699, up from $3,999 when we reviewed it last fall, AMD's rendition aims to provide a curated developer environment for running local models and agentic AI frameworks. This is really the core value proposition behind both of these devices. They aren't the most powerful or the fastest AI systems, but they're able to run models that a few years ago would have cost $20K or more."

"The diminutive system measures in at 5.9 x 5.9 x 1.7 inches (150 x 150 x 43 mm) and is powered by a 120 watt Ryzen AI Max+ 395 APU, better known by its codename Strix Halo. The chip is backed by 128 GB of LPDDR5x 8000 MT/s memory, which feeds both its 16 Zen 5 cores and 40 RDNA 3.5 GPU compute units, providing up to 256 GB/s of bandwidth, more than a Ryzen 9000 Threadripper (non-Pro) system."

"For local AI enthusiasts, that's enough to run models up to 200 billion parameters in size at 4-bit precision - just like the more expensive Spark. The bulk of the Ryzen AI Halo's compute comes from its integrated graphics, which are capable of delivering roughly 56 teraFLOPS at 16-bit precision. While impressive for onboard graphics, that's still between 55 and 88 percent slower than what the DGX Spark advertises."

#ai-workstations #local-model-inference #amd-ryzen-ai #strix-halo #developer-platforms

Read at theregister

Unable to calculate read time

Collection

[

...

]

AMD says its $4K Ryzen AI Halo workstation practically pays for itselfAMD says its $4K Ryzen AI Halo workstation practically pays for itself Briefly

AMD says its $4K Ryzen AI Halo workstation practically pays for itself
AMD says its $4K Ryzen AI Halo workstation practically pays for itself
Briefly