#rubin-cpx

Nvidia's context-optimized Rubin CPX GPUs were inevitable

Nvidia on Tuesday unveiled the Rubin CPX, a GPU designed specifically to accelerate extremely long-context AI workflows like those seen in code assistants such as Microsoft's GitHub Copilot, while simultaneously cutting back on pricey and power-hungry high-bandwidth memory (HBM). The first indication that Nvidia might be moving in this direction came when CEO Jensen Huang unveiled Dynamo during his GTC keynote in spring. The framework brought mainstream attention to the idea of disaggregated inference.

Artificial intelligence

from TechCrunch

8 months ago

Nvidia unveils new GPU designed for long-context inference | TechCrunch

Nvidia unveiled the Rubin CPX GPU, built for context windows exceeding one million tokens. It targets long-context workloads as part of a disaggregated inference infrastructure and is due to arrive at the end of 2026.
