This startup claims it just outran Nvidia on its own turf
Briefly

Enterprises rely on GPUs to run thousands of parallel calculations for AI and analytics workloads. Generative AI models, recommendation engines, and analytics dashboards depend on data libraries to prepare, join, and transform massive datasets. Many legacy data libraries remain optimized for CPUs rather than GPUs, causing memory bandwidth and compute throughput to be underutilized. Frequent data movement between CPU and GPU erodes performance advantages. Nvidia introduced cuDF in 2018 to accelerate DataFrame operations on GPUs and improve utilization, but cuDF requires CUDA-compatible Nvidia hardware and faces architectural constraints. DataPelago's Nucleus claims substantially higher performance and broader hardware compatibility, potentially reshaping software-driven performance dynamics.
For years, enterprises have leaned on GPUs (Graphics Processing Units) to handle ever-growing mountains of data, leveraging their ability to run thousands of calculations in parallel for AI and analytics workloads. Every generative AI model, recommendation engine, and analytics dashboard depends on data libraries to prepare, join, and transform massive datasets. Yet the industry faces a quiet challenge: Despite advances in hardware, performance often stalls at scaling limits because the software stack struggles to fully exploit the hardware's capabilities.
To address this, Nvidia launched cuDF in 2018 as part of its open-source RAPIDS suite-a GPU-accelerated DataFrame library that quickly became the gold standard for data operations. It delivered speedups over CPU-based libraries and better utilization of GPU hardware. But cuDF also has limits. It requires an Nvidia GPU with ample memory and CUDA support, ruling out environments without compatible hardware. In many ways, cuDF became the industry's ceiling: powerful enough to accelerate AI and analytics pipelines, yet constrained by the quirks of GPU architecture itself.
DataPelago has created a new engine called Nucleus that dramatically speeds up data processing for AI and analytics. It outperforms Nvidia's cuDF library by large margins while working across different types of hardware. Today's GPUs are powerful, but older software often wastes their potential, making faster tools like Nucleus especially valuable. This shift could have dramatic implications for Nvidia.
Read at Fast Company
[
|
]