From Medium, 1 week ago
Apache Spark: Fix data skew issue using salting technique (practical example)
Data skew in Apache Spark is a performance issue where a few keys dominate the data distribution, leading to uneven partitions and slow queries, especially during operations that require shuffling.
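To make the idea concrete, here is a minimal plain-Python sketch of salting. It is not Spark code and not necessarily the article's own example. The partition count, the salt count, the toy byte-sum partitioner, and the sample keys are all assumptions chosen for illustration. The sketch appends a random salt to every key so that one hot key spreads over several partitions, then aggregates in two stages (first per salted key, then per original key).

```python
import random
from collections import Counter

NUM_PARTITIONS = 8  # assumed number of shuffle partitions (illustrative)
NUM_SALTS = 8       # assumed number of salt values per key (illustrative)


def partition_of(key: str) -> int:
    """Toy deterministic partitioner: byte sum modulo partition count.

    A stand-in for a real hash partitioner, used only so the result
    is reproducible across runs.
    """
    return sum(key.encode("utf-8")) % NUM_PARTITIONS


def partition_loads(keys):
    """Return the number of rows that land in each partition."""
    loads = Counter(partition_of(k) for k in keys)
    return [loads.get(p, 0) for p in range(NUM_PARTITIONS)]


# Skewed input: one "hot" key accounts for 90% of the rows.
rows = ["hot"] * 9000 + [f"key{i}" for i in range(1000)]

# Salting: append a random suffix so a single key maps to several salted keys.
rng = random.Random(0)  # fixed seed keeps the sketch reproducible
salted_rows = [f"{key}_{rng.randrange(NUM_SALTS)}" for key in rows]

unsalted_loads = partition_loads(rows)
salted_loads = partition_loads(salted_rows)

# Stage 1: aggregate per salted key (this is the work that gets spread out).
partial_counts = Counter(salted_rows)

# Stage 2: strip the salt and combine the partial results per original key.
final_counts = Counter()
for salted_key, count in partial_counts.items():
    original_key = salted_key.rsplit("_", 1)[0]
    final_counts[original_key] += count

print("largest partition without salt:", max(unsalted_loads))
print("largest partition with salt:   ", max(salted_loads))
```

The final counts match a direct count of the original keys, so salting changes how the work is distributed, not the result. In Spark the same pattern is typically expressed by adding a salt column before the shuffle and aggregating twice; the trade-off is an extra aggregation stage in exchange for more evenly sized partitions.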