Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Fixed
-
2.4.0
-
None
Description
The PR https://github.com/apache/spark/pull/21913 replaced Scala parallel collections in UnionRDD by new parmap function. This changes cause a deadlock in the partitions method. The code demonstrates the problem:
val wide = 20 def unionRDD(num: Int): UnionRDD[Int] = { val rdds = (0 until num).map(_ => sc.parallelize(1 to 10, 1)) new UnionRDD(sc, rdds) } val level0 = (0 until wide).map { _ => val level1 = (0 until wide).map(_ => unionRDD(wide)) new UnionRDD(sc, level1) } val rdd = new UnionRDD(sc, level0) rdd.partitions.length
Attachments
Issue Links
- relates to
-
SPARK-25286 Remove dangerous parmap
- Resolved
- links to