Details
- Bug
- Status: Resolved
- Major
- Resolution: Fixed
- 0.9.0, 1.0.0
- None
Description
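Example: zipping two 2-element RDDs, each with 4 partitions, returns a single mispaired element instead of Array((1,11), (2,12)):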
scala> sc.parallelize(1L to 2L,4).zip(sc.parallelize(11 to 12,4)).collect
res1: Array[(Long, Int)] = Array((2,11))
More generally, this happens whenever the number of partitions does not evenly divide the total number of elements in the RDD.
See https://groups.google.com/forum/#!msg/spark-users/demrmjHFnoc/Ek3ijiXHr2MJ
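The mismatch can be modelled without Spark. The sketch below is an assumption-based illustration, not Spark source: it supposes that the Long range is cut into fixed-size slices of ceil(n / numSlices) elements, while the Int range is cut by position so that slice i covers indices [i*n/numSlices, (i+1)*n/numSlices). The names sliceByFixedSize, sliceByPosition and zipPartitions are hypothetical helpers introduced only for this example.

```scala
object ZipSliceSketch {
  // Assumed scheme for the Int range: slice i covers
  // indices [i*n/numSlices, (i+1)*n/numSlices).
  def sliceByPosition[T](seq: Seq[T], numSlices: Int): Seq[Seq[T]] = {
    val n = seq.length
    (0 until numSlices).map { i =>
      val start = ((i * n.toLong) / numSlices).toInt
      val end = (((i + 1) * n.toLong) / numSlices).toInt
      seq.slice(start, end)
    }
  }

  // Assumed scheme for the Long range: each slice takes
  // ceil(n / numSlices) elements until the input is exhausted.
  def sliceByFixedSize[T](seq: Seq[T], numSlices: Int): Seq[Seq[T]] = {
    val sliceSize = (seq.length + numSlices - 1) / numSlices
    (0 until numSlices).map { i =>
      seq.slice(i * sliceSize, (i + 1) * sliceSize)
    }
  }

  // Model of zip: pair partitions by index, then pair the elements
  // inside each partition pair. Unmatched elements are dropped.
  def zipPartitions[A, B](left: Seq[Seq[A]], right: Seq[Seq[B]]): Seq[(A, B)] =
    left.zip(right).flatMap { case (l, r) => l.zip(r) }

  def main(args: Array[String]): Unit = {
    val longs = sliceByFixedSize((1L to 2L).toList, 4) // [1], [2], [], []
    val ints = sliceByPosition((11 to 12).toList, 4)   // [], [11], [], [12]
    println(zipPartitions(longs, ints))
  }
}
```

Under these assumptions only partition 1 holds an element on both sides, so the model yields the single pair (2, 11), matching the output reported above. When the element count is a multiple of the partition count, both schemes produce identical slices and the zip is correct.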
Issue Links
- depends upon SPARK-1837 NumericRange should be partitioned in the same way as other sequences (Resolved)