Details
Question
Status: Resolved
Major
Resolution: Incomplete
2.4.0, 2.4.4
None
Description
The persist method behaves differently with MEMORY_ONLY than with MEMORY_ONLY_SER:
persist(StorageLevel.MEMORY_ONLY()).distinct().count() returns 1,
while persist(StorageLevel.MEMORY_ONLY_SER()).distinct().count() returns 100.
I expect both to return the same result. The correct result is 100; for some reason, MEMORY_ONLY causes all the objects in the RDD to be the same one.
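
The report does not show how the RDD is built, so the cause cannot be confirmed from it. One plausible explanation, offered only as an assumption, is that the upstream code hands out the same mutable object for every record (Hadoop record readers, for example, reuse their Writable instances). A cache that stores object references then ends up holding 100 references to one object, while a cache that serializes each record on arrival captures 100 different states. The sketch below imitates that effect in plain Java without Spark; `PersistSketch`, `Record`, `reusingSource` and the two counting methods are hypothetical names invented for illustration, and the two methods are only analogues of MEMORY_ONLY and MEMORY_ONLY_SER, not Spark's implementation.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.ArrayList;
import java.util.HashSet;
import java.util.Iterator;
import java.util.List;
import java.util.Set;

final class PersistSketch {

    // Hypothetical mutable record; not taken from the original report.
    static final class Record implements Serializable {
        private static final long serialVersionUID = 1L;
        int id;

        @Override public boolean equals(Object o) {
            return o instanceof Record && ((Record) o).id == id;
        }

        @Override public int hashCode() {
            return Integer.hashCode(id);
        }
    }

    // Assumed cause: a source that mutates and returns ONE shared Record.
    static Iterator<Record> reusingSource(final int n) {
        return new Iterator<Record>() {
            private final Record shared = new Record();
            private int next = 0;

            @Override public boolean hasNext() { return next < n; }

            @Override public Record next() {
                shared.id = next++;   // overwrite the previous value in place
                return shared;        // same reference every time
            }
        };
    }

    // Analogue of MEMORY_ONLY: the cache keeps object references.
    static int distinctCachingReferences(int n) {
        List<Record> cache = new ArrayList<>();
        Iterator<Record> it = reusingSource(n);
        while (it.hasNext()) {
            cache.add(it.next());
        }
        // Every entry is the same object, now holding the last id written.
        return new HashSet<>(cache).size();
    }

    // Analogue of MEMORY_ONLY_SER: each record is serialized when cached.
    static int distinctCachingSerialized(int n) {
        try {
            List<byte[]> cache = new ArrayList<>();
            Iterator<Record> it = reusingSource(n);
            while (it.hasNext()) {
                ByteArrayOutputStream bytes = new ByteArrayOutputStream();
                try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                    out.writeObject(it.next());  // snapshot of the current state
                }
                cache.add(bytes.toByteArray());
            }
            Set<Record> distinct = new HashSet<>();
            for (byte[] b : cache) {
                try (ObjectInputStream in =
                         new ObjectInputStream(new ByteArrayInputStream(b))) {
                    distinct.add((Record) in.readObject());
                }
            }
            return distinct.size();
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(distinctCachingReferences(100));  // prints 1
        System.out.println(distinctCachingSerialized(100));  // prints 100
    }
}
```

If this is what is happening in the reported job, copying each record into a fresh object before calling persist (for example in a map step) should make both storage levels return 100.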