Description
The fold assignment in the ML CrossValidator depends on a rand call whose seed is set to 0, which leads sql.functions.rand to call sc._jvm.functions.rand() with no seed. As a result, the folds are not reproducible from one run to the next.
To make a cross validation unit-testable, it should be possible to set this seed so that the output of the cross validation (with featureSubsetStrategy set to "all") is always the same.
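A minimal sketch of the suspected pitfall, assuming the seed is dropped by a truthiness check (0 is falsy in Python). It uses Python's standard random module rather than Spark, and rand_buggy, rand_fixed and assign_folds are hypothetical names made up for illustration, not PySpark functions. The fold bucketing only approximates how a validator could split rows from one uniform draw per row.

```python
import random


def rand_buggy(seed=None):
    """Hypothetical stand-in: `if seed` is False for seed=0, so the seed is dropped."""
    rng = random.Random(seed) if seed else random.Random()
    return rng.random()


def rand_fixed(seed=None):
    """Hypothetical fix: only a missing seed (None) means 'unseeded'."""
    rng = random.Random(seed) if seed is not None else random.Random()
    return rng.random()


def assign_folds(n_rows, n_folds, seed):
    """Assign each row to a fold from one seeded uniform draw per row."""
    rng = random.Random(seed)
    # min() guards the upper edge so every fold index stays below n_folds.
    return [min(int(rng.random() * n_folds), n_folds - 1) for _ in range(n_rows)]


if __name__ == "__main__":
    # With the fix, seed 0 is honoured and repeated calls agree.
    print(rand_fixed(0) == rand_fixed(0))
    # A seeded fold assignment is identical across runs, so it can be unit tested.
    print(assign_folds(10, 3, seed=0) == assign_folds(10, 3, seed=0))
```

With an explicit seed parameter of this kind, a unit test can compare the folds (and therefore the metrics) of two runs directly.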
Issue Links
- is related to SPARK-9691 PySpark SQL rand function treats seed 0 as no seed (Resolved)