Description
As a user, I can specify how CrossValidator creates folds by setting a foldCol, an integer-typed column whose values lie in the range [0, numFolds). If foldCol is specified, Spark skips the random k-fold split and uses the column values as the fold assignments. This is useful when fold creation needs custom logic, e.g., splitting randomly by user instead of randomly by event.
This is similar to SPARK-16206, which is for the RDD-based APIs.
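A minimal sketch of the idea in plain Python, not Spark's implementation: it shows how a user-level fold column could be built and how a fold column with values in [0, numFolds) determines the train/validation pairs. The helper names `assign_fold` and `split_by_fold`, the `user` and `fold` field names, and the use of a CRC32 hash are assumptions made for illustration only.

```python
import zlib


def assign_fold(user_id, num_folds):
    """Map a user to a fold index in [0, num_folds) (hypothetical helper).

    A deterministic hash keeps every event of one user in the same fold.
    """
    return zlib.crc32(user_id.encode("utf-8")) % num_folds


def split_by_fold(rows, fold_col, num_folds):
    """Yield (fold, train, validation) using the fold column, no random split."""
    for row in rows:
        fold = row[fold_col]
        if not isinstance(fold, int) or not 0 <= fold < num_folds:
            raise ValueError(
                f"{fold_col} must be an integer in [0, {num_folds}), got {fold!r}"
            )
    for k in range(num_folds):
        validation = [r for r in rows if r[fold_col] == k]
        train = [r for r in rows if r[fold_col] != k]
        yield k, train, validation


# Illustrative event data: several events per user.
events = [
    {"user": "alice", "label": 1.0},
    {"user": "alice", "label": 0.0},
    {"user": "bob", "label": 1.0},
    {"user": "carol", "label": 0.0},
    {"user": "carol", "label": 1.0},
    {"user": "dave", "label": 0.0},
]

num_folds = 3
rows = [dict(e, fold=assign_fold(e["user"], num_folds)) for e in events]
splits = list(split_by_fold(rows, "fold", num_folds))

for k, train, validation in splits:
    print(k, len(train), len(validation))
```

Because the fold index is derived from the user rather than from the individual event, no user appears on both sides of a split, which a random split over events cannot guarantee.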
Issue Links
- relates to: SPARK-16206 Defining our own folds using CrossValidator (Resolved)