Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Incomplete
Affects Version/s: 2.4.4, 2.4.5
Fix Version/s: None
Description
In mllib.evaluation.BinaryClassificationMetrics, cumulativeCounts is cached as part of a lazy initialization. However, when I run LogisticRegressionSummaryExample and ModelSelectionViaCrossValidationExample, I find that the cached cumulativeCounts is used by only one action during execution.
So I think it should not be cached during initialization. Instead, we could add a persist() API to this class, mirroring the existing unpersist() API in BinaryClassificationMetrics, which releases the cached cumulativeCounts.
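To make the proposal concrete, here is a minimal plain-Python sketch of opt-in caching. It does not use Spark and is not the actual BinaryClassificationMetrics code: ToyDataset, ToyBinaryMetrics, and every method name below are hypothetical stand-ins, and the toy assumes a non-empty input with at least one positive label and does not group tied scores. The only point is that an intermediate result consumed by a single action gains nothing from caching, while a caller who runs several actions can request it explicitly.

```python
class ToyDataset:
    """Hypothetical stand-in for an RDD: recomputed on every action unless persisted."""

    def __init__(self, compute):
        self._compute = compute      # function that produces the data
        self._persisted = False
        self._cache = None
        self.evaluations = 0         # how many times compute() actually ran

    def persist(self):
        self._persisted = True
        return self

    def unpersist(self):
        self._persisted = False
        self._cache = None
        return self

    def collect(self):
        # Serve from the cache only if the caller opted in and it is populated.
        if self._persisted and self._cache is not None:
            return self._cache
        self.evaluations += 1
        data = self._compute()
        if self._persisted:
            self._cache = data
        return data


class ToyBinaryMetrics:
    """Hypothetical metrics class where caching cumulative counts is opt-in."""

    def __init__(self, score_and_labels):
        pairs = list(score_and_labels)
        self._cumulative_counts = ToyDataset(lambda: self._build(pairs))

    @staticmethod
    def _build(pairs):
        # Walk scores from highest to lowest, accumulating positive/negative counts.
        pos = neg = 0
        out = []
        for score, label in sorted(pairs, key=lambda p: -p[0]):
            if label > 0.5:
                pos += 1
            else:
                neg += 1
            out.append((score, pos, neg))
        return out

    @property
    def evaluations(self):
        return self._cumulative_counts.evaluations

    def persist(self):
        self._cumulative_counts.persist()
        return self

    def unpersist(self):
        self._cumulative_counts.unpersist()
        return self

    def recall_by_threshold(self):
        counts = self._cumulative_counts.collect()
        total_pos = counts[-1][1]
        return [(s, p / total_pos) for s, p, _ in counts]

    def precision_by_threshold(self):
        counts = self._cumulative_counts.collect()
        return [(s, p / (p + n)) for s, p, n in counts]
```

With a single action (as observed in the two examples above), the counts are computed once whether or not they are cached, so the cache only costs memory. With several actions, calling persist() first avoids the recomputation, and unpersist() releases the cache afterwards.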