Details
-
Improvement
-
Status: Resolved
-
Minor
-
Resolution: Incomplete
-
1.1.0
-
None
Description
DecisionTree does not check for overflows or loss of precision while aggregating sufficient statistics (binAggregates). It uses Double, which may be a problem for DecisionTree regression since the variance calculation could blow up. At the least, it could check for overflow and renormalize as needed.
Attachments
Issue Links
- Is contained by
-
SPARK-14045 DecisionTree improvement umbrella
- Resolved