Description
Linear/logistic regression use treeAggregate with default aggregation depth for collecting coefficient gradient updates to the driver. For high dimensional problems, this can case OOM error on the driver. We should make it configurable, perhaps via an expert param, so that users can avoid this problem if their data has many features.
Attachments
Issue Links
- relates to
-
SPARK-17175 Add a expert formula to aggregationDepth of SharedParam
- Resolved
- links to