Spark / SPARK-31216

RDDs in GradientBoostedTrees can be unpersisted earlier to save memory

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 2.4.5
    • Fix Version/s: None
    • Component/s: ML, MLlib
    • Labels: None

      Description

      In ml.tree.impl.GradientBoostedTrees.boost(), predErrorCheckpointer and input are unpersisted only at the very end of the method. They could instead be unpersisted before the if (validate) block, which would free the memory they hold for the computation inside that block.
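
      To illustrate the effect of the proposed reordering, here is a minimal simulation. It does not call Spark: MemoryTracker, FakeCachedData and simulated_boost are hypothetical stand-ins, and the sizes (100, 50, 80) are arbitrary units chosen only for illustration. It also assumes, as the description implies, that the if (validate) block does not need the two cached datasets.

```python
class MemoryTracker:
    """Tracks current and peak simulated memory use (arbitrary units)."""

    def __init__(self):
        self.current = 0
        self.peak = 0

    def allocate(self, units):
        self.current += units
        self.peak = max(self.peak, self.current)

    def release(self, units):
        self.current -= units


class FakeCachedData:
    """Hypothetical stand-in for a persisted dataset; not a Spark class."""

    def __init__(self, size, tracker):
        self.size = size
        self.tracker = tracker
        self.persisted = True
        tracker.allocate(size)

    def unpersist(self):
        if self.persisted:  # idempotent, so a second call is a no-op
            self.tracker.release(self.size)
            self.persisted = False


def simulated_boost(validate, unpersist_early):
    """Return the peak simulated memory for one ordering of the cleanup."""
    tracker = MemoryTracker()
    input_data = FakeCachedData(100, tracker)  # stands in for `input`
    pred_error = FakeCachedData(50, tracker)   # stands in for the predErrorCheckpointer data

    # (the boosting iterations would run here)

    if unpersist_early:
        # Proposed ordering: release the caches before the validate block.
        pred_error.unpersist()
        input_data.unpersist()

    if validate:
        tracker.allocate(80)  # assumed working memory of the validate block
        tracker.release(80)

    # Ordering described in the issue: release at the very end.
    pred_error.unpersist()
    input_data.unpersist()
    return tracker.peak


if __name__ == "__main__":
    print("late :", simulated_boost(validate=True, unpersist_early=False))
    print("early:", simulated_boost(validate=True, unpersist_early=True))
```

      With these illustrative numbers, the late ordering peaks at 230 units and the early ordering at 150, because the validate block no longer overlaps with the cached data. Actual savings in Spark would depend on the real dataset sizes and storage levels.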

            People

            • Assignee: Unassigned
            • Reporter: spark_cachecheck IcySanwitch
            • Votes: 0
            • Watchers: 1
