SPARK-23730

Save and expose "in bag" tracking for random forest model

Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Incomplete
    • Affects Version/s: 2.3.0
    • Fix Version/s: None
    • Component/s: ML

    Description

      In a random forest model, it is often useful to keep track of which samples ended up in each of the bootstrap replications, and how many times each sample was drawn. For instance, the R randomForest package provides this through the option keep.inbag=TRUE.

      Similar functionality in Spark ML's random forest would be helpful.
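
      To illustrate what "in bag" tracking means, here is a minimal sketch in plain Python. It is not Spark code and does not reflect how Spark ML samples internally; the function name inbag_counts and its parameters are hypothetical, chosen only for this example. For each tree it draws a bootstrap replication of the training set and records how many times each sample was selected, which is the kind of information keep.inbag=TRUE retains in R.

```python
import random
from collections import Counter


def inbag_counts(num_samples, num_trees, seed=0):
    """Return one list per tree; entry i is how often sample i was drawn.

    Hypothetical helper for illustration only: each tree gets a bootstrap
    replication of size num_samples, drawn with replacement.
    """
    rng = random.Random(seed)
    counts = []
    for _ in range(num_trees):
        # Draw num_samples indices with replacement and tally them.
        draws = Counter(rng.randrange(num_samples) for _ in range(num_samples))
        counts.append([draws.get(i, 0) for i in range(num_samples)])
    return counts


inbag = inbag_counts(num_samples=6, num_trees=3)

# Samples with a count of zero are "out of bag" for that tree.
out_of_bag = [[i for i, c in enumerate(row) if c == 0] for row in inbag]

for tree_id, row in enumerate(inbag):
    print(f"tree {tree_id}: in-bag counts = {row}, out-of-bag = {out_of_bag[tree_id]}")
```

      With these counts saved alongside the model, out-of-bag samples for each tree can be recovered afterwards, as the last few lines show.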

            People

              Assignee: Unassigned
              Reporter: Julian King (alpha137)