Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-20199

GradientBoostedTreesModel doesn't have featureSubsetStrategy parameter

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 2.1.0
    • Fix Version/s: 2.3.0
    • Component/s: ML, MLlib
    • Labels:
      None

      Description

      Spark GradientBoostedTreesModel doesn't have featureSubsetStrategy . It Uses random forest internally ,which have featureSubsetStrategy hardcoded "all". It should be provided by the user to have randomness at the feature level.

      This parameter is available in H2O and XGBoost.

      Sample from H2O.ai
      gbmParams._col_sample_rate

      Please provide the parameter .

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                pralabhkumar pralabhkumar
                Reporter:
                pralabhkumar pralabhkumar
              • Votes:
                2 Vote for this issue
                Watchers:
                7 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: