SPARK-3041

DecisionTree: isSampleValid indexing incorrect

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.1.0
    • Fix Version/s: 1.1.0
    • Component/s: MLlib
    • Labels: None

    Description

      In DecisionTree, isSampleValid treats unordered categorical features incorrectly: it treats the bins as if they were indexed by feature values, rather than by subsets of values/categories.
      This bug is exhibited for unordered features (multi-class classification with categorical features of low arity).
      Proposed fix: Index bins correctly for unordered categorical features.
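
      The indexing difference can be illustrated with a minimal sketch. This is not the MLlib code: the function names below are hypothetical, and the layout (one bin per non-trivial category subset, 2^(arity-1) - 1 bins in total, with the subset encoded in the bits of its index) is an assumption made for illustration.

```python
from typing import FrozenSet, List


def categories_for_subset(subset_index: int, arity: int) -> FrozenSet[int]:
    """Decode a subset index into the categories whose bit is set in it."""
    return frozenset(c for c in range(arity) if (subset_index >> c) & 1)


def build_unordered_bins(arity: int) -> List[FrozenSet[int]]:
    """Build one bin per candidate subset (assumed layout, not Spark's)."""
    num_bins = 2 ** (arity - 1) - 1
    # Subset index 0 (the empty set) is skipped, so bin i holds subset i + 1.
    return [categories_for_subset(i + 1, arity) for i in range(num_bins)]


def bin_by_feature_value(bins: List[FrozenSet[int]], feature_value: int) -> FrozenSet[int]:
    """The mistaken lookup: use the category value itself as the bin index."""
    return bins[feature_value]


def sample_in_bin(bins: List[FrozenSet[int]], bin_index: int, feature_value: int) -> bool:
    """The intended lookup: test membership in the subset stored at bin_index."""
    return feature_value in bins[bin_index]


bins = build_unordered_bins(3)
```

      With arity 3 this gives three bins holding the subsets {0}, {1} and {0, 1}. Using feature value 2 as a bin index selects {0, 1}, which does not contain category 2, so a check written that way accepts or rejects samples for the wrong reason.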

          People

            Assignee: Joseph K. Bradley (josephkb)
            Reporter: Joseph K. Bradley (josephkb)
            Votes: 0
            Watchers: 3

            Dates

              Created:
              Updated:
              Resolved: