Spark / SPARK-2152

Incorrect computation of rightNodeAgg in the decision tree algorithm in Spark MLlib

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.0.0
    • 1.0.2, 1.1.0
    • None
    • Windows 7, 32-bit operating system, 3 GB memory

    Description

      In the decision tree algorithm in Spark MLlib, the function extractLeftRightNodeAggregates() uses the wrong binData index when it computes rightNodeAgg. The code in question is in DecisionTree.scala, around line 980:

      rightNodeAgg(featureIndex)(2 * (numBins - 2 - splitIndex)) =
      binData(shift + (2 * (numBins - 2 - splitIndex))) +
      rightNodeAgg(featureIndex)(2 * (numBins - 1 - splitIndex))

      The index in binData(shift + (2 * (numBins - 2 - splitIndex))) is computed incorrectly, so rightNodeAgg ends up including repeated data from the bins.
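
The effect of the index can be illustrated with a simplified sketch. It is not the code from DecisionTree.scala: it assumes one statistic per bin instead of the interleaved two-per-bin layout (hence no factor of 2 and no shift), and it assumes that the right aggregate for split j should be the sum of bins j + 1 through numBins - 1. The object and function names are hypothetical, and the shifted index shown is one reading of the report, not a confirmed patch.

```scala
// Simplified sketch (hypothetical names, one value per bin).
object RightNodeAggSketch {

  // Mirrors the indexing quoted in the report: bin j is added for split j.
  def rightAggAsReported(binCounts: Array[Double]): Array[Double] = {
    val numBins = binCounts.length
    require(numBins >= 2, "need at least two bins")
    val agg = new Array[Double](numBins - 1)
    // Highest split: only the last bin lies to the right of it.
    agg(numBins - 2) = binCounts(numBins - 1)
    var splitIndex = 1
    while (splitIndex < numBins - 1) {
      val j = numBins - 2 - splitIndex
      agg(j) = binCounts(j) + agg(j + 1)
      splitIndex += 1
    }
    agg
  }

  // Same recurrence, but adding bin j + 1 (i.e. numBins - 1 - splitIndex),
  // under the assumption stated in the lead-in.
  def rightAggShifted(binCounts: Array[Double]): Array[Double] = {
    val numBins = binCounts.length
    require(numBins >= 2, "need at least two bins")
    val agg = new Array[Double](numBins - 1)
    agg(numBins - 2) = binCounts(numBins - 1)
    var splitIndex = 1
    while (splitIndex < numBins - 1) {
      val j = numBins - 2 - splitIndex
      agg(j) = binCounts(j + 1) + agg(j + 1)
      splitIndex += 1
    }
    agg
  }

  // Reference: sum the bins to the right of each split directly.
  def rightAggDirect(binCounts: Array[Double]): Array[Double] =
    Array.tabulate(binCounts.length - 1)(j => binCounts.drop(j + 1).sum)
}
```

With bin values 1, 2, 4, 8, rightAggDirect and rightAggShifted both give 14, 12, 8, while rightAggAsReported gives 11, 10, 8: for split 1 it adds bin 1, which already belongs to the left side, and skips bin 2.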

            People

              jsondag Jon Sondag
              mathcao caoli
              Votes: 0
              Watchers: 4

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated: 4h
                  Remaining: 4h
                  Logged: Not Specified