  Solr / SOLR-15569

MultipleAdditiveTreesModel does not split as expected when the node value equals the split threshold

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 9.0, 8.9
    • Fix Version/s: None
    • Component/s: contrib - LTR
    • Labels: None

    Description

      As discussed on the mailing list (https://lists.apache.org/thread.html/r23e10a021445e7bcf26ce578320304bd77165bc0c2136621f7a6225e%40%3Cusers.solr.apache.org%3E), MultipleAdditiveTreesModel does not split as expected when the feature value evaluated at a tree node is equal to the node's threshold: it takes the left branch instead of the right. This can change the final score of the model significantly.

      This issue especially affects categorical or boolean features that take on integer values. For example, if a feature can only take the values 0 or 1 and the split threshold is set to 1.0, the right path will never be evaluated.
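
To make the boundary case concrete, here is a minimal sketch that contrasts the two comparison rules on a 0/1 feature with a threshold of 1.0. It is not Solr's code: the class `ThresholdSplitSketch`, the enum `Branch`, and both method names are hypothetical, and the use of `<=` to model the reported behavior is an assumption inferred from the description above.

```java
// Hypothetical sketch; not the actual MultipleAdditiveTreesModel implementation.
final class ThresholdSplitSketch {

  /** Branch taken at a single split node. */
  enum Branch { LEFT, RIGHT }

  /**
   * Inclusive rule (assumed to model the reported behavior):
   * a feature value equal to the threshold goes LEFT.
   */
  static Branch splitInclusive(float featureValue, float threshold) {
    return featureValue <= threshold ? Branch.LEFT : Branch.RIGHT;
  }

  /**
   * Strict rule (the behavior the reporter expects):
   * a feature value equal to the threshold goes RIGHT.
   */
  static Branch splitStrict(float featureValue, float threshold) {
    return featureValue < threshold ? Branch.LEFT : Branch.RIGHT;
  }

  public static void main(String[] args) {
    float threshold = 1.0f;
    // A boolean-like feature that can only be 0 or 1.
    for (float value : new float[] {0.0f, 1.0f}) {
      System.out.println(
          "value=" + value
              + " inclusive=" + splitInclusive(value, threshold)
              + " strict=" + splitStrict(value, threshold));
    }
  }
}
```

Under the inclusive rule both 0 and 1 take the left branch, so the right subtree is unreachable for such a feature. Under the strict rule the value 1 reaches the right subtree.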

      I will try to follow up with a test model that showcases the issue.

            People

              Assignee: Alessandro Benedetti (abenedetti)
              Reporter: Spyros Kapnissis (spyk)
              Votes: 0
              Watchers: 4

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 2h 20m