Solr / SOLR-11386

Extracting learning to rank features fails when word ordering of EFI argument changed.


Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Workaround
    • Affects Version/s: 6.5.1
    • Fix Version/s: None
    • Component/s: contrib - LTR
    • Labels: None

    Description

      I'm getting some extremely strange behavior when trying to extract features for a learning to rank model. The following query incorrectly says all features have zero values:

      http://gss-test-fusion.usersys.redhat.com:8983/solr/access/query?q=added couple of fiber channel&rq={!ltr model=redhat_efi_model reRankDocs=1 efi.case_summary=the efi.case_description=added couple of fiber channel efi.case_issue=the efi.case_environment=the}&fl=id,score,[features]&rows=10
      

      But this query, which simply moves the word "added" from the front of the provided text to the back, properly fills in the feature values:

      http://gss-test-fusion.usersys.redhat.com:8983/solr/access/query?q=couple of fiber channel added&rq={!ltr model=redhat_efi_model reRankDocs=1 efi.case_summary=the efi.case_description=couple of fiber channel added efi.case_issue=the efi.case_environment=the}&fl=id,score,[features]&rows=10
      

      The explain output for the failing query can be found here:

      https://gist.github.com/manisnesan/18a8f1804f29b1b62ebfae1211f38cc4

      and the explain output for the properly functioning query can be found here:

      https://gist.github.com/manisnesan/47685a561605e2229434b38aed11cc65
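
To make the two requests above easier to reproduce and compare, here is a minimal sketch that builds both URLs from the same template and URL-encodes them. The host, collection, model name, and EFI arguments are copied from this report; `build_ltr_url` and `decode_params` are hypothetical helper names, and the sketch only constructs the URLs and does not contact Solr.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Endpoint copied from the report; it is an internal host and may not be reachable.
BASE_URL = "http://gss-test-fusion.usersys.redhat.com:8983/solr/access/query"


def build_ltr_url(text: str) -> str:
    """Build the feature-extraction URL from the report for one free-text input.

    Hypothetical helper for illustration; it is not part of Solr.
    """
    # The rq local params mirror the report exactly, including the unquoted
    # multi-word efi.case_description value.
    rq = (
        "{!ltr model=redhat_efi_model reRankDocs=1 "
        "efi.case_summary=the "
        f"efi.case_description={text} "
        "efi.case_issue=the efi.case_environment=the}"
    )
    params = {"q": text, "rq": rq, "fl": "id,score,[features]", "rows": 10}
    # urlencode percent-encodes braces, '=' and '!' and turns spaces into '+'.
    return f"{BASE_URL}?{urlencode(params)}"


def decode_params(url: str) -> dict:
    """Decode a URL's query string back into {name: value} for inspection."""
    return {k: v[0] for k, v in parse_qs(urlsplit(url).query).items()}


failing_url = build_ltr_url("added couple of fiber channel")
working_url = build_ltr_url("couple of fiber channel added")

print(failing_url)
print(working_url)
```

Generating both URLs from one template confirms that the only difference between the failing and working requests is the position of the word "added" in `q` and `efi.case_description`.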

      Attachments

        1. solr_efi_examples.zip (23 kB, Michael A. Alcorn)

          People

            Assignee: Unassigned
            Reporter: Michael A. Alcorn (malcorn_redhat)
            Votes: 0
            Watchers: 3