Solr / SOLR-9929

Documentation and sample code on how to train the model from user clicks when using the LTR module

    Details

    • Type: Task
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 6.4, 7.0
    • Component/s: None
    • Security Level: Public (Default Security Level. Issues are Public)

      Description

      Thanks very much for integrating machine learning into Solr.
      https://issues.apache.org/jira/browse/SOLR-8542

      I tried to integrate it, but I am having difficulty figuring out how to translate partial pairwise feedback into the importance or relevance of a doc.

      https://github.com/apache/lucene-solr/blob/f62874e47a0c790b9e396f58ef6f14ea04e2280b/solr/contrib/ltr/README.md
      In the "Assemble training data" section, the third column indicates the relative importance or relevance of that doc.
      Could you please give more information on how to assign a score based on what users click?

      I have read https://static.aminer.org/pdf/PDF/000/472/865/optimizing_search_engines_using_clickthrough_data.pdf
      http://www.cs.cornell.edu/people/tj/publications/joachims_etal_05a.pdf
      http://alexbenedetti.blogspot.com/2016/07/solr-is-learning-to-rank-better-part-1.html

      But I still have no clue.

      From a user's perspective, steps such as setting up the features and model in Solr are simple, but collecting the feedback data and training/updating the model is much more complex. Without that, we can't really use the learning-to-rank function in Solr.

      It would be great if Solr could provide detailed instructions and sample code showing how to translate partial pairwise feedback and use it to train and update the model.

      Thanks

          Activity

          cpoerschke Christine Poerschke added a comment -

          Thanks jefferyyuan and Diego Ceccarelli!

          jira-bot ASF subversion and git services added a comment -

          Commit 88450c70bb4daa3ca6c4750581bddeaad9bea6f9 in lucene-solr's branch refs/heads/branch_6x from Christine Poerschke
          [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=88450c7 ]

          SOLR-8542: expand 'Assemble training data' content in solr/contrib/ltr/README

          (Diego Ceccarelli via Christine Poerschke in response to SOLR-9929 enquiry from Jeffery Yuan.)

          jira-bot ASF subversion and git services added a comment -

          Commit 024c4031e55a998b73288fd276e30ffd626f0b91 in lucene-solr's branch refs/heads/master from Christine Poerschke
          [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=024c403 ]

          SOLR-8542: expand 'Assemble training data' content in solr/contrib/ltr/README

          (Diego Ceccarelli via Christine Poerschke in response to SOLR-9929 enquiry from Jeffery Yuan.)

          diegoceccarelli Diego Ceccarelli added a comment - - edited

          Thanks jefferyyuan for opening the issue. I submitted a patch to the learning-to-rank example README that tries to explain better how a user can produce a training set from feedback data. The new version is available here: https://github.com/bloomberg/lucene-solr/blob/master-ltr/solr/contrib/ltr/example/README.md

          Please let me know if you have comments or more questions. Thanks!
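
As a rough illustration of producing a training set from feedback data, the sketch below turns click logs into a graded relevance value for the third column. It is not taken from the README: the function name, the `(query, doc_id, clicked)` event shape, the doc ids, and the pipe-separated `query|docId|relevance|source` output layout are all assumptions made for the example, and the heuristic (click-through rate per document, normalised by the best rate seen for the same query) is only one simple choice among many.

```python
from collections import defaultdict


def click_logs_to_training_rows(events, source="CLICK_LOGS"):
    """Convert click events into hypothetical training rows.

    events: iterable of (query, doc_id, clicked) tuples, one per impression.
    Returns strings shaped like "query|doc_id|relevance|source" (assumed layout).
    """
    impressions = defaultdict(int)
    clicks = defaultdict(int)
    for query, doc_id, clicked in events:
        impressions[(query, doc_id)] += 1
        if clicked:
            clicks[(query, doc_id)] += 1

    # Click-through rate for every (query, doc) pair that was shown.
    ctr = {key: clicks[key] / shown for key, shown in impressions.items()}

    # Best click-through rate observed for each query.
    best = defaultdict(float)
    for (query, _doc_id), rate in ctr.items():
        best[query] = max(best[query], rate)

    rows = []
    for (query, doc_id), rate in sorted(ctr.items()):
        # Scale into [0, 1] relative to the most-clicked doc for this query.
        relevance = rate / best[query] if best[query] > 0 else 0.0
        rows.append(f"{query}|{doc_id}|{relevance:.4f}|{source}")
    return rows


if __name__ == "__main__":
    sample = [
        ("hard drive", "docA", True),
        ("hard drive", "docA", True),
        ("hard drive", "docB", True),
        ("hard drive", "docB", False),
        ("hard drive", "docC", False),
    ]
    for row in click_logs_to_training_rows(sample):
        print(row)
```

Raw click-through rates are biased by result position (documents shown higher get clicked more), so in practice the counts would usually need some correction before being used as labels.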


            People

            • Assignee: cpoerschke Christine Poerschke
            • Reporter: yuanyun.cn jefferyyuan
            • Votes: 0
            • Watchers: 4
