Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-17058

Request param to disable distributed IDF request at query time

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 9.6
    • query
    • None

    Description

      When distributed IDF is enabled in solr cloud by adding one of the cache implementations in solrconfig.xml https://solr.apache.org/guide/solr/latest/deployment-guide/solrcloud-distributed-requests.html#distributedidf,  each solr query will incur a distributed shard request to get term statistics

      "debug": {

              "track": {

                  "rid": "-54",

                  "PARSE_QUERY": {

                      "http://192.168.0.34:8987/solr/shard2_replica_n1/":

                     { "QTime": “2”,                                                                                                        

                       "ElapsedTime": "13",                                                                                                        

                       "RequestPurpose": "GET_TERM_STATS",     

                       …                             

       

           For queries that does not use distributed IDF information for scoring such as terms filter by id, the stats request is not necessary.  Hence I propose to add a distrib.statsCache request param so that the distributed stats request can be disabled at query time. 

      1. distrib.statsCache defaults to true. When the param is not present, there is no change to current distributed IDF behavior. 
      2. When explicitly set distrib.statsCache=false, distributed stats call is disabled for the current query.  

      Attachments

        Issue Links

          Activity

            People

              mkhl Mikhail Khludnev
              weiwang19 wei wang
              Votes:
              1 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 4h 20m
                  4h 20m