Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-7818

Distributed stats only calculates with the terms that are present in the last shard of the distributed request

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 5.0, 5.1, 5.2, 5.2.1
    • 5.3
    • None
    • None

    Description

      In ExactStatsCache#mergeToGlobalStats we go through the n responses and merge the termStats and colStats. But we keep putting the terms into the map which means only the last shard response terms will get used.

      This can lead to some terms not ending up calculating the distributed idf scores as the last shard might not have contained the term while the other shards actually had those terms.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            varun Varun Thacker
            varun Varun Thacker
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment