Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-10552

numBuckets is not consistent between distrib and non-distrib requests

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 5.1
    • Fix Version/s: 6.6
    • Component/s: None
    • Security Level: Public (Default Security Level. Issues are Public)
    • Labels:
      None

      Description

      The main problem is mincount... in a non-distrib query, numBuckets reflects the number of buckets that are screened out after mincount is applied. In distributed mode, we can't do this (or rather, the only way to do it would be to tramsmit all bucket counts to an aggregator node).

      We should perhaps just make numBuckets always pre-mincount to be consistent, and use hyper-log-log by default?

        Attachments

          Activity

            People

            • Assignee:
              yseeley@gmail.com Yonik Seeley
              Reporter:
              yseeley@gmail.com Yonik Seeley
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: