Solr / SOLR-13108

RelatednessAgg ignores cacheDf, consults filterCache for every bucket/term

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 7.4, 8.0
    • Fix Version/s: None
    • Component/s: Facet Module
    • Security Level: Public (Default Security Level. Issues are Public)
    • Labels:
      None

      Description

      The relatedness aggregation function in the JSON Facet API ignores the cacheDf setting and consults the filterCache for every bucket. This is acceptable for the "Query" facet type, where buckets are explicitly enumerated and are therefore likely to be relatively low in cardinality. For the "Terms" facet type, however, the bucket count is determined by the corpus, so even a modestly high-cardinality field is very likely to blow out the filterCache.
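
      The effect described above can be illustrated with a toy model. This is a minimal sketch in Python, not Solr's implementation: `LRUCache`, `visit_buckets`, `run`, and `min_df_for_cache` are hypothetical names, and `min_df_for_cache` only stands in for the role that cacheDf is expected to play (skip the cache for terms whose document frequency is below a threshold). The cache size and term counts are made-up numbers chosen for illustration.

```python
from collections import OrderedDict


class LRUCache:
    """Tiny LRU cache that counts evictions (a stand-in for a filter cache)."""

    def __init__(self, max_size):
        self.max_size = max_size
        self.entries = OrderedDict()
        self.evictions = 0

    def get_or_compute(self, key, compute):
        if key in self.entries:
            self.entries.move_to_end(key)  # mark as most recently used
            return self.entries[key]
        value = compute()
        self.entries[key] = value
        if len(self.entries) > self.max_size:
            self.entries.popitem(last=False)  # evict least recently used
            self.evictions += 1
        return value


def visit_buckets(index, cache, min_df_for_cache=0):
    """Visit one bucket per term, as a terms facet would.

    Hypothetical threshold: terms with fewer than min_df_for_cache
    documents are computed directly and never placed in the cache.
    """
    for term, docs in index.items():
        if len(docs) >= min_df_for_cache:
            cache.get_or_compute(term, lambda: frozenset(docs))
        else:
            frozenset(docs)  # computed on the fly, cache untouched


def run(min_df_for_cache):
    # Assumed corpus: 1000 rare terms (1 doc each), 5 frequent terms (100 docs each).
    index = {f"rare{i}": [i] for i in range(1000)}
    index.update({f"common{i}": list(range(100)) for i in range(5)})

    cache = LRUCache(max_size=10)
    # Pre-populate with 10 entries representing unrelated, useful filters.
    for i in range(10):
        cache.get_or_compute(f"fq:{i}", lambda: frozenset())

    visit_buckets(index, cache, min_df_for_cache)
    surviving = sum(1 for key in cache.entries if key.startswith("fq:"))
    return cache.evictions, surviving


if __name__ == "__main__":
    print(run(min_df_for_cache=0))   # every bucket goes through the cache
    print(run(min_df_for_cache=50))  # only frequent terms go through the cache
```

      In this model, caching every bucket evicts all ten pre-existing entries, while a document-frequency threshold of 50 leaves half of them in place. Real cache behavior depends on the configured cache size and eviction policy, so the numbers here are illustrative only.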

      See also SOLR-9350, an issue with similar consequences.

        Attachments

        1. SOLR-13108.patch
          6 kB
          Michael Gibney
        2. SOLR-13108-shortcircuit.patch
          7 kB
          Michael Gibney

              People

              • Assignee:
                Unassigned
              • Reporter:
                Michael Gibney (mgibney)
              • Votes:
                0
              • Watchers:
                3