SOLR-4928: Configure UnInvertedField to skip terms with too high or too low document frequency


Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved

    Description

      I want to facet on my tokenized text for tag clouds and for analytical purposes. Even though I only have 312k docs, UnInvertedField hit a limit – "Too many values for UnInvertedField faceting on field text". I guess some of these docs are bigger than I thought and have lots of distinct terms; I dunno.

      I'd like to add a new parameter named something like facet.uif.cache.minDf (named similarly to the existing facet.enum.cache.minDf).
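
      To make the idea concrete, here is a minimal sketch of the kind of filtering such a parameter could drive. It is not UnInvertedField's actual code: the class DfTermFilter, the method retainTerms, and the upper bound maxDf are hypothetical names used only for illustration, and the term-to-document-frequency map stands in for whatever the index would report.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Solr: it only illustrates the proposed filtering.
final class DfTermFilter {
    private DfTermFilter() {}

    /**
     * Returns the terms whose document frequency lies in [minDf, maxDf],
     * sorted alphabetically so the result is deterministic.
     */
    static List<String> retainTerms(Map<String, Integer> docFreqs, int minDf, int maxDf) {
        List<String> kept = new ArrayList<>();
        for (Map.Entry<String, Integer> entry : docFreqs.entrySet()) {
            int df = entry.getValue();
            // Skip terms that are too rare (df < minDf) or too common (df > maxDf).
            if (df >= minDf && df <= maxDf) {
                kept.add(entry.getKey());
            }
        }
        Collections.sort(kept);
        return kept;
    }
}
```

      Only the retained terms would then count toward the uninverted structure, which is what might keep a large tokenized field under the limit. Whether the thresholds should be absolute counts or fractions of the document count is left open here.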


          People

            Assignee: Unassigned
            Reporter: David Smiley (dsmiley)
            Votes: 0
            Watchers: 3
