Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-9950

Support both single- and multi-value string fields in facet counting (non-taxonomy based approaches)

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 9.0
    • 9.0, 8.9
    • modules/facet
    • None
    • New

    Description

      Users wanting to facet count string-based fields using a non-taxonomy-based approach can use SortedSetDocValueFacetCounts, which accumulates facet counts based on a SortedSetDocValues field. This requires the stored doc values to be multi-valued (i.e., SORTED_SET), and doesn't work on single-valued fields (i.e., SORTED). In contrast, if a user wants to facet count on a stored numeric field, they can use LongValueFacetCounts, which supports both single- and multi-valued fields (and in LUCENE-9948, we now auto-detect instead of asking the user to specify).

      Let's update SortedSetDocValueFacetCounts to also support, and automatically detect single- and multi-value fields. Note that this is a spin-off issue from LUCENE-9946, where rcmuir points out that this can essentially be a one-line change, but we may want to do some class renaming at the same time. Also note that we should do this in ConcurrentSortedSetDocValuesFacetCounts while we're at it.

      Attachments

        Activity

          People

            Unassigned Unassigned
            gsmiller Greg Miller
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 3h
                3h