LUCENE-8196 (Lucene - Core)

Add IntervalQuery and IntervalsSource to expose minimum interval semantics across term fields

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 7.4
    • Component/s: None
    • Labels: None
    • Lucene Fields: New

      Description

      This ticket proposes an alternative implementation of the SpanQuery family that uses minimum-interval semantics from http://vigna.di.unimi.it/ftp/papers/EfficientAlgorithmsMinimalIntervalSemantics.pdf to implement positional queries across term-based fields.  Rather than using TermQueries to construct the interval operators, as in LUCENE-2878 or the current Spans implementation, we instead use a new IntervalsSource object, which will produce IntervalIterators over a particular segment and field.  These are constructed using various static helper methods, and can then be passed to a new IntervalQuery which will return documents that contain one or more intervals so defined.
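
      To make "minimum-interval semantics" concrete, here is a minimal sketch for the simplest case: two terms that must occur in order within one document. It is an illustration only and is not taken from the attached patches. The class and method names (MinimalIntervalsSketch, orderedPair) are hypothetical, and the code does not use the IntervalsSource, IntervalIterator, or IntervalQuery API. It assumes each term's positions are given as a sorted array of non-negative integers, and it keeps only those intervals that contain no smaller matching interval.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: a hypothetical helper, not part of Lucene. */
final class MinimalIntervalsSketch {

    /**
     * Returns the minimal intervals [start, end] in which a position of the
     * first term (start) strictly precedes a position of the second term (end).
     * Both arrays are assumed to be sorted ascending and non-negative.
     */
    static List<int[]> orderedPair(int[] first, int[] second) {
        List<int[]> out = new ArrayList<>();
        int i = 0;          // index into first
        int lastStart = -1; // start of the most recently emitted interval
        for (int end : second) {
            // Advance to the last position of the first term that is before end.
            while (i + 1 < first.length && first[i + 1] < end) {
                i++;
            }
            if (first.length == 0 || first[i] >= end) {
                continue; // the first term does not occur before this position
            }
            // Reusing a start would yield an interval that contains the one
            // already emitted for that start, so it is not minimal: skip it.
            if (first[i] > lastStart) {
                out.add(new int[] {first[i], end});
                lastStart = first[i];
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // First term at positions 1 and 5, second term at positions 3, 4 and 8.
        for (int[] iv : orderedPair(new int[] {1, 5}, new int[] {3, 4, 8})) {
            System.out.println(iv[0] + "-" + iv[1]);
        }
    }
}
```

      With the positions used in main, the candidate interval 1-4 is dropped because it contains the smaller match 1-3, so only 1-3 and 5-8 remain.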

        Attachments

        1. LUCENE-8196-debug.patch
          4 kB
          Jim Ferenczi
        2. LUCENE-8196.patch
          82 kB
          Alan Woodward
        3. LUCENE-8196.patch
          80 kB
          Alan Woodward
        4. LUCENE-8196.patch
          94 kB
          Alan Woodward
        5. LUCENE-8196.patch
          102 kB
          Alan Woodward
        6. LUCENE-8196.patch
          102 kB
          Alan Woodward

            People

            • Assignee:
              romseygeek Alan Woodward
              Reporter:
              romseygeek Alan Woodward
            • Votes:
              0
              Watchers:
              9

                Time Tracking

                Estimated: Not Specified
                Remaining: 0h
                Logged: 0.5h