Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-1052

Add an "termInfosIndexDivisor" to IndexReader

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • 2.2
    • None
    • core/index
    • None
    • New

    Description

      The termIndexInterval, set during indexing time, let's you tradeoff
      how much RAM is used by a reader to load the indexed terms vs cost of
      seeking to the specific term you want to load.

      But the downside is you must set it at indexing time.

      This issue adds an indexDivisor to TermInfosReader so that on opening
      a reader you could further sub-sample the the termIndexInterval to use
      less RAM. EG a setting of 2 means every 2 * termIndexInterval is
      loaded into RAM.

      This is particularly useful if your index has a great many terms (eg
      you accidentally indexed binary terms).

      Spinoff from this thread:

      http://www.gossamer-threads.com/lists/lucene/java-dev/54371

      Attachments

        1. LUCENE-1052.patch
          21 kB
          Michael McCandless
        2. LUCENE-1052.patch
          14 kB
          Michael McCandless
        3. termInfosConfigurer.patch
          16 kB
          Chuck Williams

        Activity

          People

            mikemccand Michael McCandless
            mikemccand Michael McCandless
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: