Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-1052

Add an "termInfosIndexDivisor" to IndexReader

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 2.2
    • Fix Version/s: None
    • Component/s: core/index
    • Labels:
      None
    • Lucene Fields:
      New

      Description

      The termIndexInterval, set during indexing time, let's you tradeoff
      how much RAM is used by a reader to load the indexed terms vs cost of
      seeking to the specific term you want to load.

      But the downside is you must set it at indexing time.

      This issue adds an indexDivisor to TermInfosReader so that on opening
      a reader you could further sub-sample the the termIndexInterval to use
      less RAM. EG a setting of 2 means every 2 * termIndexInterval is
      loaded into RAM.

      This is particularly useful if your index has a great many terms (eg
      you accidentally indexed binary terms).

      Spinoff from this thread:

      http://www.gossamer-threads.com/lists/lucene/java-dev/54371

        Attachments

        1. LUCENE-1052.patch
          14 kB
          Michael McCandless
        2. termInfosConfigurer.patch
          16 kB
          Chuck Williams
        3. LUCENE-1052.patch
          21 kB
          Michael McCandless

          Activity

            People

            • Assignee:
              mikemccand Michael McCandless
              Reporter:
              mikemccand Michael McCandless
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: