Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-4678

FST should use paged byte[] instead of single contiguous byte[]

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 4.2, 6.0
    • core/FSTs
    • None
    • New

    Description

      The single byte[] we use today has several limitations, eg it limits us to < 2.1 GB FSTs (and suggesters in the wild are getting close to this limit), and it causes big RAM spikes during building when a the array has to grow.

      I took basically the same approach as LUCENE-3298, but I want to break out this patch separately from changing all int -> long for > 2.1 GB support.

      Attachments

        1. LUCENE-4678.patch
          12 kB
          Michael McCandless
        2. LUCENE-4678.patch
          70 kB
          Michael McCandless
        3. LUCENE-4678.patch
          93 kB
          Michael McCandless
        4. LUCENE-4678.patch
          69 kB
          Michael McCandless
        5. LUCENE-4678.patch
          3 kB
          Michael McCandless

        Issue Links

          Activity

            People

              mikemccand Michael McCandless
              mikemccand Michael McCandless
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: