Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-4678

FST should use paged byte[] instead of single contiguous byte[]

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 4.2, 6.0
    • core/FSTs
    • None
    • New

    Description

      The single byte[] we use today has several limitations, eg it limits us to < 2.1 GB FSTs (and suggesters in the wild are getting close to this limit), and it causes big RAM spikes during building when a the array has to grow.

      I took basically the same approach as LUCENE-3298, but I want to break out this patch separately from changing all int -> long for > 2.1 GB support.

      Attachments

        1. LUCENE-4678.patch
          3 kB
          Michael McCandless
        2. LUCENE-4678.patch
          69 kB
          Michael McCandless
        3. LUCENE-4678.patch
          93 kB
          Michael McCandless
        4. LUCENE-4678.patch
          70 kB
          Michael McCandless
        5. LUCENE-4678.patch
          12 kB
          Michael McCandless

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            mikemccand Michael McCandless
            mikemccand Michael McCandless
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment