  Lucene - Core / LUCENE-3069

Lucene should have an entirely memory resident term dictionary


    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 4.0-ALPHA
    • Fix Version/s: 4.7
    • Component/s: core/index, core/search
    • Lucene Fields: New


      The FST-based TermDictionary has been a great improvement, yet it still uses a delta codec file for scanning to terms. Some environments have enough memory available to keep the entire FST-based term dictionary in memory. We should add a TermDictionary implementation that encodes all needed information for each term into the FST (via a custom fst.Output) and builds an FST from the entire term, not just the delta.
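
      To make the idea concrete, here is a minimal sketch of a term dictionary that lives entirely in memory and maps each whole term directly to its metadata. It is not the patch attached to this issue and it does not use Lucene's FST classes: a plain character trie stands in for the FST, and the names InMemoryTermDict and TermMeta, as well as the choice of metadata fields (docFreq, totalTermFreq, postingsPointer), are assumptions made for illustration.

```java
import java.util.Map;
import java.util.TreeMap;

/** Toy stand-in for a fully memory-resident term dictionary (hypothetical names). */
final class InMemoryTermDict {

    /** Per-term metadata that would otherwise be read from an on-disk terms file. */
    static final class TermMeta {
        final int docFreq;          // number of documents containing the term
        final long totalTermFreq;   // total occurrences of the term across all documents
        final long postingsPointer; // assumed offset of the term's postings list

        TermMeta(int docFreq, long totalTermFreq, long postingsPointer) {
            this.docFreq = docFreq;
            this.totalTermFreq = totalTermFreq;
            this.postingsPointer = postingsPointer;
        }
    }

    /** One trie node; a real FST would also share suffixes and store outputs on arcs. */
    private static final class Node {
        final Map<Character, Node> children = new TreeMap<>();
        TermMeta meta; // non-null only if a complete term ends at this node
    }

    private final Node root = new Node();

    /** Indexes the entire term (not just a prefix) together with its metadata. */
    void add(String term, TermMeta meta) {
        Node node = root;
        for (int i = 0; i < term.length(); i++) {
            node = node.children.computeIfAbsent(term.charAt(i), c -> new Node());
        }
        node.meta = meta;
    }

    /** Returns the metadata for an exact term, or null if the term is absent. */
    TermMeta lookup(String term) {
        Node node = root;
        for (int i = 0; i < term.length(); i++) {
            node = node.children.get(term.charAt(i));
            if (node == null) {
                return null;
            }
        }
        return node.meta;
    }
}
```

      A lookup here never touches a secondary file: walking the term's characters ends at a node that already holds everything needed to open the postings. A real FST would additionally share common suffixes and distribute the output along its arcs, which is where the memory savings over a trie would come from.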


        1. df-ttf-estimate.txt
          8 kB
          Han Jiang
        2. example.png
          458 kB
          Han Jiang
        3. LUCENE-3069.patch
          283 kB
          Han Jiang
        4. LUCENE-3069.patch
          282 kB
          Han Jiang
        5. LUCENE-3069.patch
          14 kB
          Han Jiang
        6. LUCENE-3069.patch
          101 kB
          Han Jiang
        7. LUCENE-3069.patch
          47 kB
          Han Jiang
        8. LUCENE-3069.patch
          9 kB
          Han Jiang
        9. LUCENE-3069.patch
          15 kB
          Han Jiang
        10. LUCENE-3069.patch
          12 kB
          Han Jiang
        11. LUCENE-3069.patch
          34 kB
          Han Jiang
        12. LUCENE-3069.patch
          33 kB
          Han Jiang
        13. LUCENE-3069.patch
          33 kB
          Han Jiang
        14. LUCENE-3069.patch
          4 kB
          Han Jiang
        15. LUCENE-3069.patch
          4 kB
          Han Jiang
        16. LUCENE-3069.patch
          40 kB
          Han Jiang




              Assignee: Han Jiang (billy)
              Reporter: Simon Willnauer (simonw)