Uploaded image for project: 'Lucene - Core'
  1. Lucene - Core
  2. LUCENE-1058

New Analyzer for buffering tokens

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 2.3
    • modules/analysis
    • None

    Description

      In some cases, it would be handy to have Analyzer/Tokenizer/TokenFilters that could siphon off certain tokens and store them in a buffer to be used later in the processing pipeline.

      For example, if you want to have two fields, one lowercased and one not, but all the other analysis is the same, then you could save off the tokens to be output for a different field.

      Patch to follow, but I am still not sure about a couple of things, mostly how it plays with the new reuse API.

      See http://www.gossamer-threads.com/lists/lucene/java-dev/54397?search_string=BufferingAnalyzer;#54397

      Attachments

        1. LUCENE-1058.patch
          13 kB
          Grant Ingersoll
        2. LUCENE-1058.patch
          17 kB
          Grant Ingersoll
        3. LUCENE-1058.patch
          28 kB
          Grant Ingersoll
        4. LUCENE-1058.patch
          29 kB
          Grant Ingersoll
        5. LUCENE-1058.patch
          29 kB
          Grant Ingersoll
        6. LUCENE-1058.patch
          11 kB
          Grant Ingersoll
        7. LUCENE-1058.patch
          11 kB
          Grant Ingersoll

        Issue Links

          Activity

            People

              gsingers Grant Ingersoll
              gsingers Grant Ingersoll
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: