Details

    • Type: Improvement
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.3
    • Component/s: modules/analysis
    • Labels:
      None

      Description

      In some cases, it would be handy to have Analyzer/Tokenizer/TokenFilters that could siphon off certain tokens and store them in a buffer to be used later in the processing pipeline.

      For example, if you want to have two fields, one lowercased and one not, but all the other analysis is the same, then you could save off the tokens to be output for a different field.

      Patch to follow, but I am still not sure about a couple of things, mostly how it plays with the new reuse API.

      See http://www.gossamer-threads.com/lists/lucene/java-dev/54397?search_string=BufferingAnalyzer;#54397

        Attachments

        1. LUCENE-1058.patch
          11 kB
          Grant Ingersoll
        2. LUCENE-1058.patch
          11 kB
          Grant Ingersoll
        3. LUCENE-1058.patch
          29 kB
          Grant Ingersoll
        4. LUCENE-1058.patch
          29 kB
          Grant Ingersoll
        5. LUCENE-1058.patch
          28 kB
          Grant Ingersoll
        6. LUCENE-1058.patch
          17 kB
          Grant Ingersoll
        7. LUCENE-1058.patch
          13 kB
          Grant Ingersoll

          Issue Links

            Activity

              People

              • Assignee:
                gsingers Grant Ingersoll
                Reporter:
                gsingers Grant Ingersoll
              • Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: