Lucene - Core
  1. Lucene - Core
  2. LUCENE-2023

Improve performance of SmartChineseAnalyzer

    Details

    • Type: Improvement Improvement
    • Status: Open
    • Priority: Minor Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: 4.9, 5.0
    • Component/s: modules/analysis
    • Labels:
      None
    • Lucene Fields:
      New, Patch Available

      Description

      I've noticed SmartChineseAnalyzer is a bit slow, compared to say CJKAnalyzer on chinese text.

      This patch improves the internal hhmm implementation.
      Time to index my chinese corpus is 75% of the previous time.

      1. LUCENE-2023.patch
        82 kB
        Robert Muir
      2. LUCENE-2023.patch
        76 kB
        Robert Muir
      3. LUCENE-2023.patch
        38 kB
        Robert Muir
      4. LUCENE-2023.patch
        23 kB
        Robert Muir
      5. LUCENE-2023.patch
        18 kB
        Robert Muir
      6. LUCENE-2023.patch
        14 kB
        Robert Muir
      7. LUCENE-2023.patch
        10 kB
        Robert Muir
      8. LUCENE-2023.patch
        10 kB
        Robert Muir

        Activity

        Robert Muir created issue -
        Robert Muir made changes -
        Field Original Value New Value
        Attachment LUCENE-2023.patch [ 12423694 ]
        Robert Muir made changes -
        Priority Major [ 3 ] Minor [ 4 ]
        Robert Muir made changes -
        Attachment LUCENE-2023.patch [ 12423713 ]
        Robert Muir made changes -
        Attachment LUCENE-2023.patch [ 12423776 ]
        Robert Muir made changes -
        Attachment LUCENE-2023.patch [ 12423778 ]
        Robert Muir made changes -
        Attachment LUCENE-2023.patch [ 12423781 ]
        Robert Muir made changes -
        Attachment LUCENE-2023.patch [ 12423822 ]
        Robert Muir made changes -
        Attachment LUCENE-2023.patch [ 12423833 ]
        Robert Muir made changes -
        Attachment LUCENE-2023.patch [ 12423848 ]
        Robert Muir made changes -
        Fix Version/s 3.1 [ 12314025 ]
        Fix Version/s 3.0 [ 12312889 ]
        Mark Thomas made changes -
        Workflow jira [ 12480906 ] Default workflow, editable Closed status [ 12563284 ]
        Mark Thomas made changes -
        Workflow Default workflow, editable Closed status [ 12563284 ] jira [ 12584393 ]
        Shai Erera made changes -
        Component/s modules/analysis [ 12310230 ]
        Component/s contrib/analyzers [ 12312333 ]
        Robert Muir made changes -
        Fix Version/s 4.1 [ 12321140 ]
        Fix Version/s 4.0 [ 12314025 ]
        Steve Rowe made changes -
        Fix Version/s 4.2 [ 12323899 ]
        Fix Version/s 4.1 [ 12321140 ]
        Robert Muir made changes -
        Fix Version/s 4.3 [ 12324143 ]
        Fix Version/s 4.2 [ 12323899 ]
        Uwe Schindler made changes -
        Fix Version/s 4.4 [ 12324323 ]
        Fix Version/s 4.3 [ 12324143 ]
        Steve Rowe made changes -
        Fix Version/s 5.0 [ 12321663 ]
        Fix Version/s 4.5 [ 12324742 ]
        Fix Version/s 4.4 [ 12324323 ]
        Adrien Grand made changes -
        Fix Version/s 4.6 [ 12324999 ]
        Fix Version/s 5.0 [ 12321663 ]
        Fix Version/s 4.5 [ 12324742 ]
        Simon Willnauer made changes -
        Fix Version/s 4.7 [ 12325572 ]
        Fix Version/s 4.6 [ 12324999 ]
        David Smiley made changes -
        Fix Version/s 4.8 [ 12326269 ]
        Fix Version/s 4.7 [ 12325572 ]
        Uwe Schindler made changes -
        Fix Version/s 4.9 [ 12326730 ]
        Fix Version/s 5.0 [ 12321663 ]
        Fix Version/s 4.8 [ 12326269 ]

          People

          • Assignee:
            Robert Muir
            Reporter:
            Robert Muir
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:

              Development