Uploaded image for project: 'Lucene.Net'
  1. Lucene.Net
  2. LUCENENET-350

Performance enhancement in FastVectorHighlighter

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      I've had some performace issues with highlighting large documents (>25MB plain text, > 11000 Terms per Field)

      This can be usual if your indexing i.e. log or trace files.
      Most of the time is spent loading the field value and stored TermVectors and Offets and itering over this List.
      I've build a TermVectorMapper which filters this List, by the searched terms, so that the time is reduces by approx. 30%

        Attachments

        1. VectorHighlightMapper.cs
          4 kB
          Bianco Veigel
        2. LUCENENET-350.patch
          9 kB
          Digy
        3. FieldTermStack.patch
          2 kB
          Bianco Veigel

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              bianco.veigel Bianco Veigel
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: