Uploaded image for project: 'Lucene.Net'
  1. Lucene.Net
  2. LUCENENET-350

Performance enhancement in FastVectorHighlighter

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • None
    • None
    • None

    Description

      I've had some performace issues with highlighting large documents (>25MB plain text, > 11000 Terms per Field)

      This can be usual if your indexing i.e. log or trace files.
      Most of the time is spent loading the field value and stored TermVectors and Offets and itering over this List.
      I've build a TermVectorMapper which filters this List, by the searched terms, so that the time is reduces by approx. 30%

      Attachments

        1. VectorHighlightMapper.cs
          4 kB
          Bianco Veigel
        2. LUCENENET-350.patch
          9 kB
          Digy
        3. FieldTermStack.patch
          2 kB
          Bianco Veigel

        Activity

          People

            Unassigned Unassigned
            bianco.veigel Bianco Veigel
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: