Solr
  1. Solr
  2. SOLR-882

HTMLStripReader improvement - padding corrected for hexadecimal entities, option not to emit padding at all added

    Details

    • Type: Improvement Improvement
    • Status: Closed
    • Priority: Trivial Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 3.6, 4.0-ALPHA
    • Component/s: None
    • Labels:
      None

      Description

      Improvements to HTMLStripHighlighter:

      • fix padding of hexadecimal entities (currently off by 1)
      • add an option not to emit padding at all. In certain applications padding emitted after entities such as ó may split words that are in fact single terms.
      • add entities that are recognized when written all in uppercase and recognized by browsers.
      1. patch
        18 kB
        Dawid Weiss

        Issue Links

          Activity

          Dawid Weiss created issue -
          Dawid Weiss made changes -
          Field Original Value New Value
          Attachment patch [ 12394691 ]
          Dawid Weiss made changes -
          Attachment patch [ 12394725 ]
          Dawid Weiss made changes -
          Attachment patch [ 12394691 ]
          Dawid Weiss made changes -
          Link This issue relates to SOLR-887 [ SOLR-887 ]
          Grant Ingersoll made changes -
          Assignee Grant Ingersoll [ gsingers ]
          Shalin Shekhar Mangar made changes -
          Fix Version/s 1.5 [ 12313566 ]
          Hoss Man made changes -
          Fix Version/s Next [ 12315093 ]
          Fix Version/s 1.5 [ 12313566 ]
          Hoss Man made changes -
          Fix Version/s 3.2 [ 12316172 ]
          Fix Version/s Next [ 12315093 ]
          Robert Muir made changes -
          Fix Version/s 3.3 [ 12316471 ]
          Fix Version/s 3.2 [ 12316172 ]
          Robert Muir made changes -
          Fix Version/s 3.4 [ 12316683 ]
          Fix Version/s 4.0 [ 12314992 ]
          Fix Version/s 3.3 [ 12316471 ]
          Robert Muir made changes -
          Fix Version/s 3.5 [ 12317876 ]
          Fix Version/s 3.4 [ 12316683 ]
          Simon Willnauer made changes -
          Fix Version/s 3.6 [ 12319065 ]
          Fix Version/s 3.5 [ 12317876 ]
          Grant Ingersoll made changes -
          Assignee Grant Ingersoll [ gsingers ]
          Steve Rowe made changes -
          Link This issue is required by LUCENE-3690 [ LUCENE-3690 ]
          Steve Rowe made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Assignee Steven Rowe [ steve_rowe ]
          Resolution Fixed [ 1 ]
          Uwe Schindler made changes -
          Status Resolved [ 5 ] Closed [ 6 ]

            People

            • Assignee:
              Steve Rowe
              Reporter:
              Dawid Weiss
            • Votes:
              1 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development