Solr / SOLR-1090

DataImportHandler should load the data-config.xml using UTF-8 encoding

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.3
    • Fix Version/s: 1.4
    • Labels:
      None

      Description

      Data may be indexed with the wrong encoding if data-config.xml contains non-ASCII (Unicode) characters and the platform default encoding is not UTF-8.

      Spin-off from http://www.lucidimagination.com/search/document/85b187a544fdc333/encoding_problem
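
To illustrate the failure mode described above, here is a minimal sketch, not taken from Solr. It assumes the config bytes were written as UTF-8 and uses ISO-8859-1 to stand in for a non-UTF-8 platform default; the class name `EncodingMismatchDemo` and its helper are hypothetical.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingMismatchDemo {
    // Decode raw bytes with an explicitly chosen charset.
    static String decode(byte[] bytes, Charset charset) {
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        // A config value containing a non-ASCII character (e with acute accent).
        String original = "caf\u00e9";
        byte[] utf8Bytes = original.getBytes(StandardCharsets.UTF_8);

        // Decoding with the wrong charset splits the two-byte UTF-8 sequence
        // into two separate characters, so the value no longer matches.
        String wrong = decode(utf8Bytes, StandardCharsets.ISO_8859_1);
        String right = decode(utf8Bytes, StandardCharsets.UTF_8);

        System.out.println(original.equals(wrong)); // prints "false"
        System.out.println(original.equals(right)); // prints "true"
    }
}
```

Any value read this way from data-config.xml would carry the mis-decoded characters into the index.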

      1. SOLR-1090.patch
        0.6 kB
        Shalin Shekhar Mangar

        Activity

        Shalin Shekhar Mangar added a comment -

        Fix in SolrWriter.getResourceAsString to use UTF-8 encoding. I'll commit shortly.
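
The general shape of such a fix can be sketched as follows. This is not the content of SOLR-1090.patch, which is not reproduced on this page; it only shows the technique of reading a stream into a string with an explicit UTF-8 charset instead of the platform default. The class `Utf8ResourceReader` and the method `readUtf8` are hypothetical names, and `StandardCharsets` assumes Java 7 or later.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;

public class Utf8ResourceReader {
    // Read the whole stream as text, always decoding as UTF-8.
    // Omitting the charset argument would fall back to the platform default.
    static String readUtf8(InputStream in) throws IOException {
        StringBuilder sb = new StringBuilder();
        try (Reader reader = new InputStreamReader(in, StandardCharsets.UTF_8)) {
            char[] buf = new char[4096];
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for a config file saved as UTF-8.
        String xml = "<entity name=\"caf\u00e9\"/>";
        byte[] bytes = xml.getBytes(StandardCharsets.UTF_8);

        String roundTrip = readUtf8(new ByteArrayInputStream(bytes));
        System.out.println(roundTrip.equals(xml)); // prints "true"
    }
}
```

Passing the charset explicitly makes the result independent of the JVM's default encoding setting.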

        Shalin Shekhar Mangar added a comment -

        Committed revision 759337.

        Grant Ingersoll added a comment -

        Bulk close for Solr 1.4


          People

          • Assignee:
            Shalin Shekhar Mangar
            Reporter:
            Shalin Shekhar Mangar
          • Votes:
            0
            Watchers:
            0

            Dates

            • Created:
              Updated:
              Resolved:
