Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-214

deficit of InputStreamReader support in anonymous class of ContentStream

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.2
    • None
    • None

    Description

      After SOLR-197 is applied, POSTed Japanese XML contents turn into garbled characters in the index.
      I can see the garbled characters through Luke. The issue was never seen before SOLR-197.
      The cause of this problem is that the deficit of InputStreamReader support in the anonymous class of ContentStream in SolrRequestParsers.parseParamsAndFillStreams() method.

      Before SOLR-197, InputStreamReader was used in XmlUpdateRequestHandler.handleRequestBody() method:

      // Cycle through each stream
      for( ContentStream stream : req.getContentStreams() ) {
      String charset = getCharsetFromContentType( stream.getContentType() );
      Reader reader = null;
      if( charset == null )

      { reader = new InputStreamReader( stream.getStream() ); }

      else

      { reader = new InputStreamReader( stream.getStream(), charset ); }

      rsp.add( "update", this.update( reader ) );

      // Make sure its closed
      try

      { reader.close(); }

      catch( Exception ex ){}
      }

      The patch will apply this effect to SolrRequestParsers.

      regards,

      Attachments

        1. UseInputStreamReader.patch
          1 kB
          Koji Sekiguchi

        Issue Links

          Activity

            People

              ryantxu Ryan McKinley
              koji Koji Sekiguchi
              Votes:
              2 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: