Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Major Major
    • Resolution: Invalid
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: HttpClient
    • Labels:
      None

      Description

      1.open this page in firefox or ie:http://rate.taobao.com/user-rate-96afc387087fe06b44c43f2b54592290--receivedOrPosted|0--buyerOrSeller|0.htm

      2.use httpclient to get or post this page

      3.then it will throw a exception because the uri has the char '|'

        Activity

        Hide
        Ortwin Glück added a comment -

        Properly escape non-URI characters. HttpClient is not a browser and thus does not, can not and will never try to fix invalid input.

        Show
        Ortwin Glück added a comment - Properly escape non-URI characters. HttpClient is not a browser and thus does not, can not and will never try to fix invalid input.
        Hide
        Remi Tassing added a comment -

        I'm having invalid uri error with url containing "..." (three dots). It works well in the browser so I guess it's a similar issue.
        My question is, how can this be solved in Nutch (or HttpClient)?

        @Ortwin: How do we perform the escape? I have no idea

        Show
        Remi Tassing added a comment - I'm having invalid uri error with url containing "..." (three dots). It works well in the browser so I guess it's a similar issue. My question is, how can this be solved in Nutch (or HttpClient)? @Ortwin: How do we perform the escape? I have no idea

          People

          • Assignee:
            Unassigned
            Reporter:
            yongyuan.jiang
          • Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

            • Created:
              Updated:
              Resolved:

              Development