  1. HttpComponents HttpClient
  2. HTTPCLIENT-947

HTTPClient downloads an empty page

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Invalid
    • Affects Version/s: 4.0.1
    • Fix Version/s: None
    • Component/s: None
    • Labels:
    • Environment: Mac OS X

      Description

      I am seeing strange behavior when HTTPClient downloads dynamically generated pages: depending on the page, it either retrieves the content or returns an empty page.

      My issue concerns the IEEE Xplore website: I can reliably access information for journal papers, but I cannot retrieve data for conference papers.
      This is an example of a page I can download:
      http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5456077
      and this is one of the pages that gives me trouble:
      http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=671096

      Please note that without the proxy that provides the authentication I am able to download both pages. When I have to authenticate through the proxy to see the data, HTTPClient correctly downloads only the journal papers.

      I attached the log of a simple application that first successfully accesses http://ieexplore.ieee.org/ and then tries to get http://ieexplore.ieee.org/xpls/abs_all.jsp?arnumber=840991&tag=1
      For privacy reasons, I replaced the username I use to authenticate with my university proxy with OMITTED.

        Attachments

        1. log
          43 kB
          Mario Sangiorgio

            People

            • Assignee: Unassigned
            • Reporter: Mario Sangiorgio (mariosangiorgio)
