  1. Nutch
  2. NUTCH-2478

// is not a valid base URL


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.13
    • Fix Version/s: 1.14
    • Component/s: None
    • Labels: None

    Description

      This test fails:

        @Test
        public void testBadResolver() throws Exception {
          // java.net.URL rejects a base without a protocol (MalformedURLException)
          URL base = new URL("//www.example.org/");
          String target = "index/produkt/kanaly/";
          
          URL abs = URLUtil.resolveURL(base, target);
          Assert.assertEquals("http://www.example.org/index/produkt/kanaly/", abs.toString());
        }
      

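      It has to fail, because // is not a valid base URL. When the base is invalid, the current URL is used as the base instead. If the path of the current URL is not /, that path is prepended to the relative link, so the crawler ends up fetching a URL that returns a 404.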
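
The failure mode described above can be reproduced with plain java.net.URL, without Nutch. This is only a sketch: the class name, the helper names and the page URL http://www.example.org/some/page/ are made up for illustration, and the fallback is modeled here as a simple resolution against the page URL.

```java
import java.net.MalformedURLException;
import java.net.URL;

class BadBaseDemo {

  /** Returns true if java.net.URL accepts the string as an absolute URL. */
  static boolean isValidBase(String spec) {
    try {
      new URL(spec);
      return true;
    } catch (MalformedURLException e) {
      // "//www.example.org/" ends up here because it has no protocol
      return false;
    }
  }

  /** Models the fallback: resolve the link against the current page URL (hypothetical helper). */
  static String resolveAgainstPage(String pageUrl, String target)
      throws MalformedURLException {
    return new URL(new URL(pageUrl), target).toString();
  }

  public static void main(String[] args) throws Exception {
    // prints "false"
    System.out.println(isValidBase("//www.example.org/"));
    // prints "http://www.example.org/some/page/index/produkt/kanaly/"
    System.out.println(
        resolveAgainstPage("http://www.example.org/some/page/", "index/produkt/kanaly/"));
  }
}
```

The second line of output shows the page path /some/page/ prepended to the link, which is the wrong URL that then gets crawled.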

      This ticket must allow a base that starts with // (a URL without a protocol) and resolve the missing protocol.
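
One possible way to resolve the protocol is sketched below. It is not the patch that closed this issue: the class SchemeRelativeBase and the method resolveBase are hypothetical, and the sketch assumes the missing protocol should be taken from the URL of the page being parsed.

```java
import java.net.MalformedURLException;
import java.net.URL;

class SchemeRelativeBase {

  /**
   * Hypothetical helper: turns a base such as "//www.example.org/" into an
   * absolute URL by borrowing the protocol of the current page URL.
   * Any other base is resolved against the page URL in the usual way.
   */
  static URL resolveBase(URL pageUrl, String base) throws MalformedURLException {
    if (base.startsWith("//")) {
      // Assumption: the page's own protocol is the right one to reuse
      return new URL(pageUrl.getProtocol() + ":" + base);
    }
    return new URL(pageUrl, base);
  }

  public static void main(String[] args) throws Exception {
    URL page = new URL("http://www.example.org/some/page/");
    URL base = resolveBase(page, "//www.example.org/");
    URL abs = new URL(base, "index/produkt/kanaly/");
    // prints "http://www.example.org/index/produkt/kanaly/"
    System.out.println(abs);
  }
}
```

With the base resolved this way, the relative link from the test no longer inherits the path of the current page.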

          People

            Assignee: markus17 Markus Jelsma
            Reporter: markus17 Markus Jelsma
            Votes: 0
            Watchers: 4
