  1. Nutch
  2. NUTCH-1483 Can't crawl filesystem with protocol-file plugin
  3. NUTCH-1880

URLUtil should not add additional slashes for file URLs

Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.9, 2.2.1
    • Fix Version/s: 2.3, 1.10
    • Component/s: protocol
    • Labels: None
    • Patch Info: Patch Available

    Description

      URLUtil.toASCII(String url) and URLUtil.toUNICODE(String url) add two slashes to a file URL that has only a single slash after the scheme: file:/path/index.html becomes file:///path/index.html. Both methods should keep the single slash so that their behavior is consistent with URL.toString(). See NUTCH-1483 for details.
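
      The effect can be reproduced with the JDK alone. The following is a minimal sketch, not the attached patch: the class FileUrlSlashes and its two methods are hypothetical names used only for illustration, and it assumes the extra slashes come from rebuilding the URL through the multi-argument java.net.URI constructor, which emits "//" whenever the host is non-null, even when it is empty.

```java
import java.net.MalformedURLException;
import java.net.URI;
import java.net.URISyntaxException;
import java.net.URL;

public class FileUrlSlashes {

    /** Rebuilds a URL from its components; an empty host still yields "//". */
    static String rebuild(String url) {
        try {
            URL u = new URL(url);
            URI p = new URI(u.getProtocol(), u.getUserInfo(), u.getHost(),
                    u.getPort(), u.getPath(), u.getQuery(), u.getRef());
            return p.toString();
        } catch (MalformedURLException | URISyntaxException e) {
            throw new IllegalArgumentException(e);
        }
    }

    /** Hypothetical guard: return host-less URLs unchanged instead of rebuilding them. */
    static String rebuildKeepingHostless(String url) {
        try {
            String host = new URL(url).getHost();
            if (host == null || host.isEmpty()) {
                return url; // nothing to convert, so keep the original form
            }
            return rebuild(url);
        } catch (MalformedURLException e) {
            throw new IllegalArgumentException(e);
        }
    }

    public static void main(String[] args) throws Exception {
        String fileUrl = "file:/path/index.html";
        System.out.println(new URL(fileUrl).toString());     // file:/path/index.html
        System.out.println(rebuild(fileUrl));                // file:///path/index.html
        System.out.println(rebuildKeepingHostless(fileUrl)); // file:/path/index.html
    }
}
```

      With a guard of this kind, URLs that do have a host (for example http://example.com/a) still go through the rebuild step, while host-less file URLs keep the form that URL.toString() produces.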

      Attachments

        1. NUTCH-1880-2x-v1.patch
          2 kB
          Sebastian Nagel
        2. NUTCH-1880-trunk-v1.patch
          2 kB
          Sebastian Nagel

          People

            Assignee: Unassigned
            Reporter: Sebastian Nagel (snagel)
            Votes: 0
            Watchers: 3

            Dates

              Created:
              Updated:
              Resolved: