Details
Description
urlnormalizer-regex should replace multiple slashes after file: protocol by a single slash (file:/// -> file:/):
- required by
NUTCH-1483to get a consistent canonical form for file URL because URL.toString() also emits the single-slash form - would obsolete
NUTCH-1878
Attachments
Attachments
Issue Links
- supercedes
-
NUTCH-1878 urlnormalizer-regex to keep third slash in file:///path/index.html
- Closed