
|
If you were logged in you would be able to see more operations.
|
|
|
| Resolution Date: |
08/Nov/07 03:03 PM
|
|
If fetching and parsing are run as seperate jobs, then redirection coming from meta refresh tag (i.e. <meta http-equiv="refresh" content="0;url=foo/">) is ignored, resulting in the loss of that ("foo/") url.
|
|
Description
|
If fetching and parsing are run as seperate jobs, then redirection coming from meta refresh tag (i.e. <meta http-equiv="refresh" content="0;url=foo/">) is ignored, resulting in the loss of that ("foo/") url. |
Show » |
|
I believe that in the long term, nutch should associate source url with the redirected url. But this patch (or a more correct version of this
) can be applied so that we do not lose urls in the short term.