Description
MoreIndexingFilter must gracefully handle URLs that return the following erroneous dates:
can't parse erroneous date: Sun, 27 Jun 2010 06:51:35 GMT+1
can't parse erroneous date: ma, 27 jun 2011 05:15:32 GMT
can't parse erroneous date: "Mon, 23 May 2011 22:05:58 GMT"
can't parse erroneous date: GMT
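One possible way to tolerate the values above is sketched below. This is not the MoreIndexingFilter code: the class `LenientLastModified` and its `parse` method are hypothetical names, and the normalization rules are assumptions (surrounding quotes are stripped, the day name before the comma is discarded rather than translated, and a trailing `GMT+1` is read as the offset `+0100`). Anything that still cannot be parsed, such as a bare `GMT`, yields an empty result instead of an exception.

```java
import java.time.DateTimeException;
import java.time.Instant;
import java.time.ZonedDateTime;
import java.time.format.DateTimeFormatter;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of Nutch.
final class LenientLastModified {

  // Trailing "GMT+h" / "GMT-hh" offset written without minutes.
  private static final Pattern GMT_OFFSET = Pattern.compile("GMT([+-])(\\d{1,2})$");

  /** Returns the parsed instant, or empty if the value cannot be repaired. */
  static Optional<Instant> parse(String raw) {
    if (raw == null) {
      return Optional.empty();
    }
    String s = raw.trim();

    // Assumption: one pair of surrounding double quotes is noise.
    if (s.length() >= 2 && s.startsWith("\"") && s.endsWith("\"")) {
      s = s.substring(1, s.length() - 1).trim();
    }

    // Assumption: the day name is redundant, so a localized one ("ma") is dropped.
    int comma = s.indexOf(',');
    if (comma >= 0) {
      s = s.substring(comma + 1).trim();
    }

    // Assumption: "GMT+1" means an offset of +01:00; rewrite it as "+0100".
    Matcher m = GMT_OFFSET.matcher(s);
    if (m.find()) {
      int hours = Integer.parseInt(m.group(2));
      String padded = (hours < 10 ? "0" : "") + hours + "00";
      s = s.substring(0, m.start()) + m.group(1) + padded;
    }

    try {
      // RFC_1123_DATE_TIME treats the day name as optional and matches
      // English month abbreviations case-insensitively.
      return Optional.of(
          ZonedDateTime.parse(s, DateTimeFormatter.RFC_1123_DATE_TIME).toInstant());
    } catch (DateTimeException e) {
      // Still unparseable (for example a bare "GMT"): report "no date" quietly.
      return Optional.empty();
    }
  }

  public static void main(String[] args) {
    String[] samples = {
      "Sun, 27 Jun 2010 06:51:35 GMT+1",
      "ma, 27 jun 2011 05:15:32 GMT",
      "\"Mon, 23 May 2011 22:05:58 GMT\"",
      "GMT"
    };
    for (String sample : samples) {
      System.out.println(sample + " -> " + parse(sample));
    }
  }
}
```

A caller could skip the lastModified field whenever the result is empty. Localized month names are not handled by this sketch and would also produce an empty result.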
Issue Links
- depends upon
  - NUTCH-1190 MoreIndexingFilter refactor: move data formats used to parse "lastModified" to a config file. (Closed)