Description
When using solrindex to index multiple segments via -dir segment,
the indexing fails if one or more segments are corrupted/incomplete (generated but not fetched for example)
The failure is simply java.io exception.
Deleting the segment fixes the issue.
The expected behavior should be one of the following:
- skipping the segment and proceeding with others (while logging)
- stopping the indexing and logging the failed segment
Attachments
Issue Links
- is duplicated by
-
NUTCH-1905 Nutch index tool should be resilient to segments that don't have crawl_* data
- Closed
-
NUTCH-1978 solrindex will fail when indexing corrupted segments
- Closed