Nutch

injector

Description

Takes a flat file of URLs and adds them to the crawldb as pages to be crawled.
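
As a rough illustration of the description above, the sketch below reads a seed list and merges the URLs into a toy in-memory table. It is not Nutch's Injector or its API: the function name `parse_seed_lines`, the dictionary standing in for the crawldb, and the example URLs are all hypothetical. It also assumes the seed file holds one URL per line, optionally followed by tab-separated `key=value` metadata, with `#` starting a comment line; check the Injector documentation for the exact format your version accepts.

```python
from typing import Dict, List


def parse_seed_lines(lines: List[str]) -> Dict[str, Dict[str, str]]:
    """Merge seed URLs into a dict that stands in for the crawldb (hypothetical)."""
    crawldb: Dict[str, Dict[str, str]] = {}
    for raw in lines:
        line = raw.strip()
        # Skip blank lines and comment lines (assumed '#' comment syntax).
        if not line or line.startswith("#"):
            continue
        # First tab-separated field is the URL, the rest is optional metadata.
        url, *fields = line.split("\t")
        meta: Dict[str, str] = {}
        for field in fields:
            key, sep, value = field.partition("=")
            if sep:  # ignore fields that are not key=value pairs
                meta[key.strip()] = value.strip()
        # A URL listed twice is stored once; later metadata is merged in.
        crawldb.setdefault(url.strip(), {}).update(meta)
    return crawldb


seeds = [
    "# seed list",
    "http://example.com/",
    "http://example.org/\tnutch.score=2.5",
    "",
    "http://example.com/",
]
crawldb = parse_seed_lines(seeds)
```

Here the duplicate `http://example.com/` line collapses into a single entry, which mirrors the idea of adding URLs to a database of pages rather than appending to a list.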

Issues: Unresolved

Type         Key         Summary  (Due Date)
Bug          NUTCH-1472  InvalidRequestException(why:(String didn't validate.) [webpage][f][ts] failed validation)
Bug          NUTCH-1746  OutOfMemoryError in Mappers
New Feature  NUTCH-2043  Interface and high level design for classification using models

Issues: Updated recently

Type         Key         Summary  (Updated)
Bug          NUTCH-1472  InvalidRequestException(why:(String didn't validate.) [webpage][f][ts] failed validation)
Improvement  NUTCH-1682  Port optionally maintain custom fetch interval despite AdaptiveFetchSchedule to 2.x
Improvement  NUTCH-1683  Optionally maintain custom fetch interval despite AbstractFetchSchedule

Versions: Unreleased

Status      Name   Release date
Unreleased  2.4
Unreleased  1.12
Unreleased  2.4.1
Unreleased  1.13
Unreleased  2.3.2
Unreleased  2.5
