All Projects : Nutch : indexer (Component)



 Select:   Open Issues   Road Map   Change Log   Popular Issues   

Open Issues

31 unresolved issue(s).

Versions

(with open issues due to be fixed per version for this component)
   Bug NUTCH-732 UNRESOLVED Subcollection plugin not working on Nutch-1.0 Critical Open
   Improvement NUTCH-760 UNRESOLVED Allow field mapping from nutch to solr index Major Open
   New Feature NUTCH-445 UNRESOLVED Domain İndexing / Query Filter Major Open
   New Feature NUTCH-541 UNRESOLVED Index url field untokenized Major Open
   Bug NUTCH-568 UNRESOLVED Indexer does not update the Lucene "TITLE" field Major Open
   Improvement NUTCH-716 UNRESOLVED Make subcollection index filed multivalued Major Open
   Bug NUTCH-729 UNRESOLVED NPE in FieldIndexer when BasicFields url doesn't exist Major Open
   Bug NUTCH-472 UNRESOLVED NullPointerException in ZipTextExtractor if no MIME type for zipped file Major Open
   Bug NUTCH-224 UNRESOLVED Nutch doesn't handle Korean text at all Major Open
   Bug NUTCH-739 UNRESOLVED SolrDeleteDuplications too slow when using hadoop Major Open
   New Feature NUTCH-441 UNRESOLVED Thai Analyzer Plugin Major Open
   New Feature NUTCH-185 UNRESOLVED XMLParser is configurable xml parser plugin. Major Open
   Improvement NUTCH-469 UNRESOLVED changes to geoPosition plugin to make it work on nutch 0.9 Major Open
   New Feature NUTCH-422 UNRESOLVED index-extra plugin creates additional fields in the index, based on configurable logic Major Open
   Improvement NUTCH-747 UNRESOLVED inject&Index metadatas and inherit these metadatas to all matching suburls Major Open
   Bug NUTCH-290 UNRESOLVED parse-pdf: Garbage indexed when text-extraction not allowed Major Open
   Bug NUTCH-129 UNRESOLVED rtf-parser does not work when opened with wordpad files and saved Major Open
   Improvement NUTCH-36 UNRESOLVED Chinese in Nutch Minor Open
   Improvement NUTCH-713 UNRESOLVED Config options for webgraph Scoring not documented Minor Open
   Improvement NUTCH-564 UNRESOLVED External parser supports encoding attribute Minor Open
   Bug NUTCH-267 UNRESOLVED Indexer doesn't consider linkdb when calculating boost value Minor Open
   Improvement NUTCH-86 UNRESOLVED LanguageIdentifier API enhancements Minor Open
   Improvement NUTCH-453 UNRESOLVED Move stop words to a config file Minor Open
   New Feature NUTCH-386 UNRESOLVED Plugin to index categories by url rules Minor Open
   Bug NUTCH-259 UNRESOLVED Problem in IndexSorter after dedup Minor Open
   New Feature NUTCH-260 UNRESOLVED Three new plugins that parse, index and query meta tags defined in the configuration Minor Open
   Bug NUTCH-326 UNRESOLVED WordExtractor throws java.util.NoSuchElementException on some documents Minor Open
   Improvement NUTCH-389 UNRESOLVED a url tokenizer implementation for tokenizing index fields : url and host Minor Open
   Improvement NUTCH-62 UNRESOLVED Add html META tag information into metaData in index-more plugin Trivial Open
   Improvement NUTCH-697 UNRESOLVED Generate log output for solr indexer and dedup Trivial Open
   New Feature NUTCH-16 UNRESOLVED boost documents matching a url pattern Trivial Open
Unreleased 1.1 7
  Unscheduled 24

Preset Filters


Component Summary

Open Open 31
   35%
Resolved Resolved 2
   2%
Closed Closed 56
   63%

Open Issues

By Priority
Critical Critical 1
   3%
Major Major 16
   52%
Minor Minor 11
   35%
Trivial Trivial 3
   10%

By Assignee
Chris A. Mattmann 1
   3%
Dennis Kubes 2
   6%
Enis Soztutar 1
   3%
Jerome Charron 1
   3%
Sami Siren 1
   3%
Unassigned 25
   81%