History
Log In
h
ome
b
rowse project
f
ind issues
Q
uick Search:
Learn more about
Quick Search
All Projects
:
Nutch
: indexer
(Component)
Select:
Open Issues
Road Map
Change Log
Popular Issues
Open Issues
31
unresolved issue(s).
Versions
(with open issues due to be fixed per version for this component)
NUTCH-732
UNRESOLVED
Subcollection plugin not working on Nutch-1.0
NUTCH-760
UNRESOLVED
Allow field mapping from nutch to solr index
NUTCH-445
UNRESOLVED
Domain İndexing / Query Filter
NUTCH-541
UNRESOLVED
Index url field untokenized
NUTCH-568
UNRESOLVED
Indexer does not update the Lucene "TITLE" field
NUTCH-716
UNRESOLVED
Make subcollection index filed multivalued
NUTCH-729
UNRESOLVED
NPE in FieldIndexer when BasicFields url doesn't exist
NUTCH-472
UNRESOLVED
NullPointerException in ZipTextExtractor if no MIME type for zipped file
NUTCH-224
UNRESOLVED
Nutch doesn't handle Korean text at all
NUTCH-739
UNRESOLVED
SolrDeleteDuplications too slow when using hadoop
NUTCH-441
UNRESOLVED
Thai Analyzer Plugin
NUTCH-185
UNRESOLVED
XMLParser is configurable xml parser plugin.
NUTCH-469
UNRESOLVED
changes to geoPosition plugin to make it work on nutch 0.9
NUTCH-422
UNRESOLVED
index-extra plugin creates additional fields in the index, based on configurable logic
NUTCH-747
UNRESOLVED
inject&Index metadatas and inherit these metadatas to all matching suburls
NUTCH-290
UNRESOLVED
parse-pdf: Garbage indexed when text-extraction not allowed
NUTCH-129
UNRESOLVED
rtf-parser does not work when opened with wordpad files and saved
NUTCH-36
UNRESOLVED
Chinese in Nutch
NUTCH-713
UNRESOLVED
Config options for webgraph Scoring not documented
NUTCH-564
UNRESOLVED
External parser supports encoding attribute
NUTCH-267
UNRESOLVED
Indexer doesn't consider linkdb when calculating boost value
NUTCH-86
UNRESOLVED
LanguageIdentifier API enhancements
NUTCH-453
UNRESOLVED
Move stop words to a config file
NUTCH-386
UNRESOLVED
Plugin to index categories by url rules
NUTCH-259
UNRESOLVED
Problem in IndexSorter after dedup
NUTCH-260
UNRESOLVED
Three new plugins that parse, index and query meta tags defined in the configuration
NUTCH-326
UNRESOLVED
WordExtractor throws java.util.NoSuchElementException on some documents
NUTCH-389
UNRESOLVED
a url tokenizer implementation for tokenizing index fields : url and host
NUTCH-62
UNRESOLVED
Add html META tag information into metaData in index-more plugin
NUTCH-697
UNRESOLVED
Generate log output for solr indexer and dedup
NUTCH-16
UNRESOLVED
boost documents matching a url pattern
1.1
7
Unscheduled
24
Preset Filters
-
All
-
Outstanding
-
Unscheduled
-
Most important
-
Resolved recently
-
Added recently
-
Updated recently
Component Summary
Open
31
35%
Resolved
2
2%
Closed
56
63%
Open Issues
By Priority
Critical
1
3%
Major
16
52%
Minor
11
35%
Trivial
3
10%
By Assignee
Chris A. Mattmann
1
3%
Dennis Kubes
2
6%
Enis Soztutar
1
3%
Jerome Charron
1
3%
Sami Siren
1
3%
Unassigned
25
81%