Summary
Project: Nutch
Components: indexer
Resolutions: Unresolved
Sorted by: Priority descending
Issue Navigator
Displaying issues 1 to 31 of 31 matching issues.
Key | Summary | Assignee | Reporter | Status | Resolution | Created | Updated | Due | Patch Info
NUTCH-732 | Subcollection plugin not working on Nutch-1.0 | Unassigned | Filipe Antunes | Open | UNRESOLVED | 07/Apr/09 | 07/Apr/09 | |
NUTCH-422 | index-extra plugin creates additional fields in the index, based on configurable logic | Sami Siren | Alan Tanaman | Open | UNRESOLVED | 28/Dec/06 | 08/Jul/09 | |
NUTCH-129 | rtf-parser does not work when opened with wordpad files and saved | Unassigned | raghavendra prabhu | Open | UNRESOLVED | 25/Nov/05 | 26/Jun/06 | |
NUTCH-472 | NullPointerException in ZipTextExtractor if no MIME type for zipped file | Unassigned | Antony Bowesman | Open | UNRESOLVED | 24/Apr/07 | 12/May/07 | |
NUTCH-441 | Thai Analyzer Plugin | Unassigned | Vee Satayamas | Open | UNRESOLVED | 07/Feb/07 | 07/Feb/07 | |
NUTCH-290 | parse-pdf: Garbage indexed when text-extraction not allowed | Unassigned | Stefan Neufeind | Open | UNRESOLVED | 28/May/06 | 07/Sep/06 | |
NUTCH-224 | Nutch doesn't handle Korean text at all | Unassigned | KuroSaka TeruHiko | Open | UNRESOLVED | 07/Mar/06 | 02/Mar/07 | |
NUTCH-568 | Indexer does not update the Lucene "TITLE" field | Unassigned | smorales | Open | UNRESOLVED | 19/Oct/07 | 22/Oct/07 | |
NUTCH-445 | Domain İndexing / Query Filter | Unassigned | Enis Soztutar | Open | UNRESOLVED | 15/Feb/07 | 27/Feb/08 | |
NUTCH-469 | changes to geoPosition plugin to make it work on nutch 0.9 | Unassigned | Mike Schwartz | Open | UNRESOLVED | 23/Apr/07 | 17/Feb/09 | |
NUTCH-541 | Index url field untokenized | Enis Soztutar | Enis Soztutar | Open | UNRESOLVED | 09/Aug/07 | 20/Feb/09 | |
NUTCH-185 | XMLParser is configurable xml parser plugin. | Chris A. Mattmann | Rida Benjelloun | Open | UNRESOLVED | 25/Jan/06 | 27/Feb/09 | |
NUTCH-716 | Make subcollection index filed multivalued | Unassigned | Dmitry Lihachev | Open | UNRESOLVED | 10/Mar/09 | 22/May/09 | | Patch Available
NUTCH-729 | NPE in FieldIndexer when BasicFields url doesn't exist | Dennis Kubes | Dennis Kubes | Open | UNRESOLVED | 25/Mar/09 | 23/Jun/09 | 26/Mar/09 |
NUTCH-760 | Allow field mapping from nutch to solr index | Unassigned | David Stuart | Open | UNRESOLVED | 15/Oct/09 | 27/Oct/09 | | Patch Available
NUTCH-747 | inject&Index metadatas and inherit these metadatas to all matching suburls | Unassigned | Marko Bauhardt | Open | UNRESOLVED | 06/Aug/09 | 06/Aug/09 | | Patch Available
NUTCH-739 | SolrDeleteDuplications too slow when using hadoop | Unassigned | Dmitry Lihachev | Open | UNRESOLVED | 28/May/09 | 29/May/09 | |
NUTCH-389 | a url tokenizer implementation for tokenizing index fields : url and host | Unassigned | Enis Soztutar | Open | UNRESOLVED | 20/Oct/06 | 07/Nov/06 | |
NUTCH-259 | Problem in IndexSorter after dedup | Unassigned | Michael | Open | UNRESOLVED | 29/Apr/06 | 29/Apr/06 | |
NUTCH-260 | Three new plugins that parse, index and query meta tags defined in the configuration | Unassigned | Jake Vanderdray | Open | UNRESOLVED | 03/May/06 | 03/May/06 | |
NUTCH-267 | Indexer doesn't consider linkdb when calculating boost value | Unassigned | Chris Schneider | Open | UNRESOLVED | 09/May/06 | 12/May/06 | |
NUTCH-326 | WordExtractor throws java.util.NoSuchElementException on some documents | Unassigned | Tom Jensen | Open | UNRESOLVED | 21/Jul/06 | 21/Jul/06 | |
NUTCH-453 | Move stop words to a config file | Unassigned | Steve Severance | Open | UNRESOLVED | 02/Mar/07 | 02/Mar/07 | |
NUTCH-386 | Plugin to index categories by url rules | Unassigned | Ernesto De Santis | Open | UNRESOLVED | 14/Oct/06 | 16/May/09 | |
NUTCH-86 | LanguageIdentifier API enhancements | Jerome Charron | Jerome Charron | Open | UNRESOLVED | 31/Aug/05 | 17/Feb/09 | |
NUTCH-36 | Chinese in Nutch | Unassigned | Jack Tang | Open | UNRESOLVED | 05/Apr/05 | 07/Nov/06 | |
NUTCH-564 | External parser supports encoding attribute | Unassigned | Antony Bowesman | Open | UNRESOLVED | 03/Oct/07 | 17/Feb/09 | |
NUTCH-713 | Config options for webgraph Scoring not documented | Unassigned | Eric J. Christeson | Open | UNRESOLVED | 09/Mar/09 | 09/Mar/09 | | Patch Available
NUTCH-16 | boost documents matching a url pattern | Dennis Kubes | Stefan Groschupf | Open | UNRESOLVED | 23/Mar/05 | 31/Mar/08 | |
NUTCH-62 | Add html META tag information into metaData in index-more plugin | Unassigned | Jack Tang | Open | UNRESOLVED | 07/Jun/05 | 07/Jun/05 | |
NUTCH-697 | Generate log output for solr indexer and dedup | Unassigned | Dmitry Lihachev | Open | UNRESOLVED | 20/Feb/09 | 20/Feb/09 | | Patch Available