Summary
Project: Nutch
Sorted by: Key descending
Issue Navigator
Displaying issues 1 to 50 of 759 matching issues.
Patch Info | Key | Summary | Assignee | Reporter | Status | Resolution | Created | Updated | Due
Patch Available | NUTCH-770 | Timebomb for Fetcher | Unassigned | Julien Nioche | Open | UNRESOLVED | 23/Nov/09 | 23/Nov/09 | -
Patch Available | NUTCH-769 | Fetcher to skip queues for URLS getting repeated exceptions | Unassigned | Julien Nioche | Open | UNRESOLVED | 23/Nov/09 | 23/Nov/09 | -
- | NUTCH-768 | Upgrade Nutch 1.0 to use Hadoop 0.20 | Dennis Kubes | Dennis Kubes | Open | UNRESOLVED | 21/Nov/09 | 21/Nov/09 | 24/Nov/09
Patch Available | NUTCH-767 | Update version of Tika for the MimeType detection | Chris A. Mattmann | Julien Nioche | Open | UNRESOLVED | 18/Nov/09 | 18/Nov/09 | -
Patch Available | NUTCH-766 | Tika parser | Unassigned | Julien Nioche | Open | UNRESOLVED | 18/Nov/09 | 18/Nov/09 | -
Patch Available | NUTCH-765 | Allow Crawl class to call Either Solr or Lucene Indexer | Dennis Kubes | Dennis Kubes | Closed | Fixed | 12/Nov/09 | 21/Nov/09 | 12/Nov/09
Patch Available | NUTCH-764 | Add support for vfsfile:// loading of plugins for JBoss | Unassigned | tcurran@approachingpi.com | Open | UNRESOLVED | 10/Nov/09 | 10/Nov/09 | -
- | NUTCH-763 | Separate configuration files from resources to be included in the job file | Unassigned | Julien Nioche | Open | UNRESOLVED | 05/Nov/09 | 05/Nov/09 | -
Patch Available | NUTCH-762 | Alternative Generator which can generate several segments in one parse of the crawlDB | Unassigned | Julien Nioche | Open | UNRESOLVED | 03/Nov/09 | 03/Nov/09 | -
Patch Available | NUTCH-761 | Avoid cloningCrawlDatum in CrawlDbReducer | Unassigned | Julien Nioche | Open | UNRESOLVED | 03/Nov/09 | 03/Nov/09 | -
Patch Available | NUTCH-760 | Allow field mapping from nutch to solr index | Unassigned | David Stuart | Open | UNRESOLVED | 15/Oct/09 | 27/Oct/09 | -
- | NUTCH-759 | Removal of deprecated APIs | Unassigned | Stephen Norman | Open | UNRESOLVED | 14/Oct/09 | 14/Oct/09 | -
Patch Available | NUTCH-758 | Set subversion eol-style to "native" | Andrzej Bialecki | Niall Pemberton | Closed | Fixed | 30/Sep/09 | 10/Oct/09 | -
Patch Available | NUTCH-757 | RequestUtils getBooleanParameter() always returns false | Andrzej Bialecki | Niall Pemberton | Closed | Fixed | 30/Sep/09 | 10/Oct/09 | -
Patch Available | NUTCH-756 | CrawlDatum.set() does not reset Metadata if it is null | Andrzej Bialecki | Julien Nioche | Closed | Fixed | 29/Sep/09 | 10/Oct/09 | -
- | NUTCH-755 | DomainURLFilter crashes on malformed URL | Unassigned | Mike Baranczak | Open | UNRESOLVED | 17/Sep/09 | 26/Oct/09 | -
Patch Available | NUTCH-754 | Use GenericOptionsParser instead of FileSystem.parseArgs() | Andrzej Bialecki | Julien Nioche | Closed | Fixed | 16/Sep/09 | 10/Oct/09 | -
- | NUTCH-753 | Prevent new Fetcher to retrieve the robots twice | Unassigned | Julien Nioche | Open | UNRESOLVED | 07/Sep/09 | 07/Sep/09 | -
- | NUTCH-752 | how to index data from databse(ect oracle) | Unassigned | zhengfang | Closed | Invalid | 07/Sep/09 | 10/Sep/09 | -
- | NUTCH-751 | Upgrade version of HttpClient | Unassigned | Julien Nioche | Open | UNRESOLVED | 04/Sep/09 | 09/Sep/09 | -
Patch Available | NUTCH-750 | HtmlParser plugin - page title extraction | Unassigned | Alexey Torochkov | Open | UNRESOLVED | 29/Aug/09 | 29/Aug/09 | -
- | NUTCH-749 | Fetching the url from crawldb | Unassigned | salima abdulsalam | Closed | Invalid | 21/Aug/09 | 21/Aug/09 | -
Patch Available | NUTCH-748 | DiskChecker Could not find | Unassigned | mawanqiang | Closed | Won't Fix | 18/Aug/09 | 09/Oct/09 | -
Patch Available | NUTCH-747 | inject&Index metadatas and inherit these metadatas to all matching suburls | Unassigned | Marko Bauhardt | Open | UNRESOLVED | 06/Aug/09 | 06/Aug/09 | -
Patch Available | NUTCH-746 | NutchBeanConstructor does not close NutchBean upon contextDestroyed, causing resource leak in the container. | Unassigned | Kirby Bohling | Open | UNRESOLVED | 26/Jul/09 | 04/Aug/09 | -
- | NUTCH-745 | MyHtmlParser getParse return not null,so all Analyzer-(zh|fr) cannot run | Unassigned | jcore_XiaTian | Open | UNRESOLVED | 10/Jul/09 | 10/Jul/09 | -
- | NUTCH-744 | indexing items in rss-feed in seperate page | Unassigned | Tarun | Closed | Invalid | 09/Jul/09 | 09/Jul/09 | -
- | NUTCH-743 | Site search powered by Lucene/Solr | Sami Siren | Sami Siren | Resolved | Fixed | 23/Jun/09 | 04/Jul/09 | -
- | NUTCH-742 | Checksum Error | Unassigned | mawanqiang | Resolved | Incomplete | 20/Jun/09 | 21/Jun/09 | -
Patch Available | NUTCH-741 | Job file includes multiple copies of nutch config files. | Unassigned | Kirby Bohling | Open | UNRESOLVED | 29/May/09 | 29/May/09 | -
Patch Available | NUTCH-740 | Configuration option to override default language for fetched pages. | Otis Gospodnetic | Marcin Okraszewski | Open | UNRESOLVED | 28/May/09 | 09/Jun/09 | -
- | NUTCH-739 | SolrDeleteDuplications too slow when using hadoop | Unassigned | Dmitry Lihachev | Open | UNRESOLVED | 28/May/09 | 29/May/09 | -
Patch Available | NUTCH-738 | Close SegmentUpdater when FetchedSegments is closed | Unassigned | Martina Koch | Open | UNRESOLVED | 26/May/09 | 04/Aug/09 | -
Patch Available | NUTCH-737 | urlnormalizer-unalias plugin | Unassigned | Dmitry Lihachev | Open | UNRESOLVED | 26/May/09 | 26/May/09 | -
- | NUTCH-736 | how long it takes nutch 1.0 to fetch | Otis Gospodnetic | Filipe Antunes | Resolved | Invalid | 14/May/09 | 24/May/09 | -
Patch Available | NUTCH-735 | crawl-tool.xml must be read before nutch-site.xml when invoked using crawl command | Doğacan Güney | Susam Pal | Closed | Fixed | 09/May/09 | 08/Jun/09 | -
- | NUTCH-734 | option to filter "a" tag text | Unassigned | ron | Open | UNRESOLVED | 02/May/09 | 02/May/09 | -
Patch Available | NUTCH-733 | plain text view of cached files ignores HTML encoding | Unassigned | Ilguiz Latypov | Open | UNRESOLVED | 30/Apr/09 | 07/Jun/09 | -
- | NUTCH-732 | Subcollection plugin not working on Nutch-1.0 | Unassigned | Filipe Antunes | Open | UNRESOLVED | 07/Apr/09 | 07/Apr/09 | -
Patch Available | NUTCH-731 | Redirection of robots.txt in RobotRulesParser | Andrzej Bialecki | Julien Nioche | Closed | Fixed | 03/Apr/09 | 10/Oct/09 | -
- | NUTCH-730 | NPE in LinkRank if no nodes with which to create the WebGraph | Andrzej Bialecki | Dennis Kubes | Closed | Fixed | 26/Mar/09 | 10/Oct/09 | 26/Mar/09
- | NUTCH-729 | NPE in FieldIndexer when BasicFields url doesn't exist | Dennis Kubes | Dennis Kubes | Open | UNRESOLVED | 25/Mar/09 | 23/Jun/09 | 26/Mar/09
- | NUTCH-728 | Improve nutch release packaging | Unassigned | Sami Siren | Open | UNRESOLVED | 19/Mar/09 | 20/Mar/09 | -
- | NUTCH-727 | Add KEYS file to release artifact | Unassigned | Sami Siren | Closed | Fixed | 19/Mar/09 | 10/Apr/09 | -
- | NUTCH-726 | README.txt is lacking info that should be there | Unassigned | Sami Siren | Closed | Fixed | 19/Mar/09 | 10/Apr/09 | -
- | NUTCH-725 | NOTICE.txt is lacking info that should be there | Unassigned | Sami Siren | Closed | Fixed | 19/Mar/09 | 10/Apr/09 | -
- | NUTCH-724 | Drop the JAI libraries | Unassigned | Jukka Zitting | Closed | Duplicate | 19/Mar/09 | 10/Apr/09 | -
- | NUTCH-723 | LICENCE.txt is lacking info that should be there | Unassigned | Sami Siren | Closed | Fixed | 19/Mar/09 | 10/Apr/09 | -
- | NUTCH-722 | Nutch contains jars that we cannot redistribute | Unassigned | Sami Siren | Closed | Fixed | 19/Mar/09 | 10/Apr/09 | -
- | NUTCH-721 | Fetcher2 Slow | Doğacan Güney | Roger Dunk | Closed | Fixed | 17/Mar/09 | 25/Aug/09 | -