Nutch: 1.0.0 (Fix For Version)
Release Date: 23/Mar/09
Description: Nutch 1.0 release
Progress: 194 of 194 issues have been resolved
Components
(with all issues in each component for this version)
NUTCH-698 [FIXED] CrawlDb is corrupted after a few crawl cycles
NUTCH-694 [FIXED] Distributed Search Server fails
NUTCH-688 [FIXED] Fix missing/wrong headers in source files
NUTCH-631 [FIXED] MoreIndexingFilter fails with NoSuchElementException
NUTCH-515 [FIXED] Next fetch time is set incorrectly
NUTCH-722 [FIXED] Nutch contains jars that we cannot redistribute
NUTCH-621 [FIXED] Nutch needs to declare its crypto usage
NUTCH-703 [FIXED] Upgrade to Hadoop 0.19.1
NUTCH-724 [DUPLICATE] Drop the JAI libraries
NUTCH-678 [FIXED] Hadoop 0.19 requires an update of jets3t
NUTCH-641 [FIXED] IndexSorter incorrectly copies stored fields
NUTCH-700 [FIXED] Neko1.9.11 goes into a loop
NUTCH-508 [FIXED] ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker
NUTCH-61 [FIXED] Adaptive re-fetch interval. Detecting unmodified content
NUTCH-652 [FIXED] AdaptiveFetchSchedule#setFetchSchedule doesn't calculate fetch interval correctly
NUTCH-727 [FIXED] Add KEYS file to release artifact
NUTCH-699 [FIXED] Add an "official" solr schema for solr integration
NUTCH-603 [FIXED] Add more default url normalizations
NUTCH-586 [FIXED] Add option to run compiled classes w/o job file
NUTCH-279 [FIXED] Additions for regex-normalize
NUTCH-602 [FIXED] Allow configurable number of handlers for search servers
NUTCH-565 [FIXED] Arc File to Nutch Segments Converter
NUTCH-488 [FIXED] Avoid parsing unnecessary links and get a more relevant outlink list
NUTCH-485 [FIXED] Change HtmlParseFilter's to return ParseResult object instead of Parse object
NUTCH-605 [FIXED] Change deprecated configuration methods for Hadoop
NUTCH-643 [FIXED] ClassCastException in PdfParser on encrypted PDF with empty password
NUTCH-545 [FIXED] Configuration and OnlineClusterer get initialized in every request.
NUTCH-669 [FIXED] Consolidate code for Fetcher and Fetcher2
NUTCH-532 [FIXED] CrawlDbMerger: wrong computation of last fetch time
NUTCH-684 [FIXED] Dedup support for Solr
NUTCH-467 [FIXED] DeleteDuplicate fails if Segment index directory has 0 documents
NUTCH-525 [FIXED] DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment
NUTCH-668 [FIXED] Domain URL Filter
NUTCH-613 [FIXED] Empty Summaries and Cached Pages
NUTCH-497 [FIXED] Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap
NUTCH-579 [FIXED] Feed plugin only indexes one post per feed due to identical digest
NUTCH-413 [FIXED] Fetcher ignores -noParsing command line option
NUTCH-597 [FIXED] Fetcher2 - java.lang.NullPointerException when host does not exist and fetcher.threads.per.host.by.ip is set to true causes threads to finish.
NUTCH-474 [FIXED] Fetcher2 sets server-delay and blocking checks incorrectly
NUTCH-126 [FIXED] Fetching via https does not work with a proxy (patch)
NUTCH-518 [FIXED] Fix OpicScoringFilter to respect scoring filter chaining
NUTCH-382 [FIXED] Fix for NUTCH-365 introduced a bug if generate.max.per.host.by.ip is enabled
NUTCH-471 [FIXED] Fix synchronization in NutchBean creation
NUTCH-74 [FIXED] French Analyzer Plugin
NUTCH-503 [FIXED] Generator exits incorrectly for small fetchlists
NUTCH-554 [FIXED] Generator throws java.io.IOException and dies on injected urls with no protocol
NUTCH-636 [FIXED] Http client plug-in https doesn't work on IBM JRE
NUTCH-561 [FIXED] HttpClient plugin does not work with NTLM authentication
NUTCH-501 [FIXED] Implement a different caching mechanism for objects cached in configuration
NUTCH-574 [FIXED] Including inlink anchor text in index can create irrelevant search results.
Viewing 50 of 194 Issues.
build: 4
documentation: 1
fetcher: 46
generator: 6
indexer: 25
injector: 2
linkdb: 4
mime_type_detector: 2
ndfs: 1
searcher: 13
web gui: 6
No Component: 89
Version Summary
Closed: 194 (100%)
Open Issues
By Priority: No issues
By Assignee: No issues