All Projects : Nutch : 1.0.0 (Fix For Version)

Release Date: 23/Mar/09
Description: Nutch 1.0 release
1.0.0 Nutch 1.0 release 2009-03-23T07-00

 Select:   Summary   Popular Issues   

Summary

Progress: 
 194 of 194 issues have been resolved

Components

(with all issues in each component for this version)
   Bug NUTCH-698 FIXED CrawlDb is corrupted after a few crawl cycles Blocker Closed
   Bug NUTCH-694 FIXED Distributed Search Server fails Blocker Closed
   Bug NUTCH-688 FIXED Fix missing/wrong headers in source files Blocker Closed
   Bug NUTCH-631 FIXED MoreIndexingFilter fails with NoSuchElementException Blocker Closed
   Bug NUTCH-515 FIXED Next fetch time is set incorrectly Blocker Closed
   Bug NUTCH-722 FIXED Nutch contains jars that we cannot redistribute Blocker Closed
   Task NUTCH-621 FIXED Nutch needs to declare it's crypto usage Blocker Closed
   Bug NUTCH-703 FIXED Upgrade to Hadoop 0.19.1 Blocker Closed
   Bug NUTCH-724 DUPLICATE Drop the JAI libraries Blocker Closed
   Bug NUTCH-678 FIXED Hadoop 0.19 requires an update of jets3t Critical Closed
   Bug NUTCH-641 FIXED IndexSorter incorrectly copies stored fields Critical Closed
   Bug NUTCH-700 FIXED Neko1.9.11 goes into a loop Critical Closed
   Bug NUTCH-508 FIXED ${hadoop.log.dir} and ${hadoop.log.file} are not propagated to the tasktracker Major Closed
   New Feature NUTCH-61 FIXED Adaptive re-fetch interval. Detecting umodified content Major Closed
   Bug NUTCH-652 FIXED AdaptiveFetchSchedule#setFetchSchedule doesn't calculate fetch interval correctly Major Closed
   Bug NUTCH-727 FIXED Add KEYS file to release artifact Major Closed
   New Feature NUTCH-699 FIXED Add an "official" solr schema for solr integration Major Closed
   Improvement NUTCH-603 FIXED Add more default url normalizations Major Closed
   New Feature NUTCH-586 FIXED Add option to run compiled classes w/o job file Major Closed
   Improvement NUTCH-279 FIXED Additions for regex-normalize Major Closed
   Improvement NUTCH-602 FIXED Allow configurable number of handlers for search servers Major Closed
   Improvement NUTCH-565 FIXED Arc File to Nutch Segments Converter Major Closed
   Improvement NUTCH-488 FIXED Avoid parsing uneccessary links and get a more relevant outlink list Major Closed
   Improvement NUTCH-485 FIXED Change HtmlParseFilter 's to return ParseResult object instead of Parse object Major Closed
   Improvement NUTCH-605 FIXED Change deprecated configuration methods for Hadoop Major Closed
   Bug NUTCH-643 FIXED ClassCastException in PdfParser on encrypted PDF with empty password Major Closed
   Bug NUTCH-545 FIXED Configuration and OnlineClusterer get initialized in every request. Major Closed
   Improvement NUTCH-669 FIXED Consolidate code for Fetcher and Fetcher2 Major Closed
   Bug NUTCH-532 FIXED CrawlDbMerger: wrong computation of last fetch time Major Closed
   New Feature NUTCH-684 FIXED Dedup support for Solr Major Closed
   Bug NUTCH-467 FIXED DeleteDuplicate fails if Segment index directory has 0 documents Major Closed
   Bug NUTCH-525 FIXED DeleteDuplicates generates ArrayIndexOutOfBoundsException when trying to rerun dedup on a segment Major Closed
   Improvement NUTCH-668 FIXED Domain URL Filter Major Closed
   Bug NUTCH-613 FIXED Empty Summaries and Cached Pages Major Closed
   Bug NUTCH-497 FIXED Extreme Nested Tags causes StackOverflowException in DomContentUtils...Spider Trap Major Closed
   Bug NUTCH-579 FIXED Feed plugin only indexes one post per feed due to identical digest Major Closed
   Bug NUTCH-413 FIXED Fetcher ignores -noParsing command line option Major Closed
   Bug NUTCH-597 FIXED Fetcher2 - java.lang.NullPointerException when host does not exist and fetcher.threads.per.host.by.ip is set to true causes threads to finish. Major Closed
   Bug NUTCH-474 FIXED Fetcher2 sets server-delay and blocking checks incorrectly Major Closed
   Bug NUTCH-126 FIXED Fetching via https does not work with a proxy (patch) Major Closed
   Bug NUTCH-518 FIXED Fix OpicScoringFilter to respect scoring filter chaining Major Closed
   Bug NUTCH-382 FIXED Fix for NUTCH-365 introduced a bug if generate.max.per.host.by.ip is enabled Major Closed
   Bug NUTCH-471 FIXED Fix synchronization in NutchBean creation Major Closed
   New Feature NUTCH-74 FIXED French Analyzer Plugin Major Closed
   Bug NUTCH-503 FIXED Generator exits incorrectly for small fetchlists Major Closed
   Bug NUTCH-554 FIXED Generator throws java.io.IOException and dies on injected urls with no protocol Major Closed
   Bug NUTCH-636 FIXED Http client plug-in https doesn't work on IBM JRE Major Closed
   Bug NUTCH-561 FIXED HttpClient plugin does not work with NTLM authentication Major Closed
   Improvement NUTCH-501 FIXED Implement a different caching mechanism for objects cached in configuration Major Closed
   Bug NUTCH-574 FIXED Including inlink anchor text in index can create irrelevant search results. Major Closed
  Viewing 50 of 194 Issues.
Component build 4
Component documentation 1
Component fetcher 46
Component generator 6
Component indexer 25
Component injector 2
Component linkdb 4
Component mime_type_detector 2
Component ndfs 1
Component searcher 13
Component web gui 6
  No Component 89

Preset Filters


Version Summary

Closed Closed 194
   100%

Open Issues

By Priority
No issues

By Assignee
No issues