Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-60

Bad language identifier plugin performances

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • None
    • indexer
    • None

    Description

      As reported by Stefan Groschupf (http://www.mail-archive.com/nutch-developers@lists.sourceforge.net/msg04090.html) the language identifier plugin consumes a lot of processing time.
      Some optimizations and/or configuration options are required.

      Attachments

        1. NUTCH-60-050526.patch
          767 kB
          Jerome Charron
        2. NUTCH-60-050605.patch
          791 kB
          Jerome Charron
        3. NUTCH-60-050607.patch
          790 kB
          Jerome Charron
        4. NUTCH-60-050627.patch
          795 kB
          Jerome Charron

        Activity

          People

            Unassigned Unassigned
            jerome.charron Jerome Charron
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: