Nutch / NUTCH-1280

language-identifier should have option to use detected value by Tika even when uncertain

Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: nutchgora
    • Component/s: parser
    • Labels: None
    • Patch Info: Patch Available

    Description

      Nutch trunk has an option, "lang.identification.only.certain"; Nutchgora should have it too. Note that it defaults to false, so this changes the default behaviour somewhat.

      Patch will be right up.
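
      The decision this option controls can be sketched as follows. This is a minimal illustration and not the code from the attached patch: the class LanguageChoice, the method chooseLanguage and its parameters are hypothetical names, and the "reasonably certain" flag is assumed to come from whatever the Tika-based detector reports. With the option set to false (the default mentioned above), an uncertain detection is still used; with it set to true, an uncertain detection is discarded.

```java
import java.util.Optional;

// Hypothetical sketch of the decision behind "lang.identification.only.certain".
// None of these names come from Nutch or from the attached patch.
class LanguageChoice {

    /**
     * @param detected          language code reported by the detector, may be null
     * @param reasonablyCertain whether the detector considers its guess certain
     * @param onlyCertain       assumed value of lang.identification.only.certain
     * @return the language to record, or empty if none should be recorded
     */
    static Optional<String> chooseLanguage(String detected,
                                           boolean reasonablyCertain,
                                           boolean onlyCertain) {
        if (detected == null || detected.isEmpty()) {
            return Optional.empty(); // nothing was detected at all
        }
        if (onlyCertain && !reasonablyCertain) {
            return Optional.empty(); // uncertain guess rejected in strict mode
        }
        return Optional.of(detected); // certain guess, or lenient mode
    }

    public static void main(String[] args) {
        // Lenient mode (option false): the uncertain guess is kept.
        System.out.println(chooseLanguage("nl", false, false)); // Optional[nl]
        // Strict mode (option true): the uncertain guess is dropped.
        System.out.println(chooseLanguage("nl", false, true));  // Optional.empty
        // Strict mode with a certain guess: kept.
        System.out.println(chooseLanguage("nl", true, true));   // Optional[nl]
    }
}
```

      Returning an empty Optional rather than a placeholder string leaves it to the caller to decide what to do when no language is recorded; the actual patch may handle that case differently.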

      Attachments

        1. NUTCH-1280.txt (2 kB, Ferdy)

          People

            Assignee: Unassigned
            Reporter: Ferdy (ferdy.g)
            Votes: 0
            Watchers: 0