Uploaded image for project: 'Nutch'
  1. Nutch
  2. NUTCH-1280

language-identifier should have option to use detected value by Tika even when uncertain

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: nutchgora
    • Component/s: parser
    • Labels:
      None
    • Patch Info:
      Patch Available

      Description

      Nutchtrunk has an option "lang.identification.only.certain", this should be the case for Nutchgora too. Note that it is set default to false. So this changes the default behaviour somewhat.

      Patch will be right up.

        Attachments

        1. NUTCH-1280.txt
          2 kB
          Ferdy

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              ferdy.g Ferdy
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: