Nutch / NUTCH-1280

language-identifier should have an option to use the value detected by Tika even when uncertain


    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: nutchgora
    • Component/s: parser
    • Labels: None
    • Patch Info: Patch Available

      Description

      Nutch trunk has an option "lang.identification.only.certain"; Nutchgora should have it too. Note that it defaults to false, which changes the default behaviour somewhat.
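
      To make the intended behaviour concrete, here is a minimal sketch of how such a flag might gate the detected language. It is not the actual patch: `LangIdSketch`, `Detection`, and `chooseLanguage` are hypothetical names, and `Detection` only stands in for whatever the Tika-based detector reports (a language code plus a certainty flag). The only name taken from this issue is the property "lang.identification.only.certain".

```java
import java.util.Optional;

public class LangIdSketch {

    // Hypothetical stand-in for a detector result: a language code
    // and whether the detector considers the guess reasonably certain.
    public static final class Detection {
        final String language;
        final boolean reasonablyCertain;

        public Detection(String language, boolean reasonablyCertain) {
            this.language = language;
            this.reasonablyCertain = reasonablyCertain;
        }
    }

    // onlyCertain mirrors the value of "lang.identification.only.certain".
    // When false (the default described in this issue), the detected value
    // is used even if the detector is uncertain. When true, uncertain
    // detections are discarded.
    public static Optional<String> chooseLanguage(Detection d, boolean onlyCertain) {
        if (d == null || d.language == null || d.language.isEmpty()) {
            return Optional.empty();
        }
        if (onlyCertain && !d.reasonablyCertain) {
            return Optional.empty();
        }
        return Optional.of(d.language);
    }

    public static void main(String[] args) {
        Detection uncertain = new Detection("nl", false);
        System.out.println(chooseLanguage(uncertain, false)); // prints "Optional[nl]"
        System.out.println(chooseLanguage(uncertain, true));  // prints "Optional.empty"
    }
}
```

      With the flag at false, an uncertain guess of "nl" is still used; with it at true, the same guess is dropped and the language stays unset.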

      Patch will be right up.

            People

            • Assignee: Unassigned
            • Reporter: Ferdy (ferdy.g)
