Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-3456

LanguageDetector should try to respect hasEnoughText more intelligently

    XMLWordPrintableJSON

Details

    • Task
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • None
    • 1.27
    • None
    • None

    Description

      If a user calls LanguageDetector's detect(String txt) or addText(String txt), the full string is passed on to the subclasses and there is no check on "hasEnoughText()". For large strings, LanguageDetector should break the string into smaller parts and check for hasEnoughText().

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              tallison Tim Allison
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: