Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-856

Support CJK (Chinese, Japanese and Korean) language detection

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.0
    • None
    • languageidentifier
    • All

    Description

      Support language detection of CJK (Chinese, Japanese and Korean).
      Some estimates have Chinese users overtaking English users on the Internet so it is important that these languages used by large number of people be supported.

      See TIKA-855

      Attachments

        1. ja.ngp
          14 kB
          James Sullivan

        Issue Links

          Activity

            People

              kkrugler Kenneth William Krugler
              sully James Sullivan
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated: