Uploaded image for project: 'Tika'
  1. Tika

languageidentifier

Summary

Description

The language identifier component for use in content detection and analysis.

Issues: Unresolved

Key Summary Due Date
Improvement TIKA-369 Improve accuracy of language detection
New Feature TIKA-491 Add language identification support for Norwegian Bokmål and Norwegian Nynorsk
Bug TIKA-496 Language identifier profile comparison favors large profiles

View Issues

Issues: Updated recently

Key Summary Updated
Improvement TIKA-2439 Avoid NullPointerException in org.apache.tika.langdetect.OptimaizeLangDetector if models haven't been loaded
Bug TIKA-2183 Can't Read file if its name is Arabic
Improvement TIKA-2297 Add Lingo24 Language Detector

View Issues

Versions: Unreleased

Name Release date
Unreleased 2.0  
Unreleased 1.17