Details
Type: Improvement
Status: Resolved
Priority: Major
Resolution: Fixed
2.1.0
None
Description
The language identifier/detection functionality suffers from a few problems:
- Clarity is needed on identifier/identification vs. detector/detection. Which is it? The source code says identifier, whereas the documentation is nested under detection.
- The link for org.apache.tika.language.LanguageIdentifier returns a 404. What is it meant to resolve to?
- Generally speaking, the documentation is practically non-existent. I checked the wiki and failed to find anything. I did find some minor documentation, but it is also severely lacking. Also note the broken hyperlink.
Some suggestions for improvement:
- Fix the broken hyperlinks.
- Hyperlink to the existing examples, namely LanguageDetectorExample.java, LanguageDetectingParser.java and Language.java.
- Hyperlink to the LanguageDetector Javadoc and at least mention some of the other implementations.