Details
-
Bug
-
Status: Open
-
Major
-
Resolution: Unresolved
-
1.6
-
None
-
None
Description
The Boilerpipe project bundles inside it two classes from org.cyberneko.html. We're already using NekoHTML in our project. Depending on which library shows up on our classpath certain parts of our project will either work or not. I'd really love it if Boilerpipe could be fixed or replaced with some other library that is a better citizen.
I see I'm not the first person to run into this as another Tika user has filed a bug on the Boilerpipe project: https://code.google.com/p/boilerpipe/issues/detail?id=62