Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-1484

Boilerpipe dependency is evil

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.6
    • None
    • parser
    • None

    Description

      The Boilerpipe project bundles inside it two classes from org.cyberneko.html. We're already using NekoHTML in our project. Depending on which library shows up on our classpath certain parts of our project will either work or not. I'd really love it if Boilerpipe could be fixed or replaced with some other library that is a better citizen.

      I see I'm not the first person to run into this as another Tika user has filed a bug on the Boilerpipe project: https://code.google.com/p/boilerpipe/issues/detail?id=62

      Attachments

        1. TIKA-1484.patch
          20 kB
          Tim Allison

        Activity

          People

            Unassigned Unassigned
            chengas123 Java Developer
            Votes:
            1 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated: