Details

      Description

      Tika 1.1 is being released soon. It features some new parsers, ability to extract text from password protected PDFs and office docs, and several bug fixes. See http://people.apache.org/~mattmann/apache-tika-1.1/rc1/CHANGES-1.1.txt

      We should upgrade as soon as it is released.

      1. SOLR-3254.patch
        6 kB
        Jan Høydahl
      2. SOLR-3254.patch
        61 kB
        Jan Høydahl
      3. SOLR-3254.patch
        67 kB
        Jan Høydahl
      4. SOLR-3254-NOTICE.patch
        0.7 kB
        Jan Høydahl

        Issue Links

          Activity

          Hide
          Jan Høydahl added a comment -

          Added NOTICE.TXT lines for Javassist, OggVorbis, Scannotation

          Show
          Jan Høydahl added a comment - Added NOTICE.TXT lines for Javassist, OggVorbis, Scannotation
          Hide
          Robert Muir added a comment -

          Thanks Jan!

          Show
          Robert Muir added a comment - Thanks Jan!
          Hide
          Jan Høydahl added a comment -

          Attached a NOTICE.TXT patch. Btw, the newest version of Javassist is triple-licensed including Apache, so this can probably go away later..

          Show
          Jan Høydahl added a comment - Attached a NOTICE.TXT patch. Btw, the newest version of Javassist is triple-licensed including Apache, so this can probably go away later..
          Hide
          Mark Miller added a comment -

          reopening so that we don't forget to respond to roberts mailing list comment:

          I don't understand how we are adding MPL dependencies without updating
          solr/NOTICE.txt here.

          "Although the source must not be included in Apache products, the
          NOTICE file, which is required to be included in each ASF
          distribution, must point to the source form of the included binary
          (more on that in the forthcoming "Receiving and Releasing
          Contributions" document)."

          http://www.apache.org/legal/3party.html (Category B: Reciprocal Licenses)

          Show
          Mark Miller added a comment - reopening so that we don't forget to respond to roberts mailing list comment: I don't understand how we are adding MPL dependencies without updating solr/NOTICE.txt here. "Although the source must not be included in Apache products, the NOTICE file, which is required to be included in each ASF distribution, must point to the source form of the included binary (more on that in the forthcoming "Receiving and Releasing Contributions" document)." http://www.apache.org/legal/3party.html (Category B: Reciprocal Licenses)
          Hide
          Jan Høydahl added a comment -

          Committed

          Show
          Jan Høydahl added a comment - Committed
          Hide
          Jan Høydahl added a comment -

          New patch also fixing *.jar.sha1 files

          Show
          Jan Høydahl added a comment - New patch also fixing *.jar.sha1 files
          Hide
          Jan Høydahl added a comment -

          New patch including LICENSE/NOTICE for new jars. Passes testa, will commit soon..

          Show
          Jan Høydahl added a comment - New patch including LICENSE/NOTICE for new jars. Passes testa, will commit soon..
          Hide
          Jan Høydahl added a comment -

          Here's the major news in v1.1: http://tika.apache.org/1.1/

          I have not tried to exclude any parsers at all - such optimization is left for another issue...

          Show
          Jan Høydahl added a comment - Here's the major news in v1.1: http://tika.apache.org/1.1/ I have not tried to exclude any parsers at all - such optimization is left for another issue...
          Hide
          Jan Høydahl added a comment -

          With Ivy it's really easy to do the Tika upgrade, and the patch becomes an appliable plaintext patch!

          This patch also adds some comments to the dependencies section with instructions for upgrading, and rearranges the deps to match the order listed in http://tika.apache.org/1.1/gettingstarted.html#Using_Tika_as_a_Maven_dependency

          It also removes a non-used xml-apis dep

          Show
          Jan Høydahl added a comment - With Ivy it's really easy to do the Tika upgrade, and the patch becomes an appliable plaintext patch! This patch also adds some comments to the dependencies section with instructions for upgrading, and rearranges the deps to match the order listed in http://tika.apache.org/1.1/gettingstarted.html#Using_Tika_as_a_Maven_dependency It also removes a non-used xml-apis dep

            People

            • Assignee:
              Jan Høydahl
              Reporter:
              Jan Høydahl
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development