Details

      Description

      Tika 1.1 is being released soon. It features some new parsers, ability to extract text from password protected PDFs and office docs, and several bug fixes. See http://people.apache.org/~mattmann/apache-tika-1.1/rc1/CHANGES-1.1.txt

      We should upgrade as soon as it is released.

      1. SOLR-3254-NOTICE.patch
        0.7 kB
        Jan Høydahl
      2. SOLR-3254.patch
        67 kB
        Jan Høydahl
      3. SOLR-3254.patch
        61 kB
        Jan Høydahl
      4. SOLR-3254.patch
        6 kB
        Jan Høydahl

        Issue Links

          Activity

          Uwe Schindler made changes -
          Status Resolved [ 5 ] Closed [ 6 ]
          Jan Høydahl made changes -
          Link This issue is cloned as SOLR-3707 [ SOLR-3707 ]
          Jan Høydahl made changes -
          Link This issue is cloned as SOLR-3707 [ SOLR-3707 ]
          sarowe committed 1311566 (1 file)
          Reviews: none

          SOLR-3254: maven configuration: tika dependency version -> 1.1

          Jan Høydahl made changes -
          Status Reopened [ 4 ] Resolved [ 5 ]
          Resolution Fixed [ 1 ]
          Hide
          Jan Høydahl added a comment -

          Added NOTICE.TXT lines for Javassist, OggVorbis, Scannotation

          Show
          Jan Høydahl added a comment - Added NOTICE.TXT lines for Javassist, OggVorbis, Scannotation
          Jan Høydahl committed 1311301 (1 file)
          Reviews: none

          SOLR-3254: NOTICE.TXT entries for Javassist (MPL), OggVorbis, Scannotation

          Hide
          Robert Muir added a comment -

          Thanks Jan!

          Show
          Robert Muir added a comment - Thanks Jan!
          Jan Høydahl made changes -
          Attachment SOLR-3254-NOTICE.patch [ 12521954 ]
          Hide
          Jan Høydahl added a comment -

          Attached a NOTICE.TXT patch. Btw, the newest version of Javassist is triple-licensed including Apache, so this can probably go away later..

          Show
          Jan Høydahl added a comment - Attached a NOTICE.TXT patch. Btw, the newest version of Javassist is triple-licensed including Apache, so this can probably go away later..
          Mark Miller made changes -
          Resolution Fixed [ 1 ]
          Status Resolved [ 5 ] Reopened [ 4 ]
          Hide
          Mark Miller added a comment -

          reopening so that we don't forget to respond to roberts mailing list comment:

          I don't understand how we are adding MPL dependencies without updating
          solr/NOTICE.txt here.

          "Although the source must not be included in Apache products, the
          NOTICE file, which is required to be included in each ASF
          distribution, must point to the source form of the included binary
          (more on that in the forthcoming "Receiving and Releasing
          Contributions" document)."

          http://www.apache.org/legal/3party.html (Category B: Reciprocal Licenses)

          Show
          Mark Miller added a comment - reopening so that we don't forget to respond to roberts mailing list comment: I don't understand how we are adding MPL dependencies without updating solr/NOTICE.txt here. "Although the source must not be included in Apache products, the NOTICE file, which is required to be included in each ASF distribution, must point to the source form of the included binary (more on that in the forthcoming "Receiving and Releasing Contributions" document)." http://www.apache.org/legal/3party.html (Category B: Reciprocal Licenses)
          Jan Høydahl made changes -
          Status Open [ 1 ] Resolved [ 5 ]
          Resolution Fixed [ 1 ]
          Hide
          Jan Høydahl added a comment -

          Committed

          Show
          Jan Høydahl added a comment - Committed
          Jan Høydahl committed 1311198 (33 files)
          Reviews: none

          SOLR-3254: Upgrade Solr to Tika 1.1

          Lucene trunk
          Jan Høydahl made changes -
          Attachment SOLR-3254.patch [ 12521900 ]
          Hide
          Jan Høydahl added a comment -

          New patch also fixing *.jar.sha1 files

          Show
          Jan Høydahl added a comment - New patch also fixing *.jar.sha1 files
          Jan Høydahl made changes -
          Attachment SOLR-3254.patch [ 12521116 ]
          Hide
          Jan Høydahl added a comment -

          New patch including LICENSE/NOTICE for new jars. Passes testa, will commit soon..

          Show
          Jan Høydahl added a comment - New patch including LICENSE/NOTICE for new jars. Passes testa, will commit soon..
          Jan Høydahl made changes -
          Assignee Jan Høydahl [ janhoy ]
          Hide
          Jan Høydahl added a comment -

          Here's the major news in v1.1: http://tika.apache.org/1.1/

          I have not tried to exclude any parsers at all - such optimization is left for another issue...

          Show
          Jan Høydahl added a comment - Here's the major news in v1.1: http://tika.apache.org/1.1/ I have not tried to exclude any parsers at all - such optimization is left for another issue...
          Jan Høydahl made changes -
          Attachment SOLR-3254.patch [ 12520806 ]
          Hide
          Jan Høydahl added a comment -

          With Ivy it's really easy to do the Tika upgrade, and the patch becomes an appliable plaintext patch!

          This patch also adds some comments to the dependencies section with instructions for upgrading, and rearranges the deps to match the order listed in http://tika.apache.org/1.1/gettingstarted.html#Using_Tika_as_a_Maven_dependency

          It also removes a non-used xml-apis dep

          Show
          Jan Høydahl added a comment - With Ivy it's really easy to do the Tika upgrade, and the patch becomes an appliable plaintext patch! This patch also adds some comments to the dependencies section with instructions for upgrading, and rearranges the deps to match the order listed in http://tika.apache.org/1.1/gettingstarted.html#Using_Tika_as_a_Maven_dependency It also removes a non-used xml-apis dep
          Jan Høydahl made changes -
          Field Original Value New Value
          Link This issue is depended upon by SOLR-1929 [ SOLR-1929 ]
          Jan Høydahl created issue -

            People

            • Assignee:
              Jan Høydahl
              Reporter:
              Jan Høydahl
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development