Solr
  1. Solr
  2. SOLR-2875

Incorrect url of tika-data-config.xml in example-DIH

    Details

    • Type: Bug Bug
    • Status: Closed
    • Priority: Trivial Trivial
    • Resolution: Fixed
    • Affects Version/s: 4.0-ALPHA
    • Fix Version/s: 3.5, 4.0-ALPHA
    • Labels:
      None
    • Environment:

      solr boot:java -Dsolr.solr.home=~/trunk/solr/example/example-DIH/solr/tika -jar start.jar

      Description

      The specified url in tika-data-config.xml is not correct path. So when running full-import, exception is thrown.

      2011/11/04 16:48:26 org.apache.solr.common.SolrException log
      ?v???I: Full Import failed:java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: java.lang.RuntimeException: java.io.FileNotFoundException: Could not find file: ../contrib/extraction/src/test/resources/solr-word.pdf
      at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:261)
      at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:372)
      :
      :
      Caused by: java.io.FileNotFoundException: Could not find file: ../contrib/extraction/src/test/resources/solr-word.pdf
      at org.apache.solr.handler.dataimport.FileDataSource.getFile(FileDataSource.java:110)

      1. SOLR-2875.patch
        0.8 kB
        Shinichiro Abe

        Issue Links

          Activity

          Hide
          Koji Sekiguchi added a comment -

          I can reproduce the problem and the patch looks good!

          Show
          Koji Sekiguchi added a comment - I can reproduce the problem and the patch looks good!
          Hide
          Koji Sekiguchi added a comment -

          committed trunk and 3x. Thanks Abe-san!

          Show
          Koji Sekiguchi added a comment - committed trunk and 3x. Thanks Abe-san!
          Hide
          Uwe Schindler added a comment -

          Bulk close after 3.5 is released

          Show
          Uwe Schindler added a comment - Bulk close after 3.5 is released
          Hide
          Frank Ren added a comment -

          This file, solr-word.pdf, is not shipped in the binary release, 4.9.0.

          Show
          Frank Ren added a comment - This file, solr-word.pdf, is not shipped in the binary release, 4.9.0.
          Hide
          Shinichiro Abe added a comment -

          Yes, the binary don't always have any pdf files. This data import will be completed successfully on the source.

          Show
          Shinichiro Abe added a comment - Yes, the binary don't always have any pdf files. This data import will be completed successfully on the source.
          Hide
          Frank Ren added a comment -

          It would be helpful to inform people of this somewhere, say,
          example/example-DIH/README.txt. Some instruction for tika would also be
          appreciated. Thanks

          On Thu, Jul 17, 2014 at 4:08 PM, Shinichiro Abe (JIRA) <jira@apache.org>

          Show
          Frank Ren added a comment - It would be helpful to inform people of this somewhere, say, example/example-DIH/README.txt. Some instruction for tika would also be appreciated. Thanks On Thu, Jul 17, 2014 at 4:08 PM, Shinichiro Abe (JIRA) <jira@apache.org>

            People

            • Assignee:
              Koji Sekiguchi
              Reporter:
              Shinichiro Abe
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved:

                Development