Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-1358

Integration of Tika and DataImportHandler

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.5
    • Labels:
      None

      Description

      At the moment, it's impossible to configure Solr such that it build up documents by using data that comes from both pdf documents and database table columns. Currently, to accomplish this task, it's up to the user to add some preprocessing that converts pdf files into plain text files. Therefore, I would like to see an integration of Solr Cell into DIH that makes those preprocessing obsolete.

        Attachments

        1. SOLR-1358.patch
          7 kB
          Akshay K. Ukey
        2. SOLR-1358.patch
          7 kB
          Noble Paul
        3. SOLR-1358.patch
          7 kB
          Akshay K. Ukey
        4. SOLR-1358.patch
          20 kB
          Akshay K. Ukey

          Issue Links

          There are no Sub-Tasks for this issue.

            Activity

              People

              • Assignee:
                noble.paul Noble Paul
                Reporter:
                szott Sascha Szott
              • Votes:
                2 Vote for this issue
                Watchers:
                7 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: