SOLR-1358

Integration of Tika and DataImportHandler

    Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.5
    • Labels:
      None

      Description

      At the moment, it is impossible to configure Solr so that it builds documents from data that comes from both PDF documents and database table columns. To accomplish this today, the user has to add a preprocessing step that converts the PDF files into plain text files. Therefore, I would like to see an integration of Solr Cell into DIH that makes this preprocessing obsolete.
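      To illustrate the kind of merge the description asks for, here is a minimal Python sketch. It is not DIH or Solr Cell code; it only shows one document being assembled from database columns plus text extracted from a file. The table `papers`, its columns, and the functions `build_documents` and `fake_extract_text` are hypothetical names chosen for the example, and the extractor is a stand-in for whatever PDF-to-text step is actually used.

```python
import sqlite3
from typing import Callable, Dict, Iterator


def build_documents(
    conn: sqlite3.Connection,
    extract_text: Callable[[str], str],
) -> Iterator[Dict[str, str]]:
    """Yield one document per row, merging DB columns with extracted file text."""
    # Hypothetical schema: each row points at the PDF that belongs to it.
    cursor = conn.execute("SELECT id, title, pdf_path FROM papers ORDER BY id")
    for doc_id, title, pdf_path in cursor:
        yield {
            "id": str(doc_id),
            "title": title,                  # comes from the database column
            "text": extract_text(pdf_path),  # comes from the file content
        }


def fake_extract_text(path: str) -> str:
    """Stand-in for a real PDF-to-text extractor (assumption, no parsing done)."""
    return f"plain text extracted from {path}"


# Small in-memory database so the sketch runs on its own.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE papers (id INTEGER PRIMARY KEY, title TEXT, pdf_path TEXT)"
)
conn.executemany(
    "INSERT INTO papers VALUES (?, ?, ?)",
    [(1, "First paper", "/data/1.pdf"), (2, "Second paper", "/data/2.pdf")],
)

docs = list(build_documents(conn, fake_extract_text))
```

      The extractor is passed in as a parameter so that the merge logic stays independent of how the file content is obtained, which is the separation the requested integration would provide inside a single import configuration.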

      1. SOLR-1358.patch
        7 kB
        Akshay K. Ukey
      2. SOLR-1358.patch
        7 kB
        Noble Paul
      3. SOLR-1358.patch
        7 kB
        Akshay K. Ukey
      4. SOLR-1358.patch
        20 kB
        Akshay K. Ukey

            People

            • Assignee: Noble Paul
            • Reporter: Sascha Szott
            • Votes: 2
            • Watchers: 10
