Solr / SOLR-1358

Integration of Tika and DataImportHandler

Details

    • Type: New Feature
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • None
    • 1.5
    • None

    Description

      At the moment, it is impossible to configure Solr so that it builds documents from data that comes from both PDF documents and database table columns. Currently, to accomplish this, the user has to add a preprocessing step that converts PDF files into plain text files. Therefore, I would like to see an integration of Solr Cell into DIH that makes this preprocessing obsolete.

      Attachments

        1. SOLR-1358.patch
          20 kB
          Akshay K. Ukey
        2. SOLR-1358.patch
          7 kB
          Akshay K. Ukey
        3. SOLR-1358.patch
          7 kB
          Noble Paul
        4. SOLR-1358.patch
          7 kB
          Akshay K. Ukey

          People

            Assignee: Noble Paul (noble.paul)
            Reporter: Sascha Szott (szott)
            Votes: 2
            Watchers: 7

            Dates

              Created:
              Updated:
              Resolved:
