Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-10350

By posting documents by post.jar i saw that it uses org.apache.tika.parser.txt.TXTParser" how can i change the parse that it also extract text from images which are inside pdf and also separate images like jpg

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Not A Problem
    • 6.4.1
    • None
    • Schema and Analysis
    • None

    Attachments

      Activity

        People

          Unassigned Unassigned
          waleed.raza Waleed Raza
          Votes:
          0 Vote for this issue
          Watchers:
          2 Start watching this issue

          Dates

            Created:
            Updated:
            Resolved: