Uploaded image for project: 'Solr'
  1. Solr
  2. SOLR-2424

extracted text from tika has no spaces

    XMLWordPrintableJSON

Details

    Description

      Try this:
      curl "http://localhost:8983/solr/update/extract?extractOnly=true&wt=json&indent=true" -F "tutorial=@tutorial.pdf"
      And you get text output w/o spaces: "ThisdocumentcoversthebasicsofrunningSolru"...

      Attachments

        1. ET2000 Service Manual.pdf
          8.86 MB
          Liam O'Boyle

        Activity

          People

            Unassigned Unassigned
            yseeley@gmail.com Yonik Seeley
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: