Jackrabbit Oak / OAK-6377

Text extraction with oak-run and Tika requires a fake string in the command to work


    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: 1.7.2, 1.8.0
    • Fix Version/s: 1.7.3, 1.8.0
    • Component/s: lucene, run
    • Labels: None

      Description

      According to the text-extraction documentation, the extract command currently does not need a segment store to be specified, so the following invocation should be sufficient:

          java -cp tika-app-1.15.jar:oak-run.jar \
          org.apache.jackrabbit.oak.run.Main tika \
          --data-file binary-stats.csv \
          --store-path ./store \
          --fds-path /path/to/datastore --extract
      

      However, the command parser expects a string argument after --extract, so the command above does not work as written. The workaround is to append a fake string at the end, e.g.:

      java -cp .......... --extract fakestore
      
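For reference, the workaround can be spelled out in full. This is only a sketch: it assumes the elided part of the workaround command is identical to the documented invocation shown earlier. The function name tika_extract_cmd is hypothetical, and "fakestore" is just the arbitrary placeholder used in this report. The function prints the command for review rather than running it.

```shell
# Sketch only: prints the documented tika invocation with a placeholder
# string appended after --extract.
# Assumption: the elided part of the workaround equals the documented command.
# tika_extract_cmd is a hypothetical helper name, not part of oak-run.
tika_extract_cmd() {
  # Any dummy string should do here; "fakestore" is the value from the report.
  placeholder="${1:-fakestore}"
  echo java -cp tika-app-1.15.jar:oak-run.jar \
    org.apache.jackrabbit.oak.run.Main tika \
    --data-file binary-stats.csv \
    --store-path ./store \
    --fds-path /path/to/datastore \
    --extract "$placeholder"
}

# Show the full command line without executing it.
tika_extract_cmd
```

To actually run the extraction, the printed line can be copied into a shell with the jar names and paths adjusted to the local setup.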

              People

              • Assignee: Chetan Mehrotra (chetanm)
              • Reporter: Paul Chibulcuteanu (chibulcu)
              • Votes: 0
              • Watchers: 3
