Uploaded image for project: 'Tika'
  1. Tika
  2. TIKA-2780

Intermittent failures in batch mode when STDIN = /tmp/null

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.19.1
    • 2.0.0, 1.20
    • batch
    • None

    Description

      We found this strange behavior in the R wrapper for tika:

      We call rtika in batch mode using the -fileList parameter. However if in the java tika process STDIN is set to /dev/null with read permission (O_RDONLY) as is usually the default then, then tika will randomly fail occasionally with this error:

      {{  BatchProcess:ParallelFileProcessingResult

      {considered=7, added=7, consumed=2, numberHandledExceptions=0, secondsElapsed=1.539, exitStatus=0, causeForTermination='USER_INTERRUPTION'}

      }}

      As a workaround we now set STDIN to something that is unreadable and then everything works, but this is really strange behavior.

      Attachments

        Activity

          People

            tallison Tim Allison
            jeroenooms Jeroen
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: