Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-3113

Shell command execution hangs job

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • 0.8.1
    • None
    • impl
    • None

    Description

      Executing a shell command inside a Pig script has the potential to deadlock the job. For example, the following statement will block when somebigfile.txt is sufficiently large:

      %declare input `cat /path/to/somebigfile.txt`
      

      This happens because PreprocessorContext.executeShellCommand(String) incorrectly uses Runtime.exec(). The sub-process's stderr and stdout streams should be read in a separate thread to prevent p.waitFor() from hanging when the sub-process's output is larger than the output buffer.

      Per the Java Process class javadoc: "Because some native platforms only provide limited buffer size for standard input and output streams, failure to promptly write the input stream or read the output stream of the subprocess may cause the subprocess to block, and even deadlock".

      See http://www.javaworld.com/jw-12-2000/jw-1229-traps.html for a correct solution.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              sgtgrumbles James
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: