Pig / PIG-3179

Task Information Header only prints out the first split for each task

Details

    • Type: Improvement
    • Status: Closed
    • Priority: Trivial
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.12.0
    • Component/s: None
    • Labels: None

    Description

      When a task's PigSplit contains more than one wrappedSplit, the Task Information Header logs the file info for the first split only.

      When debugging, I saw

      ===== Task Information Header =====
      Command: bash ....
      Start time: Mon Feb 11 16:41:21 UTC 2013
      Input-split file: hdfs://abc.bcd.efg:8020/tmp/hij/part-r-00000.bz2
      Input-split start-offset: 0
      Input-split length: 11854247
      

      but the actual error was happening while reading part-r-00007.bz2. It would have been nice if the log had shown the info for every split the task was going to read.
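      The requested behavior can be sketched roughly as follows. This is only an illustration, not the code from the attached patches: `SplitInfo`, `describeSplits`, the indexed label format, and the sample paths and lengths are all hypothetical stand-ins, and none of Pig's or Hadoop's real classes are used. The idea is to loop over every wrapped split and emit its file, start offset, and length, rather than reading only the first one.

```java
import java.util.Arrays;
import java.util.List;

class TaskHeaderSketch {

    // Hypothetical stand-in for one wrapped file split; not a Pig or Hadoop class.
    static final class SplitInfo {
        final String path;
        final long start;
        final long length;

        SplitInfo(String path, long start, long length) {
            this.path = path;
            this.start = start;
            this.length = length;
        }
    }

    // Builds header lines for every wrapped split instead of only the first.
    // The label format here is an assumption made for this sketch.
    static String describeSplits(List<SplitInfo> splits) {
        StringBuilder sb = new StringBuilder();
        sb.append("Total input splits: ").append(splits.size()).append('\n');
        for (int i = 0; i < splits.size(); i++) {
            SplitInfo s = splits.get(i);
            sb.append("Input-split[").append(i).append("] file: ").append(s.path).append('\n');
            sb.append("Input-split[").append(i).append("] start-offset: ").append(s.start).append('\n');
            sb.append("Input-split[").append(i).append("] length: ").append(s.length).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Made-up paths and sizes, used only to exercise the method.
        List<SplitInfo> splits = Arrays.asList(
                new SplitInfo("hdfs://example/tmp/part-r-00000.bz2", 0L, 100L),
                new SplitInfo("hdfs://example/tmp/part-r-00007.bz2", 0L, 200L));
        System.out.print(describeSplits(splits));
    }
}
```

      With a header like this, a failure while reading a later file such as part-r-00007.bz2 could be matched against the task log directly, because that file would appear in the header alongside the first split.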

      Attachments

        1. pig-3179-v01.patch
          2 kB
          Koji Noguchi
        2. pig-3179-v02.patch
          2 kB
          Koji Noguchi
        3. pig-3179-v03.patch
          2 kB
          Koji Noguchi
        4. pig-3179-v04.patch
          2 kB
          Koji Noguchi

          People

            Assignee: Koji Noguchi (knoguchi)
            Reporter: Koji Noguchi (knoguchi)