Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4600

Delegate bzip processing to Hadoop

    Details

    • Type: Improvement
    • Status: Closed
    • Priority: Major
    • Resolution: Duplicate
    • Affects Version/s: None
    • Fix Version/s: 0.16.0
    • Component/s: None
    • Labels:
      None

      Description

      Pig handling bzip within its own codebase. I am not very sure but that's probably due to history reason. Hadoop does not support splittable bzip until 0.21 (HADOOP-4012). If so, we shall remove the bzip2 module in Pig and delegate to Hadoop.

      In particular, Pig does not support bzip2 concatenation. Once we delegate bzip to Hadoop, this issue will be solved.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                daijy Daniel Dai
                Reporter:
                daijy Daniel Dai
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: