Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-4600

Delegate bzip processing to Hadoop

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • None
    • 0.16.0
    • None
    • None

    Description

      Pig handling bzip within its own codebase. I am not very sure but that's probably due to history reason. Hadoop does not support splittable bzip until 0.21 (HADOOP-4012). If so, we shall remove the bzip2 module in Pig and delegate to Hadoop.

      In particular, Pig does not support bzip2 concatenation. Once we delegate bzip to Hadoop, this issue will be solved.

      Attachments

        Issue Links

          Activity

            People

              daijy Daniel Dai
              daijy Daniel Dai
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: