Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-932

Required fields projection in Loader: nested fields in bag/tuple, map key lookup more than two levels

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Duplicate
    • 0.3.0
    • 0.8.0
    • impl
    • None

    Description

      To leverage the performance features provided by Zebra, Pig should be able to figure out which input fields are actually used in Pig script, and prune unnecessary inputs. This feature is being implementing in PIG-922. However, there are two limitations currently:

      1. Pruning nested fields only apply to map. We do not prune sub-field inside a bag or tuple
      2. For map, currently we only go one level deep. Eg, if in Pig script, user uses a#'key0'#'key1', a#'key0' will be asked

      These two limitations are in line with current limitation of Zebra loader. Once Zebra loader can handle this, we need to work to lift these limitations.

      Attachments

        Activity

          People

            daijy Daniel Dai
            daijy Daniel Dai
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: