Pig / PIG-1237

Piggybank MultiStorage - specify field to write in output

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: None
    • Labels:
      None

      Description

      I've modified the Piggybank MultiStorage class so that you can optionally specify the index of the field in each tuple to write to the output.
      This makes it possible to keep records with metadata such as a sequence number or time of upload, and then combine the data from these records into one file without the metadata.
      e.g. given the input records
      1: date type seq1 data
      2: date type seq2 data

      the output, grouped by type and ordered by sequence, contains only the data field:
      data
      data
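
      The behaviour described above can be sketched outside of Pig. The following Python snippet is only an illustration of the idea, not the code in the attached patch: the function name `split_by_field`, its parameters, and the sample records are all made up for this example.

```python
from collections import defaultdict


def split_by_field(records, split_index, output_index=None, delimiter="\t"):
    """Group tuples by the value at split_index (hypothetical helper).

    If output_index is None, the whole tuple is written, joined by delimiter.
    Otherwise only the field at output_index is written, which drops metadata.
    """
    outputs = defaultdict(list)
    for record in records:
        key = str(record[split_index])
        if output_index is None:
            line = delimiter.join(str(field) for field in record)
        else:
            line = str(record[output_index])
        outputs[key].append(line)
    return dict(outputs)


# Made-up sample records with the layout (date, type, seq, data).
records = [
    ("2010-02-11", "typeA", 2, "second payload"),
    ("2010-02-11", "typeA", 1, "first payload"),
    ("2010-02-11", "typeB", 1, "other payload"),
]

# Order by sequence number first, then split on the type field (index 1)
# and write only the data field (index 3).
ordered = sorted(records, key=lambda r: r[2])
result = split_by_field(ordered, split_index=1, output_index=3)
```

      Each key in `result` stands for one output file per type, holding only the data lines in sequence order. Leaving `output_index` unset writes the complete tuple, as in the unmodified behaviour.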

        Attachments

        1. PIG-1237.patch
          3 kB
          Gerrit Jansen van Vuuren

            People

            • Assignee:
              gerritjvv Gerrit Jansen van Vuuren
            • Reporter:
              gerritjvv Gerrit Jansen van Vuuren
            • Votes:
              0
            • Watchers:
              1
