PIG-1237: Piggybank MultiStorage - specify field to write in output

Details

    • Type: Improvement
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved

    Description

      I've modified the piggybank MultiStorage class so that you can optionally specify the index of the field in each tuple to write to the output.
      This makes it possible to keep metadata such as a sequence number or upload time on each record, and then combine the data from these records into one file without the metadata.
      e.g. given the input records
      1: date type seq1 data
      2: date type seq2 data

      the output, grouped by type and ordered by sequence, contains only the data field:
      data
      data
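
      The behaviour described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the code in the attached patch and not the MultiStorage API: the function name `split_by_field`, its parameters, and the sample records (including the dates and type names) are all hypothetical. In a real Pig script the ordering would presumably be done before the STORE statement; it is folded into the sketch here to keep it self-contained.

```python
from collections import defaultdict


def split_by_field(records, split_index, sort_index, output_index=None):
    """Group tuples by one field and emit one list of output lines per group.

    Hypothetical helper for illustration only.
    split_index  -- field whose value selects the output group (here: type)
    sort_index   -- field used to order records inside a group (here: seq)
    output_index -- if given, only this field is written; otherwise the
                    whole tuple is written, tab-separated
    """
    groups = defaultdict(list)
    for rec in records:
        groups[rec[split_index]].append(rec)

    result = {}
    for key, recs in groups.items():
        ordered = sorted(recs, key=lambda r: r[sort_index])
        if output_index is None:
            # No field selected: write every field of the tuple.
            result[key] = ["\t".join(str(f) for f in r) for r in ordered]
        else:
            # Field selected: drop the metadata and keep only that field.
            result[key] = [str(r[output_index]) for r in ordered]
    return result


# Sample records in the shape (date, type, seq, data); values are made up.
records = [
    ("2010-02-12", "typeA", 2, "second chunk"),
    ("2010-02-12", "typeA", 1, "first chunk"),
    ("2010-02-12", "typeB", 1, "only chunk"),
]

data_only = split_by_field(records, split_index=1, sort_index=2, output_index=3)
full_rows = split_by_field(records, split_index=1, sort_index=2)
```

      With `output_index` set, each group holds only the data values in sequence order; without it, each line still carries the date, type and sequence number, which is the case the proposed option is meant to avoid.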

      Attachments

        1. PIG-1237.patch
          3 kB
          Gerrit Jansen van Vuuren

          People

            gerritjvv Gerrit Jansen van Vuuren