Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-3588

Integrate Pig with Morphlines.

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 0.12.0
    • None
    • internal-udfs
    • None

    Description

      Integrate Pig with Morphlines.

      Specifically, add support to Pig to call a morphline as a UDTF to leverage existing morphlines ETL functionality. Often, some flexible massaging needs to happen to get the input data into the shape that's desired, and morphline logic helps do this in a user-friendly, pluggable, efficient, pipelined manner. This issue basically boils down to transforming an input tuple into a morphline record, feeding the record into the morphline processing API, and finally converting zero or more morphline output records into corresponding Pig tuples.

      Some background is here:

      http://cloudera.github.io/cdk/docs/current/cdk-morphlines/morphlinesReferenceGuide.html

      http://cloudera.github.io/cdk/docs/current/cdk-morphlines/index.html

      Attachments

        Activity

          People

            Unassigned Unassigned
            whoschek Wolfgang Hoschek
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: