Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-3839 Umbrella jira for Pig on Tez Performance Improvements
  3. PIG-4120

Broadcast the index file in case of POMergeCoGroup and POMergeJoin

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.18.0
    • tez
    • None
    • Reviewed

    Description

      Currently merge join and merge cogroup use two DAGs - the first DAG creates the index file in hdfs and second DAG does the merge join. Similar to replicate join, we can broadcast the index file and cache it and use it in merge join and merge cogroup. This will give better performance and also eliminate need for the second DAG.

      Attachments

        1. PIG-4120-5.patch
          70 kB
          Satish Saley
        2. PIG-4120-4.patch
          69 kB
          Satish Saley
        3. PIG-4120-3.patch
          65 kB
          Satish Saley
        4. PIG-4120-2.patch
          60 kB
          Satish Saley
        5. PIG-4120-1.patch
          27 kB
          Satish Saley

        Issue Links

          Activity

            People

              satishsaley Satish Saley
              rohini Rohini Palaniswamy
              Votes:
              1 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: