  1. Pig
  2. PIG-3839 Umbrella jira for Pig on Tez Performance Improvements
  3. PIG-3856

UnionOptimizer in Tez should optimize the case of replicated join

Details

    • Sub-task
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • tez
    • None

    Description

      A replicated join input that was broadcast to the union vertex now needs to be broadcast to all of the union's predecessors. To do that, we need to:

      • Create edges from the replicated join input to all the union predecessors.
      • Change the replicated join input to write to multiple outputs.

      This can be further optimized by using a shared edge, which is not yet implemented in Tez (TEZ-391).
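
      The rewiring described above can be sketched roughly as follows. This is only an illustration on a toy adjacency map, not Pig's actual Tez plan classes: the class name UnionReplicateSketch, the methods connect and pushBroadcastToUnionPredecessors, and the vertex names are all hypothetical and do not come from the patch.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical toy model: the plan is a map from a vertex name to the
// names of the vertices it writes to. This is NOT Pig's real plan API.
public class UnionReplicateSketch {

    // Add a directed edge from -> to, registering both vertices in the plan.
    static void connect(Map<String, Set<String>> plan, String from, String to) {
        plan.computeIfAbsent(from, k -> new LinkedHashSet<>()).add(to);
        plan.computeIfAbsent(to, k -> new LinkedHashSet<>());
    }

    // Re-target the replicated join input: instead of one broadcast edge to
    // the union vertex, it gets one edge per union predecessor, so it now
    // writes to multiple outputs.
    static void pushBroadcastToUnionPredecessors(
            Map<String, Set<String>> plan, String replInput, String union) {
        // Collect the union's other predecessors before mutating the plan.
        List<String> unionPreds = new ArrayList<>();
        for (Map.Entry<String, Set<String>> e : plan.entrySet()) {
            if (e.getValue().contains(union) && !e.getKey().equals(replInput)) {
                unionPreds.add(e.getKey());
            }
        }
        // Drop the old broadcast edge to the union vertex.
        plan.get(replInput).remove(union);
        // Create an edge from the replicated input to every union predecessor.
        for (String pred : unionPreds) {
            connect(plan, replInput, pred);
        }
    }
}
```

      In this toy model, a plan with load1 -> union, load2 -> union and small -> union ends up with small -> load1 and small -> load2. A shared edge as proposed in TEZ-391 would presumably let the replicated input write once rather than once per predecessor; that part is not modeled here.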

      Attachments

        1. PIG-3856-1.patch
          48 kB
          Rohini Palaniswamy

            People

              Assignee: Unassigned
              Reporter: Rohini Palaniswamy
              Votes: 0
              Watchers: 2