Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-3839

Umbrella jira for Pig on Tez Performance Improvements

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • tez
    • None

    Description

      Separating out performance improvements from PIG-3446 which is the main jira for Pig on Tez.

      Attachments

        Issue Links

          1.
          Improve performance of replicate-join Sub-task Open Unassigned
          2.
          Improve performance of union Sub-task Closed Rohini Palaniswamy
          3.
          Use unsorted shuffle in Orderby, Skewed Join to improve performance in Tez Sub-task Open Rohini Palaniswamy
          4.
          Use shared edge with no multiquery Sub-task Open Unassigned
          5.
          Implement automatic reducer parallelism Sub-task Closed Daniel Dai
          6.
          Sort avoidance for group by and join Sub-task Open Unassigned
          7.
          Dynamically switch to replicate join Sub-task Open Unassigned
          8.
          Integrate YSmart into Pig on tez Sub-task Open Unassigned
          9.
          Optimize join followed by order by using same key Sub-task Open Unassigned
          10.
          Simplify plan of Limit on Tez Sub-task Open Unassigned
          11.
          UnionOptimizer in Tez should optimize the case of replicated join Sub-task Open Unassigned
          12.
          Handle two outputs from split going to same input in MultiQueryOptimizer Sub-task Resolved Rohini Palaniswamy
          13.
          Improve parallelism of union and join Sub-task Resolved Rohini Palaniswamy
          14.
          Use unsorted shuffle in Union Sub-task Closed Rohini Palaniswamy
          15.
          Improve performance of Limit following an Orderby on Tez Sub-task Open Unassigned
          16.
          Limit reduce task should start as soon as one map task finishes Sub-task Closed Rohini Palaniswamy
          17.
          Broadcast the index file in case of POMergeCoGroup and POMergeJoin Sub-task Resolved Satish Saley
          18.
          Rework Hash based aggregation for Tez Sub-task Open Unassigned
          19.
          1-1 edge vertices should use same jvm opts Sub-task Closed Rohini Palaniswamy
          20.
          Size estimation should be done in sampler instead of sample aggregator Sub-task Open Unassigned
          21.
          Enhance Tez AM size estimation Sub-task Resolved Unassigned
          22.
          Better multi-query planning in case of multiple edges Sub-task Closed Rohini Palaniswamy
          23.
          Eliminate identity vertex for order by and skewed join right after LOAD Sub-task Closed Rohini Palaniswamy
          24.
          Optimize multi-query plan for diamond shape edges Sub-task Open Rohini Palaniswamy
          25.
          Eliminate duplicate split calculation for Order by and Skewed Join Sub-task Open Unassigned

          Activity

            People

              Unassigned Unassigned
              rohini Rohini Palaniswamy
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated: