Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-3839

Umbrella jira for Pig on Tez Performance Improvements

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: tez
    • Labels:
      None

      Description

      Separating out performance improvements from PIG-3446 which is the main jira for Pig on Tez.

        Attachments

          Issue Links

          1.
          Improve performance of replicate-join Sub-task Open Unassigned
          2.
          Improve performance of union Sub-task Closed Rohini Palaniswamy
          3.
          Use unsorted shuffle in Orderby, Skewed Join to improve performance in Tez Sub-task Open Rohini Palaniswamy
          4.
          Use shared edge with no multiquery Sub-task Open Unassigned
          5.
          Implement automatic reducer parallelism Sub-task Closed Daniel Dai
          6.
          Sort avoidance for group by and join Sub-task Open Unassigned
          7.
          Dynamically switch to replicate join Sub-task Open Unassigned
          8.
          Integrate YSmart into Pig on tez Sub-task Open Unassigned
          9.
          Optimize join followed by order by using same key Sub-task Open Unassigned
          10.
          Simplify plan of Limit on Tez Sub-task Open Unassigned
          11.
          UnionOptimizer in Tez should optimize the case of replicated join Sub-task Open Unassigned
          12.
          Handle two outputs from split going to same input in MultiQueryOptimizer Sub-task Resolved Rohini Palaniswamy
          13.
          Improve parallelism of union and join Sub-task Resolved Rohini Palaniswamy
          14.
          Use unsorted shuffle in Union Sub-task Closed Rohini Palaniswamy
          15.
          Improve performance of Limit following an Orderby on Tez Sub-task Open Unassigned
          16.
          Limit reduce task should start as soon as one map task finishes Sub-task Closed Rohini Palaniswamy
          17.
          Broadcast the index file in case of POMergeCoGroup and POMergeJoin Sub-task Resolved Satish Subhashrao Saley
          18.
          Rework Hash based aggregation for Tez Sub-task Open Unassigned
          19.
          1-1 edge vertices should use same jvm opts Sub-task Closed Rohini Palaniswamy
          20.
          Size estimation should be done in sampler instead of sample aggregator Sub-task Open Unassigned
          21.
          Enhance Tez AM size estimation Sub-task Resolved Unassigned
          22.
          Better multi-query planning in case of multiple edges Sub-task Closed Rohini Palaniswamy
          23.
          Eliminate identity vertex for order by and skewed join right after LOAD Sub-task Closed Rohini Palaniswamy
          24.
          Optimize multi-query plan for diamond shape edges Sub-task Open Rohini Palaniswamy
          25.
          Eliminate duplicate split calculation for Order by and Skewed Join Sub-task Open Unassigned

            Activity

              People

              • Assignee:
                Unassigned
                Reporter:
                rohini Rohini Palaniswamy
              • Votes:
                0 Vote for this issue
                Watchers:
                4 Start watching this issue

                Dates

                • Created:
                  Updated: