Pig / PIG-3839

Umbrella jira for Pig on Tez Performance Improvements

Details

    • Type: New Feature
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: tez
    • Labels: None

    Description

      This umbrella jira separates the performance improvements out of PIG-3446, which is the main jira for Pig on Tez.

        Issue Links

        1. Improve performance of replicate-join (Sub-task; Open; Unassigned)
        2. Improve performance of union (Sub-task; Closed; Rohini Palaniswamy)
        3. Use unsorted shuffle in Orderby, Skewed Join to improve performance in Tez (Sub-task; Open; Rohini Palaniswamy)
        4. Use shared edge with no multiquery (Sub-task; Open; Unassigned)
        5. Implement automatic reducer parallelism (Sub-task; Closed; Daniel Dai)
        6. Sort avoidance for group by and join (Sub-task; Open; Unassigned)
        7. Dynamically switch to replicate join (Sub-task; Open; Unassigned)
        8. Integrate YSmart into Pig on tez (Sub-task; Open; Unassigned)
        9. Optimize join followed by order by using same key (Sub-task; Open; Unassigned)
        10. Simplify plan of Limit on Tez (Sub-task; Open; Unassigned)
        11. UnionOptimizer in Tez should optimize the case of replicated join (Sub-task; Open; Unassigned)
        12. Handle two outputs from split going to same input in MultiQueryOptimizer (Sub-task; Resolved; Rohini Palaniswamy)
        13. Improve parallelism of union and join (Sub-task; Resolved; Rohini Palaniswamy)
        14. Use unsorted shuffle in Union (Sub-task; Closed; Rohini Palaniswamy)
        15. Improve performance of Limit following an Orderby on Tez (Sub-task; Open; Unassigned)
        16. Limit reduce task should start as soon as one map task finishes (Sub-task; Closed; Rohini Palaniswamy)
        17. Broadcast the index file in case of POMergeCoGroup and POMergeJoin (Sub-task; Resolved; Satish Saley)
        18. Rework Hash based aggregation for Tez (Sub-task; Open; Unassigned)
        19. 1-1 edge vertices should use same jvm opts (Sub-task; Closed; Rohini Palaniswamy)
        20. Size estimation should be done in sampler instead of sample aggregator (Sub-task; Open; Unassigned)
        21. Enhance Tez AM size estimation (Sub-task; Resolved; Unassigned)
        22. Better multi-query planning in case of multiple edges (Sub-task; Closed; Rohini Palaniswamy)
        23. Eliminate identity vertex for order by and skewed join right after LOAD (Sub-task; Closed; Rohini Palaniswamy)
        24. Optimize multi-query plan for diamond shape edges (Sub-task; Open; Rohini Palaniswamy)
        25. Eliminate duplicate split calculation for Order by and Skewed Join (Sub-task; Open; Unassigned)

          People

            Assignee: Unassigned
            Reporter: Rohini Palaniswamy

