Uploaded image for project: 'Pig'
  1. Pig
  2. PIG-3751

Generating Splits in Tez should be configurable to AM or client

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • tez
    • None

    Description

      1) TEZ-752 allows setting list of URIs to get delegation tokens. Set that to make Tez get delegation tokens and calculate input splits on AM
      2) Try using Tez Grouping of input splits instead of pig.maxCombinedSplitSize grouping.

      Generating splits in AM is supposed to give performance boost. For those case where InputFormat or OutputFormat get delegation tokens and it is not possible to do that, then have a option to generate input splits on client.

      Attachments

        Activity

          People

            rohini Rohini Palaniswamy
            rohini Rohini Palaniswamy
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated: