Apache Tez / TEZ-1264

Support for limiting output records in OnFileSortedOutput

Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved

    Description

      When limiting unsorted output, the Processor can stop as soon as it reaches the record count. When the limit has to be applied to sorted output in the map phase, that is not possible, because the sorting is done by OnFileSortedOutput rather than by the Processor. If limiting were supported as part of the output, records could be limited after the Partitioner is applied and before each part file is written.
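
      The requested behavior can be sketched in plain Java. This is only an illustration of the idea, not Tez code: the class `SortedLimitSketch`, the methods `sortAndLimit`, `rec`, and `sampleRecords`, and the first-letter partitioner are all hypothetical and do not correspond to any existing Tez API. The sketch partitions the records, sorts each partition by key, and keeps at most `limit` records per partition, which is the point at which a part file would be written.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.ToIntFunction;

// Hypothetical sketch; none of these names exist in Tez.
class SortedLimitSketch {

    // Partition the records, sort each partition by key, then keep at most
    // `limit` records per partition (assumes limit >= 0).
    public static <V> Map<Integer, List<Map.Entry<String, V>>> sortAndLimit(
            List<Map.Entry<String, V>> records,
            ToIntFunction<String> partitioner,
            int limit) {
        // Step 1: apply the partitioner to every record.
        Map<Integer, List<Map.Entry<String, V>>> partitions = new TreeMap<>();
        for (Map.Entry<String, V> record : records) {
            int p = partitioner.applyAsInt(record.getKey());
            partitions.computeIfAbsent(p, k -> new ArrayList<>()).add(record);
        }

        // Step 2: sort each partition, then truncate before "writing" it.
        Map<Integer, List<Map.Entry<String, V>>> limited = new TreeMap<>();
        for (Map.Entry<Integer, List<Map.Entry<String, V>>> e : partitions.entrySet()) {
            List<Map.Entry<String, V>> sorted = new ArrayList<>(e.getValue());
            sorted.sort(Map.Entry.comparingByKey());
            int keep = Math.min(limit, sorted.size());
            limited.put(e.getKey(), new ArrayList<>(sorted.subList(0, keep)));
        }
        return limited;
    }

    // Small helper to build a key/value record.
    public static Map.Entry<String, Integer> rec(String key, int value) {
        return new AbstractMap.SimpleEntry<>(key, value);
    }

    // Made-up input data for the example.
    public static List<Map.Entry<String, Integer>> sampleRecords() {
        return Arrays.asList(
                rec("pear", 1), rec("apple", 2), rec("fig", 3), rec("plum", 4),
                rec("kiwi", 5), rec("banana", 6), rec("cherry", 7));
    }

    public static void main(String[] args) {
        // Illustrative partitioner: keys starting before 'n' go to partition 0.
        Map<Integer, List<Map.Entry<String, Integer>>> out =
                sortAndLimit(sampleRecords(), k -> k.charAt(0) < 'n' ? 0 : 1, 2);
        System.out.println(out);
    }
}
```

      With a limit of 2, partition 0 keeps only apple and banana out of its five records, and partition 1 keeps pear and plum. A Processor could not make this cut itself, because it sees records before they are sorted; the limit has to be applied after partitioning and sorting, which is where the issue proposes putting it.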

            People

              Assignee: Rohini Palaniswamy (rohini)
              Reporter: Rohini Palaniswamy (rohini)
              Votes: 0
              Watchers: 4
