Uploaded image for project: 'Apache Tez'
  1. Apache Tez
  2. TEZ-1397

Node affinity for tasks processing the same splits

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Within a session, if the same set of HDFS blocks are accessed by different tasks - these should ideally be launched on the same node for better buffer cache, etc utilization.
      This will likely end up being another level of requests higher up than NODE_LOCAL for the scheduler.

      Attachments

        Activity

          People

            sseth Siddharth Seth
            sseth Siddharth Seth
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated: