Hive / HIVE-23443

LLAP speculative task pre-emption seems to be not working

Details

    Description

      I think that since HIVE-23210 the pre-emption queue has a stable sort order, and this prevents pre-emption from working in certain cases. For example, the queue can contain entries like these:

      "attempt_1589167813851_0000_119_01_000008_0 (hive_20200511055921_89598f09-19f1-4969-ab7a-82e2dd796273-119/Map 1, started at 2020-05-11 05:59:22, in preemption queue, can finish)", 
      "attempt_1589167813851_0008_84_01_000008_1 (hive_20200511055928_7ae29ca3-e67d-4d1f-b193-05651023b503-84/Map 1, started at 2020-05-11 06:00:23, in preemption queue, can finish)" 

      The scheduler only peeks at the head of the pre-emption queue and checks whether that task is non-finishable.

      https://github.com/apache/hive/blob/master/llap-server/src/java/org/apache/hadoop/hive/llap/daemon/impl/TaskExecutorService.java#L420

      In the above case, all tasks are speculative, but a state change does not trigger re-ordering of the pre-emption queue, so peek() always returns a canFinish task even though non-finishable tasks are in the queue.
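
      The behaviour described above can be reproduced with a plain java.util.PriorityQueue. The following is a minimal sketch, not the LLAP code and not necessarily what the attached patches do: the Task class, the VICTIM_ORDER comparator, and both method names are hypothetical stand-ins. It assumes the queue orders non-finishable tasks first, and it shows that flipping a task's state in place leaves the head stale, while removing and re-adding the task (one common remedy) restores the expected order.

```java
import java.util.Comparator;
import java.util.PriorityQueue;

class PreemptionQueueSketch {
    // Hypothetical stand-in for a queued task; not the real LLAP task wrapper.
    static final class Task {
        final String id;
        boolean canFinish;

        Task(String id, boolean canFinish) {
            this.id = id;
            this.canFinish = canFinish;
        }
    }

    // Assumed ordering: non-finishable tasks (canFinish == false) sort first,
    // so they are the preferred pre-emption victims; ties are broken by id.
    static final Comparator<Task> VICTIM_ORDER =
        Comparator.comparing((Task t) -> t.canFinish).thenComparing(t -> t.id);

    // Mutating a queued element in place does not re-order the heap,
    // so peek() keeps returning the old head.
    static String headAfterInPlaceChange() {
        PriorityQueue<Task> queue = new PriorityQueue<>(VICTIM_ORDER);
        Task a = new Task("a", true);
        Task b = new Task("b", true);
        queue.add(a);
        queue.add(b);
        b.canFinish = false; // state change the queue never learns about
        return queue.peek().id;
    }

    // Removing the element, changing its state, and re-adding it lets the
    // queue place it according to the new state.
    static String headAfterReinsert() {
        PriorityQueue<Task> queue = new PriorityQueue<>(VICTIM_ORDER);
        Task a = new Task("a", true);
        Task b = new Task("b", true);
        queue.add(a);
        queue.add(b);
        queue.remove(b);
        b.canFinish = false;
        queue.add(b);
        return queue.peek().id;
    }

    public static void main(String[] args) {
        System.out.println(headAfterInPlaceChange()); // prints "a" (stale head)
        System.out.println(headAfterReinsert());      // prints "b"
    }
}
```

      In the first method the head is still a finishable task, which matches the symptom reported here: a scheduler that only inspects peek() concludes there is nothing to pre-empt.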

      Attachments

        1. HIVE-23443.1.patch
          2 kB
          Prasanth Jayachandran
        2. HIVE-23443.2.patch
          9 kB
          Prasanth Jayachandran
        3. HIVE-23443.3.patch
          10 kB
          Prasanth Jayachandran

            People

              Assignee: prasanth_j Prasanth Jayachandran
              Reporter: prasanth_j Prasanth Jayachandran
              Votes: 0
              Watchers: 4

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 1.5h