Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-8558

NM recovery level db not cleaned up properly on container finish

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 3.0.0, 3.1.0
    • Fix Version/s: 3.2.0, 3.1.1, 3.0.4
    • Component/s: None
    • Labels:
      None
    • Target Version/s:

      Description

      2018-07-20 16:49:23,117 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl: Application application_1531994217928_0054 transitioned from NEW to INITING
      2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000018 with incomplete records
      2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000019 with incomplete records
      2018-07-20 16:49:23,204 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000020 with incomplete records
      2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000021 with incomplete records
      2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000022 with incomplete records
      2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000023 with incomplete records
      2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000024 with incomplete records
      2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000025 with incomplete records
      2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000038 with incomplete records
      2018-07-20 16:49:23,205 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000039 with incomplete records
      2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000041 with incomplete records
      2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000044 with incomplete records
      2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000046 with incomplete records
      2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000049 with incomplete records
      2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000052 with incomplete records
      2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000054 with incomplete records
      2018-07-20 16:49:23,206 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000073 with incomplete records
      2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000074 with incomplete records
      2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000075 with incomplete records
      2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000078 with incomplete records
      2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000079 with incomplete records
      2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000082 with incomplete records
      2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000083 with incomplete records
      2018-07-20 16:49:23,207 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_000085 with incomplete records
      2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627738 with incomplete records
      2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627742 with incomplete records
      2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627746 with incomplete records
      2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627749 with incomplete records
      2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627753 with incomplete records
      2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627757 with incomplete records
      2018-07-20 16:49:23,208 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627761 with incomplete records
      2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627765 with incomplete records
      2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627769 with incomplete records
      2018-07-20 16:49:23,209 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0001_01_1099511627773 with incomplete records
      2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627679 with incomplete records
      2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627681 with incomplete records
      2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627684 with incomplete records
      2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627690 with incomplete records
      2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627695 with incomplete records
      2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627696 with incomplete records
      2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627702 with incomplete records
      2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627706 with incomplete records
      2018-07-20 16:49:23,210 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627710 with incomplete records
      2018-07-20 16:49:23,211 WARN org.apache.hadoop.yarn.server.nodemanager.recovery.NMLeveldbStateStoreService: Remove container container_1531994217928_0002_01_1099511627712 with incomplete records
      
      

      NM state store size could increase in long running scenarios, and recovery could be slow

        Attachments

        1. YARN-8558-branch-3.0.003.patch
          5 kB
          Bibin A Chundatt
        2. YARN-8558-branch-3.0.002.patch
          5 kB
          Bibin A Chundatt
        3. YARN-8558.002.patch
          5 kB
          Bibin A Chundatt
        4. YARN-8558.001.patch
          3 kB
          Bibin A Chundatt

          Activity

            People

            • Assignee:
              bibinchundatt Bibin A Chundatt
              Reporter:
              bibinchundatt Bibin A Chundatt
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: