Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-9063

ATS 1.5 fails to start if RollingLevelDb files are corrupt or missing

    XMLWordPrintableJSON

Details

    Description

      ATS v1.5 fails to start up if there are some missing files in RollingLevelDBTimelineStore. YARN-6054 fixes this issue only for the LevelDBTimelineStore. Since RollingLevelDBTimelineStore opens multiple level db and rolls them, we need a separate fix for this. The error is shown below

      18/11/13 07:00:56 FATAL applicationhistoryservice.ApplicationHistoryServer: Error starting ApplicationHistoryServer 
      org.apache.hadoop.service.ServiceStateException: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: 1 missing files; e.g.: /tmp/ats_folder/yarn/timeline/leveldb-timeline-store/owner-ldb/000005.sst 
      at org.apache.hadoop.service.ServiceStateException.convert(ServiceStateException.java:59) 
      at org.apache.hadoop.service.AbstractService.init(AbstractService.java:172) 
      at org.apache.hadoop.service.CompositeService.serviceInit(CompositeService.java:107) 
      at org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryServer.serviceInit(ApplicationHistoryServer.java:111) 
      at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163) 
      at org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryServer.launchAppHistoryServer(ApplicationHistoryServer.java:174) 
      at org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryServer.main(ApplicationHistoryServer.java:184) 
      Caused by: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: 1 missing files; e.g.: /tmp/ats-folder/yarn/timeline/leveldb-timeline-store/owner-ldb/000005.sst 
      at org.fusesource.leveldbjni.internal.NativeDB.checkStatus(NativeDB.java:200) 
      at org.fusesource.leveldbjni.internal.NativeDB.open(NativeDB.java:218) 
      at org.fusesource.leveldbjni.JniDBFactory.open(JniDBFactory.java:168) 
      at org.apache.hadoop.yarn.server.timeline.RollingLevelDBTimelineStore.serviceInit(RollingLevelDBTimelineStore.java:321) 
      at org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
      

      Attachments

        Issue Links

          Activity

            People

              groot Ashutosh Gupta
              tarunparimi Tarun Parimi
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 5.5h
                  5.5h