Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-321 [Umbrella] Generic application history service
  3. YARN-1578

Fix how to read history file in FileSystemApplicationHistoryStore

    XMLWordPrintableJSON

    Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: YARN-321
    • Fix Version/s: 2.4.0
    • Component/s: None
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      I carried out PiEstimator job at Hadoop cluster which applied YARN-321.
      After the job end and when I accessed Web UI of HistoryServer, it displayed "500". And HistoryServer daemon log was output as follows.

      2014-01-09 13:31:12,227 ERROR org.apache.hadoop.yarn.webapp.Dispatcher: error handling URI: /applicationhistory/appattempt/appattempt_1389146249925_0008_000001
      java.lang.reflect.InvocationTargetException
              at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
              at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
              at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
              at java.lang.reflect.Method.invoke(Method.java:597)
              at org.apache.hadoop.yarn.webapp.Dispatcher.service(Dispatcher.java:153)
              at javax.servlet.http.HttpServlet.service(HttpServlet.java:820)
      (snip...)
      Caused by: java.lang.NullPointerException
              at org.apache.hadoop.yarn.server.applicationhistoryservice.FileSystemApplicationHistoryStore.mergeContainerHistoryData(FileSystemApplicationHistoryStore.java:696)
              at org.apache.hadoop.yarn.server.applicationhistoryservice.FileSystemApplicationHistoryStore.getContainers(FileSystemApplicationHistoryStore.java:429)
              at org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryManagerImpl.getContainers(ApplicationHistoryManagerImpl.java:201)
              at org.apache.hadoop.yarn.server.webapp.AppAttemptBlock.render(AppAttemptBlock.java:110)
      (snip...)
      

      I confirmed that there was container which was not finished from ApplicationHistory file.
      In ResourceManager daemon log, ResourceManager reserved this container, but did not allocate it.

      When FileSystemApplicationHistoryStore reads container information without finish data in history file, this problem occurs.
      In consideration of the case which there is not finish data, we should fix how to read history file in FileSystemApplicationHistoryStore.

        Attachments

        1. YARN-1578-4.patch
          14 kB
          Shinichi Yamashita
        2. YARN-1578-3.patch
          16 kB
          Shinichi Yamashita
        3. YARN-1578-2.patch
          5 kB
          Shinichi Yamashita
        4. YARN-1578.patch
          1 kB
          Shinichi Yamashita
        5. screenshot2.pdf
          155 kB
          Shinichi Yamashita
        6. screenshot.png
          68 kB
          Shinichi Yamashita
        7. resoucemanager.log
          218 kB
          Shinichi Yamashita
        8. application_1390978867235_0001
          11 kB
          Shinichi Yamashita

          Activity

            People

            • Assignee:
              sinchii Shinichi Yamashita
              Reporter:
              sinchii Shinichi Yamashita
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: