Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-10406

YARN log processor

    XMLWordPrintableJSON

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Critical
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: yarn
    • Labels:
      None
    • Target Version/s:

      Description

      YARN currently does not have any utility that would enable cluster administrators to display previous actions in a Hadoop YARN cluster in an offline fashion.

      HDFS has the OIV/ OEV which does not require a running cluster to look and modify the filesystem. A corresponding tool would be very helpful in the context of YARN.

      Since ATS is not widespread (is not available for older clusters) and there isn't a single file or entity that would collect all the application/container etc. related information, we thought our best option to parse and process the output of the YARN daemon log files and reconstruct the history of the cluster from that. We designed and implemented a CLI based solution that after parsing the log file enables users to query app/container related information (listing, filtering by certain properties) and search for common errors like CE failures/error codes, AM preemption or stack traces. The tool can be integrated into the YARN project as a sub-project.

        Attachments

          Activity

            People

            • Assignee:
              mhudaky Hudáky Márton Gyula
              Reporter:
              adam.antal Adam Antal
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated: