Uploaded image for project: 'Hadoop YARN'
  1. Hadoop YARN
  2. YARN-1130

Improve the log flushing for tasks when mapred.userlog.limit.kb is set

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • 2.0.5-alpha
    • None
    • nodemanager
    • None

    Description

      When userlog limit is set with something like this:

      <property>
      <name>mapred.userlog.limit.kb</name>
      <value>2048</value>
      <description>The maximum size of user-logs of each task in KB. 0 disables the cap.
      </description>
      </property>
      

      the log entry will be truncated randomly for the jobs.

      The log size is left between 1.2MB to 1.6MB.

      Since the log is already limited, avoid the log truncation is crucial for user.

      The other issue with the current impl(org.apache.hadoop.yarn.ContainerLogAppender) is that log entries will not flush to file until the container shutdown and logmanager close all appenders. If user likes to see the log during task execution, it doesn't support it.

      Will propose a patch to add a flush mechanism and also flush the log when task is done.

      Attachments

        1. YARN-1130.patch
          11 kB
          Paul Han
        2. YARN-1130.patch
          11 kB
          Paul Han
        3. YARN-1130.patch
          12 kB
          Paul Han

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            paulhan Paul Han
            paulhan Paul Han
            Votes:
            0 Vote for this issue
            Watchers:
            11 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment