Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-4621

additional logging to help diagnose slow QJM logSync

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • 2.0.3-alpha
    • 2.1.0-beta
    • ha, qjm
    • None
    • Reviewed

    Description

      I've been working on diagnosing an issue with a cluster which is seeing slow logSync calls occasionally to QJM. Adding a few more pieces of logging would help this:

      • in the warning messages on the client side leading up to a timeout, include which nodes have responded and which ones are still pending
      • on the server side, when we actually call FileChannel.force, log a warning if the sync takes longer than 1 second

      Attachments

        1. hdfs-4621.txt
          5 kB
          Todd Lipcon

        Activity

          People

            tlipcon Todd Lipcon
            tlipcon Todd Lipcon
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: