  1. Hadoop HDFS
  2. HDFS-10733

NameNode terminated after full GC thinking QJM is unresponsive.

    Details

    • Hadoop Flags:
      Reviewed

      Description

      The NameNode went into a full GC while in AsyncLoggerSet.waitForWriteQuorum(). After the GC completes, it checks whether the quorum timeout has been reached. If the GC pause was long enough, the timeout will have expired, and QuorumCall.waitFor() throws TimeoutException. FSEditLog.logSync() then catches the exception and terminates the NameNode.
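
The failure mode described above can be reproduced in a small simulation. The sketch below is not the Hadoop code and not the attached patch: `QuorumWaitSketch`, `Env`, `SimEnv`, `naiveWait`, and `pauseAwareWait` are hypothetical names, and the timeout, poll interval, and pause length are illustrative values. It assumes a wait loop that compares the clock against a fixed deadline, and contrasts it with one possible mitigation that extends the deadline when a single wait overshoots by far more than was requested.

```java
import java.util.concurrent.TimeoutException;

// Hypothetical sketch; none of these names come from Hadoop.
class QuorumWaitSketch {
    /** Stand-in for the environment: a clock, a wait step, and a quorum check. */
    interface Env {
        long nowMs();
        void waitUpTo(long ms);      // may return much later than ms if the JVM pauses
        boolean quorumReached();
    }

    /** Fixed deadline: a long pause inside waitUpTo() consumes the whole budget. */
    static void naiveWait(Env env, long timeoutMs, long pollMs) throws TimeoutException {
        long deadline = env.nowMs() + timeoutMs;
        while (!env.quorumReached()) {
            long remaining = deadline - env.nowMs();
            if (remaining <= 0) {
                throw new TimeoutException("quorum wait timed out");
            }
            env.waitUpTo(Math.min(remaining, pollMs));
        }
    }

    /** One possible mitigation (an assumption): credit back time lost to a pause. */
    static void pauseAwareWait(Env env, long timeoutMs, long pollMs) throws TimeoutException {
        long deadline = env.nowMs() + timeoutMs;
        while (!env.quorumReached()) {
            long before = env.nowMs();
            long remaining = deadline - before;
            if (remaining <= 0) {
                throw new TimeoutException("quorum wait timed out");
            }
            long requested = Math.min(remaining, pollMs);
            env.waitUpTo(requested);
            long overshoot = (env.nowMs() - before) - requested;
            if (overshoot > pollMs) {
                deadline += overshoot;   // the wait ran far too long: treat it as a pause
            }
        }
    }

    /** Simulated environment: the first wait includes a pause; quorum is seen after the second. */
    static class SimEnv implements Env {
        private final long pauseMs;
        private long now = 0;
        private int waits = 0;

        SimEnv(long pauseMs) { this.pauseMs = pauseMs; }

        public long nowMs() { return now; }

        public void waitUpTo(long ms) {
            waits++;
            now += ms + (waits == 1 ? pauseMs : 0);
        }

        public boolean quorumReached() { return waits >= 2; }
    }

    /** Runs one simulated wait (20 s timeout, 1 s poll, 25 s pause); true if it timed out. */
    static boolean timesOut(boolean pauseAware) {
        Env env = new SimEnv(25_000);
        try {
            if (pauseAware) {
                pauseAwareWait(env, 20_000, 1_000);
            } else {
                naiveWait(env, 20_000, 1_000);
            }
            return false;
        } catch (TimeoutException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("naive timed out: " + timesOut(false));       // prints true
        System.out.println("pause-aware timed out: " + timesOut(true));  // prints false
    }
}
```

In the simulation the quorum would have been observed one poll interval after the pause, yet the fixed-deadline loop throws first because the pause alone exceeds the timeout. Whether extending the deadline is the right remedy in practice depends on how long the journal nodes themselves should be allowed to stay silent.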

        Attachments

        1. HDFS-10733.002.patch
          4 kB
          Vinitha Reddy Gankidi
        2. HDFS-10733.001.patch
          4 kB
          Vinitha Reddy Gankidi

              People

              • Assignee:
                redvine Vinitha Reddy Gankidi
                Reporter:
                shv Konstantin Shvachko
              • Votes:
                0
                Watchers:
                15
