Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1623 High Availability Framework for HDFS NN
  3. HDFS-2737

HA: Automatically trigger log rolls periodically on the active NN

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • HA branch (HDFS-1623)
    • HA branch (HDFS-1623)
    • ha, namenode
    • None

    Description

      Currently, the edit log tailing process can only read finalized log segments. So, if the active NN is not rolling its logs periodically, the SBN will lag a lot. This also causes many datanode messages to be queued up in the PendingDatanodeMessage structure.

      To combat this, the active NN needs to roll its logs periodically – perhaps based on a time threshold, or perhaps based on a number of transactions. I'm not sure yet whether it's better to have the NN roll on its own or to have the SBN ask the active NN to roll its logs.

      Attachments

        1. HDFS-2737-HDFS-1623.patch
          38 kB
          Aaron Myers
        2. HDFS-2737-HDFS-1623.patch
          38 kB
          Aaron Myers
        3. HDFS-2737-HDFS-1623.patch
          38 kB
          Aaron Myers
        4. HDFS-2737-HDFS-1623.patch
          37 kB
          Aaron Myers
        5. hdfs-2737-prelim.txt
          35 kB
          Todd Lipcon

        Issue Links

          Activity

            People

              atm Aaron Myers
              tlipcon Todd Lipcon
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: