Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-7454

Common side of High Availability Framework (HDFS-1623)

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 2.0.0-alpha
    • None
    • None
    • Reviewed

    Description

      There will likely need to be a few changes to Hadoop Common (e.g. HDFS-7380) to complete HDFS-1623 (High Availability Framework for HDFS NN). This JIRA is an umbrella for those Common changes.

      Attachments

        Issue Links

        1.
        Add client failover functionality to o.a.h.io.(ipc|retry) Sub-task Closed Aaron Myers Actions
        2.
        Introduce HA Service Protocol Interface Sub-task Resolved Suresh Srinivas Actions
        3.
        HA: Administrative CLI to control HA daemons Sub-task Resolved Todd Lipcon Actions
        4.
        HA: if both NNs are in Standby mode, client needs to try failing back and forth several times with sleeps Sub-task Resolved Aaron Myers Actions
        5.
        StandbyException should extend IOException Sub-task Resolved Todd Lipcon Actions
        6.
        HA: Improve some logging for client IPC failovers and StandbyExceptions Sub-task Resolved Todd Lipcon Actions
        7.
        
FailoverController for client-based configuration Sub-task Resolved Eli Collins Actions
        8.
        Add interface and update CLI to query current state to HAServiceProtocol Sub-task Resolved Eli Collins Actions
        9.
        HA: Client failover policy is incorrectly trying to fail over all IOExceptions Sub-task Resolved Aaron Myers Actions
        10.
        HA : Make client connection retries on socket time outs configurable. Sub-task Resolved Uma Maheswara Rao G Actions
        11.
        HA: the FailoverController should optionally fence the active during failover Sub-task Resolved Eli Collins Actions
        12.
        HA: failover should error out if either host is not specified in the configs Sub-task Resolved Eli Collins Actions
        13.
        HA: fencing mechanism Sub-task Resolved Todd Lipcon Actions
        14.
        All HAServiceProtocol methods have to throw IOException since they are invoked over the network Sub-task Resolved Hari Mankude Actions
        15.
        HA: failover should be able to pass args to fencers Sub-task Resolved Eli Collins Actions
        16.
        HA: the FailoverController should check the standby is ready before failing over Sub-task Resolved Eli Collins Actions
        17.
        HA: Add 'ipc.client.connect.max.retries.on.timeouts' entry in core-default.xml file. Sub-task Resolved Uma Maheswara Rao G Actions
        18.
        HA: log a warning when a failover is first attempted Sub-task Resolved Todd Lipcon Actions
        19.
        HA: void methods can swallow exceptions when going through failover path Sub-task Resolved Todd Lipcon Actions
        20.
        HA: RetriableCommand is using RetryPolicy incorrectly after HADOOP-7896 Sub-task Resolved Aaron Myers Actions
        21.
        HA: HAServiceProtocol exceptions need to be unwrapped before it can be used Sub-task Resolved Hari Mankude Actions
        22.
        Add ZK client for leader election Sub-task Resolved Bikas Saha Actions

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            atm Aaron Myers
            Votes:
            0 Vote for this issue
            Watchers:
            8 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment