Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-7823

SCM HA Phase 2

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • SCM
    • None

    Description

      This jira will track all the Phase 2 work of SCM HA

      • Decommissioning
      • Add a new node to the existing Ring
      • Support SCM HA in MiniOzoneHACluster
      • Documentation
      • Performance Improvements

      Attachments

        Issue Links

          1.
          CLI for SCMs info Sub-task Resolved Unassigned
          2.
          Option to enable Ratis in SCM Sub-task Resolved Nandakumar
          3.
          SCM HA needs handle the generation of clusterID and scmUuid in a decent way. Sub-task Resolved Unassigned
          4.
          SCM CLI command towards certain IP Sub-task Resolved Unassigned
          5.
          SCM HA: Move proxy object creation code to SCMRatisServer Sub-task Open Nandakumar
          6.
          Add unit test for SCMHAInvocationHandler Sub-task Open Nandakumar
          7.
          Update javadoc in SCMHA related classes Sub-task Open Nandakumar
          8.
          Revisit SCM client retry and failover when SCM leader changes Sub-task Resolved Shashikant Banerjee
          9.
          TestSCMStateMachine Sub-task Open Unassigned
          10.
          FailoverProxyProvider of SCM client should support leaderHint. Sub-task Resolved Rui Wang
          11.
          Handle backward compatible when upgrading from non HA to HA Sub-task Open Rui Wang
          12.
          [SCM HA Security] Handle bootstrap of SCM when primary SCM is down Sub-task Open Unassigned
          13.
          Add unit test for container operation in ContainerManagerImpl Sub-task Open Nandakumar
          14.
          Disallow committing to DB by getCurrentBatchOperation() Sub-task Open Unassigned
          15.
          SCMBlockLocationFailoverProxyProvider should handle LeaderNotReadyException Sub-task Open Rui Wang
          16.
          add new scm to SCM HA group Sub-task Resolved Unassigned
          17.
          Add integration test for SequenceIdGen Sub-task Open Unassigned
          18.
          Handle inflight delete/add actions in ReplicationManager properly. Sub-task Open YI-CHEN WANG
          19.
          Design for Error/Exception handling in state update for container/pipeline V2 Sub-task Open Glen Geng
          20.
          Add ratis snapshot retention policy for SCM HA Sub-task Open Shashikant Banerjee
          21.
          In ContainerStateManagerV2, modification of RocksDB should be consistent with that of memory state. Sub-task Open Glen Geng
          22.
          Decommission can be only executed on leader Sub-task Open Rui Wang
          23.
          Merge SCM HA Configuration Sub-task Open Unassigned
          24.
          Support SCM HA in MiniOzoneHACluster Sub-task Resolved Rui Wang
          25.
          Add SCM to Ratis Log Parser Sub-task Open Mukul Kumar Singh
          26.
          Add unit test to prove that datanode can handle term in SCMCommand properly Sub-task Open Unassigned
          27.
          Implement InstallSnapshot for SCM HA Sub-task Open Shashikant Banerjee
          28.
          Add more tests for SCM Failover scenarios Sub-task Open Shashikant Banerjee
          29.
          Refactor out Ratis logic chain Sub-task Open Unassigned
          30.
          Design for SCM HA configuration Sub-task Open Unassigned
          31.
          During bootstrap, always download checkpoint from leader SCM. Sub-task Open Unassigned
          32.
          Use MiniOzoneHAClusterImpl in TestDecommissionAndMaintenance. Sub-task Open Glen Geng
          33.
          Arrange Util classes for SCM HA Sub-task Open Nandakumar
          34.
          Retry policy for SCM requests over ratis Sub-task Open Shashikant Banerjee
          35.
          Handle NotLeaderException with Event Queue Handlers Sub-task Open Unassigned
          36.
          Testing Infrastructure Random Failures Sub-task Open Unassigned
          37.
          [SCM HA Security] Make upgraded cluster to ratis enabled single node cluster Sub-task Open Unassigned
          38.
          Better handle the case that setting a trx that is earlier than latest trx in SCMDBTransactionBuffer Sub-task Open Rui Wang
          39.
          StorageContainerLocationProtocol api should throw SCMException Sub-task Open Unassigned
          40.
          Extract SCM failover proxy provider logic Sub-task Open Unassigned

          Activity

            People

              nanda Nandakumar
              nanda Nandakumar
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated: