|
|
|
HDDS-8950
|
HDDS-7759
NPE in SCMCommonPlacementPolicy#validateContainerPlacement
|
Attila Doroszlai
|
Soumitra Sulav
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8898
|
HDDS-7759
Replication limit should not be less than reconstruction weight
|
Attila Doroszlai
|
Varsha Ravi
|
|
Resolved |
Done
|
|
|
|
|
|
|
|
HDDS-8850
|
HDDS-7759
ReplicationManager: Add metrics for partial replication / reconstruction and cluster limit
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8840
|
HDDS-7759
Add metrics to Container Balancer
|
Siddhant Sangwan
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8838
|
HDDS-7759
Orphan blocks can leave empty container stuck deleting forever
|
Ashish Kumar
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8835
|
HDDS-7759
ReplicationManager: Fix metrics to work with new RM
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8813
|
HDDS-7759
Prepare for dynamic config in ReplicationManager
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8801
|
HDDS-7759
ReplicationManager: Add metric to count how often replication is throttled
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8776
|
HDDS-7759
ReplicationManager: Log overloaded commands at debug rather than info level
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8746
|
HDDS-7759
Add metrics to ReplicationSupervisor for task count and max stream
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8728
|
HDDS-7759
Incorrect expectedNodes passed to InsufficientNodesException
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8712
|
HDDS-7759
Clean up replication code
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Done
|
|
|
|
|
|
|
|
HDDS-8675
|
HDDS-7759
Fix expectation in testUnderRepSentToOverRepHandlerIfNoNewNodes
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8658
|
HDDS-7759
ReplicationManager: Change default command timeout to 10 minutes
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Done
|
|
|
|
|
|
|
|
HDDS-8617
|
HDDS-7759
Ratis underreplication due to maintenance is not deprioritised
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8616
|
HDDS-7759
Underreplication not fixed if all replicas start decommissioning
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8554
|
HDDS-7759
Thread pool size needs to be decreased in different order in ReplicationSupervisor
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8535
|
HDDS-7759
ReplicationManager: Unhealthy containers could block EC recovery in small clusters
|
Siddhant Sangwan
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8532
|
HDDS-7759
Add config for factor of scaling up replication queue/threads in decommissioning nodes
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8525
|
HDDS-7759
Provide more info in assertions
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Done
|
|
|
|
|
|
|
|
HDDS-8505
|
HDDS-7759
ReplicationManager: Add configurable global replication limit
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8504
|
HDDS-7759
ReplicationManager: Pass used and excluded node separately for Under and Mis-Replication
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8494
|
HDDS-7759
Adjust replication queue limits for decommissioning nodes
|
Attila Doroszlai
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8473
|
HDDS-7759
ReplicationManager: Clear ContainerReplicaPendingOps when RM goes to running state
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8471
|
HDDS-7759
Ensure replication processors use a single queue for each iteration
|
Attila Doroszlai
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8466
|
HDDS-7759
Disable LegacyReplicationManager by default
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8459
|
HDDS-7759
Ratis under replication handling in a rack aware environment doesn't work
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8416
|
HDDS-7759
ReplicationManager: RatisUnderReplication handler should not sort sources by BCSID
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8368
|
HDDS-7759
ReplicationManager: Create ContainerReplicaOp with correct target Datanode
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8359
|
HDDS-7759
ReplicationManager: Fix getContainerReplicationHealth() so that it builds ContainerCheckRequest correctly
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8358
|
HDDS-7759
Fix the space usage comparator in ContainerBalancerSelectionCriteria
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8351
|
HDDS-7759
ReplicationManager: Use RM exclude list when getting target nodes for reconstruction
|
Attila Doroszlai
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8348
|
HDDS-7759
ReplicationManager should handle CLOSING containers that are empty
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Duplicate
|
|
|
|
|
|
|
|
HDDS-8347
|
HDDS-7759
Investigate possible race conditions on ContainerInfo in ContainerBalancer
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Done
|
|
|
|
|
|
|
|
HDDS-8346
|
HDDS-7759
ContainerBalancer should move only CLOSED replicas
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8337
|
HDDS-7759
ReplicationManager: MisReplicationHandler should throw an exception if partially successful
|
Attila Doroszlai
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8336
|
HDDS-7759
ReplicationManager: RatisUnderReplicationHandler should partially recover the container if not enough nodes
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8335
|
HDDS-7759
ReplicationManager: EC Mis and Under replication handler should handle overloaded exceptions
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8334
|
HDDS-7759
ReplicationManager: Add nodes to exclude list if they are overloaded
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8333
|
HDDS-7759
ReplicationManager: Allow partial EC reconstruction if insufficient nodes available
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8332
|
HDDS-7759
ReplicationManager: Use EC config scheme to adjust the weighting of reconstruction tasks
|
Unassigned
|
Stephen O'Donnell
|
|
Resolved |
Won't Fix
|
|
|
|
|
|
|
|
HDDS-8309
|
HDDS-7759
ReplicationManager: Basic Throttling of EC Reconstruction commands
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8233
|
HDDS-7759
ReplicationManager: Throttle delete container commands from over replication handlers
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8230
|
HDDS-7759
Let ReplicationManager decide the timeout for commands in Datanodes
|
Stephen O'Donnell
|
Siddhant Sangwan
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8187
|
HDDS-7759
ReplicationManager: Datanode commands should be sent to nodeManager directly
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Done
|
|
|
|
|
|
|
|
HDDS-8179
|
HDDS-7759
Datanode decommissioning blocked due to non-empty replica of deleting container
|
Siddhant Sangwan
|
Varsha Ravi
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8172
|
HDDS-7759
ECUnderReplicationHandler should consider commands already sent when processing the container
|
Stephen O'Donnell
|
Arun Sarin
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8171
|
HDDS-7759
Replicate commands can be sent to dead maintenance modes if the same index is being decommissioned
|
Stephen O'Donnell
|
Jyotirmoy Sinha
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8170
|
HDDS-7759
Let ContainerBalancer consider EC containers for balancing
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8169
|
HDDS-7759
Delay Starting ContainerBalancer after SCM failover
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8168
|
HDDS-7759
Make deadlines inside MoveManager for move commands configurable
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8167
|
HDDS-7759
Inject MoveManager into ContainerBalancer
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8158
|
HDDS-7759
Replication Manager: Make all handlers send commands immediately instead of returning commands
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8153
|
HDDS-7759
Integrate ContainerBalancer with MoveManager
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8111
|
HDDS-7759
ReplicationManager: Add RatisMisReplicationHandler into rm.processUnderReplicatedContainer
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Done
|
|
|
|
|
|
|
|
HDDS-8110
|
HDDS-7759
ReplicationManager: Introduce basic limits on ReplicateContainer commands
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8109
|
HDDS-7759
Consider seperating Ratis and EC MisReplication Handling
|
Attila Doroszlai
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-8075
|
HDDS-7759
ECReconstructionCoordinatorTask.runTask should catch Exception
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8074
|
HDDS-7759
Improve synchronization around command queue updates in Node Manager
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8073
|
HDDS-7759
Replace Usages of LegacyReplicationManager.MoveResult with MoveManager.MoveResult
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8034
|
HDDS-7759
Check container replication health before scheduling move in MoveManager
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-8025
|
HDDS-7759
ReplicationManager: Count a container once for missing, under, mis or over replicated
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Done
|
|
|
|
|
|
|
|
HDDS-8008
|
HDDS-7759
Move pendingOps into ContainerStateManagerImpl to ensure consistent state
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7998
|
HDDS-7759
Synchronize on containerInfo in ReplicationManager and MoveManager
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-7989
|
HDDS-7759
UnhealthyReplicationProcessor retries failure without delay
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7980
|
HDDS-7759
Force Close QUASI_CLOSED replicas of CLOSED containers in RM
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-7973
|
HDDS-7759
Let RatisMisReplicationHandler use the new RatisContainerReplicaCount constructor
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Done
|
|
|
|
|
|
|
|
HDDS-7962
|
HDDS-7759
Add configuration flag to enable LegacyReplicationManager for RATIS containers
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7915
|
HDDS-7759
Force close QUASI_CLOSED replicas only when the container is CLOSED in Legacy RM
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7882
|
HDDS-7759
Delete empty containers that are stuck in CLOSING state
|
Nandakumar
|
Nandakumar
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7867
|
HDDS-7759
Clean up replication logs
|
Attila Doroszlai
|
Attila Doroszlai
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-7847
|
HDDS-7759
Handle Replication of Unhealthy Replicas in RM
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7818
|
HDDS-7759
Modify Ratis Replication Handling in the new RM
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7813
|
HDDS-7759
Handle Mismatched Replicas (OPEN or CLOSING) of QUASI-CLOSED containers in RM
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7808
|
HDDS-7759
Intermittent failure in TestReplicationManager#testUnderReplicationQueuePopulated
|
Attila Doroszlai
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7804
|
HDDS-7759
UNHEALTHY replicas will not contribute to sufficient replication in RatisContainerReplicaCount
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-7788
|
HDDS-7759
Ratis OverReplicationHandler should exclude stale replicas
|
Stephen O'Donnell
|
Stephen O'Donnell
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7785
|
HDDS-7759
Improve Handling of Unhealthy Container Replicas in the new RM
|
Siddhant Sangwan
|
Siddhant Sangwan
|
|
Resolved |
Fixed
|
|
|
|
|
|
|
|
HDDS-7190
|
HDDS-7759
EC: ReplicationManager - consider deprecating maintenance.replica.minimum
|
Attila Doroszlai
|
Stephen O'Donnell
|
|
Resolved |
Won't Fix
|
|
|
|
|
|
|
|
HDDS-6572
|
HDDS-7759
EC: ReplicationManager - add move manager for container move
|
Stephen O'Donnell
|
Jie Yao
|
|
Resolved |
Implemented
|
|
|
|
|
|
|
|
HDDS-6447
|
HDDS-7759
Refine SCM handling of unhealthy container replicas
|
Ethan Rose
|
Hanisha Koneru
|
|
Resolved |
Fixed
|
|
|
|
|