Details
Type: Bug
Status: Closed
Priority: Major
Resolution: Fixed
Description
While digging into a RestartWhileUpdatingTest failure, I saw that the replica terms were messed up.
[junit4] 1> /collections/collection1/terms/shard1 (0)
[junit4] 1> DATA:
[junit4] 1> {
[junit4] 1> "core_node24_recovering"}
By design, the terms of core_node24_recovering and core_node24 should always be equal to each other. The cause here is that ZkShardTerms.ensureTermsIsHigher also increases core_node24_recovering to a higher number. This leads to a case where a replica has finished recovering but is unable to become active.
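
To make the failure mode concrete, here is a minimal sketch of how the two entries could drift apart. It is a simplified model, not the actual ZkShardTerms code: the class TermsSketch, the methods naiveIncrease and guardedIncrease, the leader name core_node1, and the term values are all hypothetical. It assumes the term increase bumps every entry at the leader's term unless its key is in the set of replicas needing recovery. The guarded variant shows one possible way to keep the entries in sync and is not necessarily the committed fix.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Set;

// Hypothetical, simplified model of shard terms; not the Solr implementation.
public class TermsSketch {
    static final String RECOVERING_SUFFIX = "_recovering";

    // Assumed buggy behavior: every entry at the leader's term is bumped,
    // unless its key is literally in needRecovery. The "_recovering" marker
    // key is not in that set, so it gets bumped as well.
    static Map<String, Long> naiveIncrease(Map<String, Long> terms, String leader,
                                           Set<String> needRecovery) {
        Map<String, Long> out = new HashMap<>(terms);
        long leaderTerm = terms.get(leader);
        for (Map.Entry<String, Long> e : terms.entrySet()) {
            if (e.getValue() == leaderTerm && !needRecovery.contains(e.getKey())) {
                out.put(e.getKey(), leaderTerm + 1);
            }
        }
        return out;
    }

    // One possible guard: also skip the "_recovering" marker of any replica
    // that is in needRecovery, so the marker and the replica term stay equal.
    static Map<String, Long> guardedIncrease(Map<String, Long> terms, String leader,
                                             Set<String> needRecovery) {
        Map<String, Long> out = new HashMap<>(terms);
        long leaderTerm = terms.get(leader);
        for (Map.Entry<String, Long> e : terms.entrySet()) {
            String key = e.getKey();
            String base = key.endsWith(RECOVERING_SUFFIX)
                    ? key.substring(0, key.length() - RECOVERING_SUFFIX.length())
                    : key;
            if (e.getValue() == leaderTerm && !needRecovery.contains(base)) {
                out.put(key, leaderTerm + 1);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, Long> terms = new HashMap<>();
        terms.put("core_node1", 1L);             // hypothetical leader
        terms.put("core_node24", 1L);            // replica currently recovering
        terms.put("core_node24_recovering", 1L); // its recovering marker

        Set<String> needRecovery = Set.of("core_node24");
        System.out.println("naive:   " + naiveIncrease(terms, "core_node1", needRecovery));
        System.out.println("guarded: " + guardedIncrease(terms, "core_node1", needRecovery));
    }
}
```

In the naive variant the marker ends up one term above core_node24, which is the mismatch described above. In the guarded variant both stay at the same term.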