Details
-
Bug
-
Status: Resolved
-
P2
-
Resolution: Fixed
-
None
-
None
Description
To avoid duplication of data ElasticsearchIO reads from primary shards only and filters out replica shards. But in reality, even if _shard-preference:shardId is set in scroll request, ES internally load balances requests between primary and replica shards and ensures that there will be no duplicates. Targeting all the shards and letting ES deal with replicas is better in some corner cases like failover.
Attachments
Issue Links
- links to