  1. Beam
  2. BEAM-2488

ElasticsearchIO should also read from replica shards

Details

    • Type: Bug
    • Status: Resolved
    • Priority: P2
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.1.0
    • Component/s: io-java-elasticsearch
    • Labels: None

    Description

      To avoid duplicating data, ElasticsearchIO reads from primary shards only and filters out replica shards. In practice, even when the scroll request sets a shard preference (preference=_shards:shardId), Elasticsearch internally load-balances requests between the primary and replica copies of that shard and ensures that no duplicates are returned. Targeting all shards and letting Elasticsearch deal with the replicas is better in some corner cases, such as failover.
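      To make the shard preference concrete, here is a minimal sketch of how the query string of an initial scroll search could be assembled with and without it. The class ScrollRequestSketch, the helper searchEndpoint, the index and type names, and the 5m keep-alive are hypothetical and chosen for illustration only; this is not the ElasticsearchIO implementation. The only Elasticsearch features assumed are the scroll and preference query parameters of the search endpoint.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.stream.Collectors;

public class ScrollRequestSketch {

  /**
   * Builds the path and query string of an initial scroll search.
   * Hypothetical helper: a null shardId means "no shard preference".
   */
  static String searchEndpoint(String index, String type, Integer shardId) {
    // LinkedHashMap keeps the parameters in insertion order.
    Map<String, String> params = new LinkedHashMap<>();
    params.put("scroll", "5m"); // keep-alive of the scroll context (assumed value)
    if (shardId != null) {
      // Restrict the search to one shard; Elasticsearch still chooses
      // which copy of that shard (primary or replica) serves the request.
      params.put("preference", "_shards:" + shardId);
    }
    String query = params.entrySet().stream()
        .map(e -> e.getKey() + "=" + e.getValue())
        .collect(Collectors.joining("&"));
    return "/" + index + "/" + type + "/_search?" + query;
  }

  public static void main(String[] args) {
    // Search restricted to shard 2.
    System.out.println(searchEndpoint("my-index", "my-type", 2));
    // Search with no shard preference.
    System.out.println(searchEndpoint("my-index", "my-type", null));
  }
}
```

      In this sketch a source that reads one shard at a time passes the shard id, and nothing in the request distinguishes primary from replica copies, which is the point made in the description above.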

            People

              Assignee: echauchot Etienne Chauchot
              Reporter: echauchot Etienne Chauchot
              Votes: 0
              Watchers: 2
