Uploaded image for project: 'Geode'
  1. Geode
  2. GEODE-4802

Geode cluster hung after network problems

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • None
    • None

    Description

      Test preparation:

      1. create file bin/server1/gemfire.properties with property membership-port-range=2025-2030
      2. create file bin/server2/gemfire.propertieswith property membership-port-range=2035-2040
      3. Download network problems emulator https://jagt.github.io/clumsy
      4. Fill field 'filtering' in Clumsy: tcp and (tcp.DstPort == 2025 or tcp.DstPort == 2026 or tcp.DstPort == 2027 or tcp.DstPort == 2028 or tcp.DstPort == 2029 or tcp.DstPort == 2030). Select function 'Drop' and set Chance=100%. See clumsy2.jpg

      Steps to reproduce

      1. Start gfsh
      2. start locator --name=locator1
      3. start server --name=server1 --server-port=40411
      4. start server --name=server2 --server-port=40412
      5. create region --name=regionA --type=REPLICATE
      6. put --region=regionA --key="1" --value="one"
      7. Click on 'start' button in Clumsy
      8. put --region=regionA --key="1" --value="onev2"
      9. Wait 15s and click on 'stop' in Clumsy

      Gfsh console has hung.

      bin\server1\server1.log:

      [warning 2018/03/07 18:02:50.360 PST server1 <Function Execution Processor1> tid=0x4b] 15 seconds have elapsed while waiting for replies: <DistributedCacheOperation$CacheOperationReplyProcessor 22 waiting for 1 replies from [192.168.100.109(server2:12804)<v2>:2035]> on 192.168.100.109(server1:14416)<v1>:2045 whose current membership list is: [[192.168.100.109(server2:12804)<v2>:2035, 192.168.100.109(locator1:15628:locator)<ec><v0>:1024, 192.168.100.109(server1:14416)<v1>:2045]]

      Pulse has shown 'normal' status for both servers.

      Gfsh works again if server1 process was killed.

      Also  i've reproduced another issue with the same scenario on my test environment(see threaddump.log)

       

      Attachments

        1. threaddump.log
          3 kB
          Eugene Nedzvetsky
        2. clumsy2.jpg
          102 kB
          Eugene Nedzvetsky

        Activity

          People

            Unassigned Unassigned
            eugenex9 Eugene Nedzvetsky
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated: