Uploaded image for project: 'Geode'
  1. Geode
  2. GEODE-7038

After auto-reconnect a server's multicat communications aren't working correctly

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 1.11.0
    • membership, messaging
    • None

    Description

      This was observed in an server having multicast enabled on a Region.  The server went into a GC pause and was kicked out of the cluster.  After auto-reconnecting all of the servers were requested to shut down and they all hung on destroy-region message responses.  Statistics showed constant multicast retransmission requests but no retransmissions being sent.

      When a Region is configured to use multicast all of its cache operation messages are multicast, including a destroy-region message.

      Some time ago we decided to stop sending Join Request Responses during discovery.  These messages were responsible for carrying the JGroups multicast message digest so that a joining member could install this digest into its multicast protocol.  Today these messages are only sent if a UDP Diffie-Hellman algorithm has been specified.  We need to also ensure that we send these messages if multicast is enabled.

       

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            bschuchardt Bruce J Schuchardt
            bschuchardt Bruce J Schuchardt
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 50m
                50m

                Slack

                  Issue deployment