Uploaded image for project: 'Cassandra'
  1. Cassandra
  2. CASSANDRA-5692

Race condition in detecting version on a mixed 1.1/1.2 cluster

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Low
    • Resolution: Fixed
    • 1.2.7, 2.0 beta 1
    • None
    • None
    • Low

    Description

      On a mixed 1.1 / 1.2 cluster, starting 1.2 nodes fires sometimes a race condition in version detection, where the 1.2 node wrongly detects version 6 for a 1.1 node.

      It works as follows:
      1) The just started 1.2 node quickly opens an OutboundTcpConnection toward a 1.1 node before receiving any messages from the latter.
      2) Given the version is correctly detected only when the first message is received, the version is momentarily set at 6.
      3) This opens an OutboundTcpConnection from 1.2 to 1.1 at version 6, which gets stuck in the connect() method.

      Later, the version is correctly fixed, but all outbound connections from 1.2 to 1.1 are stuck at this point.

      Evidence from 1.2 logs:
      TRACE 13:48:31,133 Assuming current protocol version for /127.0.0.2
      DEBUG 13:48:37,837 Setting version 5 for /127.0.0.2

      Attachments

        1. 5692-0006.patch
          5 kB
          Sergio Bossa
        2. 5692-0005.patch
          5 kB
          Sergio Bossa

        Activity

          People

            sbtourist Sergio Bossa
            sbtourist Sergio Bossa
            Sergio Bossa
            Jason Brown
            Votes:
            0 Vote for this issue
            Watchers:
            6 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: