Uploaded image for project: 'Samza'
  1. Samza
  2. SAMZA-2633

Rolling deployment/upgrade causes downtime for processors for the entire deployment window

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 1.7
    • None
    • None

    Description

      Problem:

      At LinkedIn, we noticed several standalone users complained about lag/downtime during rolling deployments/upgrades.

      Description:

      During rolling upgrades, the current debounce timer gets extended every time when there is a quorum change notification. As a result, processors that were upgraded earlier in the deployment window remain unavailable waiting for work assignment. In some scenarios, this cause processors to be unavailable for 20 minutes or so depending on the size of the quorum and the debounce time configuration.

      Impact:

      Partitions that were stopped for initial processors as part of upgrade remain unassigned for the entire deployment window which can result in processing lag.

      Attachments

        Issue Links

          Activity

            People

              bharathkk Bharath Kumarasubramanian
              bharathkk Bharath Kumarasubramanian
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 1.5h
                  1.5h