KAFKA-1106

HighwaterMarkCheckpoint failure putting broker into a bad state

    Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 0.8.0
    • Fix Version/s: None
    • Component/s: core
    • Labels:
      None

      Description

      I'm encountering a case where a broker gets stuck because HighwaterMarkCheckpoint fails to recover after reading what appear to be corrupted ISR entries. Once the broker is in this state, leader election can never succeed, which stalls the entire cluster.

      Please see the detailed stack trace in the attached log. Perhaps failing fast when HighwaterMarkCheckpoint fails to read would force the broker to restart and recover.
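
      To illustrate the fail-fast idea, here is a minimal sketch in Java (not Kafka's Scala, and not the code from the attached patch). The class name CheckpointSketch, the exception CorruptCheckpointException, and the file layout (a version line, an entry-count line, then one "topic partition offset" entry per line) are all assumptions made for illustration. The point is only that every parse problem surfaces as one fatal error instead of leaving the broker running with unreadable state.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: NOT Kafka's actual HighwaterMarkCheckpoint code.
final class CheckpointSketch {

    // Hypothetical exception signalling that the checkpoint cannot be trusted.
    static final class CorruptCheckpointException extends RuntimeException {
        CorruptCheckpointException(String message, Throwable cause) {
            super(message, cause);
        }
    }

    // Assumed layout: line 1 = version, line 2 = entry count,
    // then one "topic partition offset" entry per line.
    static Map<String, Long> parse(List<String> lines) {
        try {
            int version = Integer.parseInt(lines.get(0).trim());
            if (version != 0) {
                throw new IllegalStateException("unsupported version " + version);
            }
            int expected = Integer.parseInt(lines.get(1).trim());
            Map<String, Long> offsets = new LinkedHashMap<>();
            for (String line : lines.subList(2, lines.size())) {
                if (line.trim().isEmpty()) {
                    continue; // tolerate blank lines
                }
                String[] parts = line.trim().split("\\s+");
                if (parts.length != 3) {
                    throw new IllegalStateException("malformed entry: " + line);
                }
                int partition = Integer.parseInt(parts[1]);
                long offset = Long.parseLong(parts[2]);
                offsets.put(parts[0] + "-" + partition, offset);
            }
            if (offsets.size() != expected) {
                throw new IllegalStateException(
                    "expected " + expected + " entries, found " + offsets.size());
            }
            return offsets;
        } catch (RuntimeException e) {
            // Fail fast: wrap every parse problem (missing lines, bad numbers,
            // wrong entry count) in a single fatal error type.
            throw new CorruptCheckpointException("corrupt high-watermark checkpoint", e);
        }
    }
}
```

      A caller could let CorruptCheckpointException propagate to the top level and exit the process, so that a supervisor restarts the broker rather than leaving it stuck. Whether a restart alone is enough to recover depends on how the checkpoint file is rebuilt, which this sketch does not address.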

      1. KAFKA-1106-patch
        4 kB
        David Lao
      2. kafka.log
        7 kB
        David Lao

        Activity

        David Lao created issue -
        David Lao made changes -
        Field Original Value New Value
        Attachment kafka.log [ 12610737 ]
        David Lao made changes -
        Attachment KAFKA-1106-patch [ 12610741 ]

          People

          • Assignee:
            Unassigned
            Reporter:
            David Lao
          • Votes:
            0
            Watchers:
            2

            Dates

            • Created:
              Updated:
