Uploaded image for project: 'Kafka'
  1. Kafka
  2. KAFKA-4124

Handle disk failures gracefully

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • None
    • None
    • None
    • None

    Description

      Currently when a disk goes down, the broker also goes down with it. This causes too much reshuffle of data over the network to replace the broker. Make the broker resilient to disk failure.

      The broker can detect a disk failure, mark it bad and then re-replicate the under replicated data in all other available disks in the node. If the bad disk is replaced with new one, the broker can rebalance the data among all other disks it has. The broker can also tolerate upto n disk failures.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            slukog Gokul
            Votes:
            1 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment