Uploaded image for project: 'Accumulo'
  1. Accumulo
  2. ACCUMULO-4000

log recovery failed after hard reset

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • 1.6.2
    • None
    • None
    • None
    • very large cluster, accumulo 1.6.2, hadoop 2.5.0 (cdh 5.3)

    Description

      Had a hardware failure on a single node within a large cluster. Tablets were migrated away, but one tablet would not recover. The Closer run by the master to release the write lease on the WAL failed repeatedly.

      Afterwards, it was determined the file was small, probably just opened and used at the moment the machine failed. The block could not be recovered from any replicas.

      One question raised: does the write pipeline acknowledge the sync, before the write pipeline completes?

      Attachments

        Issue Links

          Activity

            People

              ecn Eric C. Newton
              ecn Eric C. Newton
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:

                Time Tracking

                  Estimated:
                  Original Estimate - Not Specified
                  Not Specified
                  Remaining:
                  Remaining Estimate - 0h
                  0h
                  Logged:
                  Time Spent - 10m
                  10m