Details
-
Bug
-
Status: Resolved
-
Major
-
Resolution: Won't Fix
-
1.6.2
-
None
-
None
-
None
-
very large cluster, accumulo 1.6.2, hadoop 2.5.0 (cdh 5.3)
Description
Had a hardware failure on a single node within a large cluster. Tablets were migrated away, but one tablet would not recover. The Closer run by the master to release the write lease on the WAL failed repeatedly.
Afterwards, it was determined the file was small, probably just opened and used at the moment the machine failed. The block could not be recovered from any replicas.
One question raised: does the write pipeline acknowledge the sync, before the write pipeline completes?
Attachments
Issue Links
- is related to
-
ACCUMULO-4004 open WALs prevent DN decommissioning
- Resolved
- relates to
-
ACCUMULO-4004 open WALs prevent DN decommissioning
- Resolved
- links to