  1. Accumulo
  2. ACCUMULO-509

default walog copy/sort uses replication of 1

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 1.4.1, 1.5.0
    • Component/s: logger
    • Labels: None
    • Environment: medium size cluster

    Description

      During recovery, the logger copied/sorted a recovery walog to HDFS. The copy was OK, but there was a checksum error when replaying the data. The system did not recover without manual intervention. The work-around was to find the datanode serving the bad block and stop it. Then I removed the bad recovery file and restarted the master. The copy/sort took place again and used a different datanode. Recovery then proceeded successfully.

      We need to use a higher replication and/or a more sophisticated approach to verifying and restarting recoveries.
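
      The "verify and restart" idea could be sketched roughly as below. This is only an illustration on a local filesystem, not Accumulo or HDFS code: the names `crc32_of` and `copy_with_verification`, the `max_attempts` parameter, and the use of CRC32 are all assumptions made for the example. It automates the manual work-around described above: re-read the copied file, compare its checksum to the source, and discard and redo the copy on a mismatch.

```python
import os
import shutil
import zlib


def crc32_of(path, chunk_size=1 << 20):
    """Return the CRC32 of a file, reading it in chunks."""
    crc = 0
    with open(path, "rb") as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            crc = zlib.crc32(chunk, crc)
    return crc & 0xFFFFFFFF


def copy_with_verification(src, dst, max_attempts=3, copy=shutil.copyfile):
    """Copy src to dst, re-read dst to verify it, and retry on a mismatch.

    Hypothetical helper: `copy` stands in for whatever performs the
    copy/sort. Returns the number of attempts used, or raises IOError
    if no attempt yields a copy whose checksum matches the source.
    """
    expected = crc32_of(src)
    for attempt in range(1, max_attempts + 1):
        copy(src, dst)
        if crc32_of(dst) == expected:
            return attempt
        # Discard the bad copy so the next attempt starts clean.
        os.remove(dst)
    raise IOError(
        "no verified copy of %s after %d attempts" % (src, max_attempts)
    )
```

      A retry like this only helps if the second attempt can land on different storage, which is why it complements a higher replication factor and does not replace it.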

          People

            Assignee: Keith Turner (kturner)
            Reporter: Eric C. Newton (ecn)