Uploaded image for project: 'HBase'
  1. HBase
  2. HBASE-4177

Handling read failures during recovery‏ - when HMaster calls Namenode recovery, recovery may be a failure leading to read failure while splitting logs

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.95.1
    • Component/s: master
    • Labels:
      None

      Description

      As per the mailing thread with the heading
      'Handling read failures during recovery‏' we found this problem.
      As part of split Logs the HMaster calls Namenode recovery. The recovery is an asynchronous process.
      In HDFS
      =======
      Even though client is getting the updated block info from Namenode on first
      read failure, client is discarding the new info and using the old info only
      to retrieve the data from datanode. So, all the read
      retries are failing. [Method parameter reassignment - Not reflected in
      caller].
      In HBASE
      =======
      In HMaster code we tend to wait for 1sec. But if the recovery had some failure then split log may not happen and may lead to dataloss.
      So may be we need to decide upon the actual delay that needs to be introduced once Hmaster calls NN recovery.

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                ram_krish ramkrishna.s.vasudevan
                Reporter:
                ram_krish ramkrishna.s.vasudevan
              • Votes:
                0 Vote for this issue
                Watchers:
                10 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved: