Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.0.2-alpha
    • Component/s: namenode
    • Labels:
      None
    • Hadoop Flags:
      Reviewed

      Description

      Sometimes, when the primary NameNode crashes during an image transfer, the Secondary NameNode hangs forever trying to read the image from the HTTP connection.
      It would be great to set timeouts on the connection so that, if this happens, there is no need to restart the Secondary itself.
      In our case, component restarts are handled by a set of scripts. Because the Secondary process is still running, it simply stays hung until we get an alarm saying that checkpointing is not happening.
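
The request above amounts to bounding both the connect and the read phase of the HTTP transfer. A minimal sketch of that idea follows, using only the standard `java.net.HttpURLConnection` API. The class name `TimedImageFetch`, the method `openWithTimeout`, and the URL and timeout in the usage note are hypothetical illustrations; this is not the code from the attached HDFS-1490.patch.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.HttpURLConnection;
import java.net.URI;

// Hypothetical helper for illustration only; names are assumptions,
// not identifiers taken from the Hadoop code base.
class TimedImageFetch {

    // Prepares (but does not yet open) an HTTP connection whose connect
    // and read phases are both bounded by timeoutMs milliseconds.
    static HttpURLConnection openWithTimeout(String url, int timeoutMs) {
        try {
            HttpURLConnection conn =
                (HttpURLConnection) URI.create(url).toURL().openConnection();
            // Limit how long establishing the TCP connection may take.
            conn.setConnectTimeout(timeoutMs);
            // Limit how long a single read may block waiting for data,
            // so a crashed peer cannot stall the reader indefinitely.
            conn.setReadTimeout(timeoutMs);
            return conn;
        } catch (IOException e) {
            // Covers MalformedURLException as well as openConnection failures.
            throw new UncheckedIOException(e);
        }
    }
}
```

With a connection prepared as `TimedImageFetch.openWithTimeout("http://localhost:1/getimage", 60000)`, a read that stalls longer than the timeout raises `java.net.SocketTimeoutException` instead of blocking forever, so the caller can abandon the transfer and retry at the next checkpoint without a process restart.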

        Attachments

        1. HDFS-1490.patch
          4 kB
          Vinayakumar B
        2. HDFS-1490.patch
          5 kB
          Vinayakumar B
        3. HDFS-1490.patch
          6 kB
          Vinayakumar B
        4. HDFS-1490.patch
          6 kB
          Vinayakumar B

              People

              • Assignee:
                vinayrpet Vinayakumar B
                Reporter:
                dms Dmytro Molkov
              • Votes:
                0
                Watchers:
                14