Uploaded image for project: 'Hadoop HDFS'
  1. Hadoop HDFS
  2. HDFS-1490

TransferFSImage should timeout

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Fixed
    • None
    • 2.0.2-alpha
    • namenode
    • None
    • Reviewed

    Description

      Sometimes when primary crashes during image transfer secondary namenode would hang trying to read the image from HTTP connection forever.
      It would be great to set timeouts on the connection so if something like that happens there is no need to restart the secondary itself.
      In our case restarting components is handled by the set of scripts and since the Secondary as the process is running it would just stay hung until we get an alarm saying the checkpointing doesn't happen.

      Attachments

        1. HDFS-1490.patch
          4 kB
          Vinayakumar B
        2. HDFS-1490.patch
          5 kB
          Vinayakumar B
        3. HDFS-1490.patch
          6 kB
          Vinayakumar B
        4. HDFS-1490.patch
          6 kB
          Vinayakumar B

        Issue Links

          Activity

            People

              vinayakumarb Vinayakumar B
              dms Dmytro Molkov
              Votes:
              0 Vote for this issue
              Watchers:
              13 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: