Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-10702

Ozone Recon - Improve Recon startup failure handling and make it more resilient

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 2.0.0
    • None

    Description

      Recon can fail to start due to multiple reasons similar to other Ozone components, however Recon should recover from Runtime or unexpected failures during startup and provide a meaningful information on Recon UI :

      • Related to failure of registering of datanodes and invalid network topology during Recon startup. When Recon starts up, it tries to loadExisting nodes and during loading of nodes, cluster topology get loaded which calls add of node during register of node event and if any discrepancy or invalid network topology, Recon startup fails.
      • Related to initialization of pipelines.

      Attachments

        Issue Links

          Activity

            People

              deveshsingh Devesh Kumar Singh
              deveshsingh Devesh Kumar Singh
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: