Uploaded image for project: 'REEF (Retired)'
  1. REEF (Retired)
  2. REEF-1837

IsParentAliveCancellationDetector logs lack of HTTP endpoint excessively

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Minor
    • Resolution: Invalid
    • None
    • None
    • REEF.NET Driver
    • None

    Description

      In IsParentAliveCancellationDetector, we generate a log line each time an upstream HTTP server cannot be found, e.g.:

      INFO: Using Yarn Configuration to retrieve RM address.
      Microsoft.MachineLearning.Distributed.Core.Common.IsParentAliveCancellationDetector Error: 0 : 2017-07-23T18:00:28.6024001-07:00 0004
      ERROR: Failed to initialize yarn configuration
      Encountered error [System.ApplicationException: Unable to find RM Webapp Address from yarn-site.xml.
         at Org.Apache.REEF.Utilities.Runtime.Yarn.YarnConfiguration.GetYarnRMWebappEndpoints()
         at Microsoft.MachineLearning.Distributed.Core.Common.IsParentAliveCancellationDetector.InitYarnConfiguration(String rmAddresses)]
      

      This leads to excessively bloated logs. We should reduce the amount of logging here, e.g. by assuming that the state of the YARN deployment doesn't change during execution of one job.

      Attachments

        Activity

          People

            Unassigned Unassigned
            markus.weimer Markus Weimer
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: