Uploaded image for project: 'Apache Ozone'
  1. Apache Ozone
  2. HDDS-7825

Difficult to debug why a node is excluded

    XMLWordPrintableJSON

Details

    Description

      When the client adds a node to the ExcludedNodes list due to an exception, the exception is hidden. I've found no logging to help determine why a node is added to the ExcludedNodes list, which makes it nearly impossible to determine if some configuration needs to be changed.

      In my case it turned out I was running into port exhaustion because lots of sockets were being opened/closed quickly, leaving many in TIME_WAIT, and I was on an older kernel with tcp_tw_reuse=0. One thing that helped determine that was the cause was doing a custom build of Ozone where I added logging when a node is excluded to see what exception caused it. Turns out that exception wasn't related to the target node.

      Attachments

        Issue Links

          Activity

            People

              MikaelSmith Michael Smith
              MikaelSmith Michael Smith
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: