Hadoop YARN / YARN-4414

Nodemanager connection errors are retried at multiple levels

Details

    • Reviewed

    Description

      This is related to YARN-3238. We ran into more scenarios where connection errors, such as NoRouteToHostException, are retried at multiple levels. The fix for YARN-3238 was too specific. I think we need a more general solution that catches a wider array of connection errors, so that they are not retried both at the RPC layer and at the NM proxy layer.
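
      To illustrate the general idea (not the attached patch), here is a minimal Java sketch of an outer retry loop that refuses to retry connection-level failures, on the assumption that the lower layer has already retried them. The class ConnectionErrorPolicy and its methods are hypothetical names invented for this example and are not part of Hadoop. The choice of java.net.SocketException and java.net.UnknownHostException as the "connection error" set is also an assumption.

```java
import java.io.IOException;
import java.net.SocketException;
import java.net.UnknownHostException;
import java.util.concurrent.Callable;

/** Hypothetical helper for illustration only; not part of Hadoop. */
final class ConnectionErrorPolicy {

    private ConnectionErrorPolicy() {}

    /** Returns true if any exception in the cause chain is a connection-level failure. */
    static boolean isConnectionError(Throwable t) {
        // Bound the walk so a cyclic cause chain cannot loop forever.
        for (int depth = 0; t != null && depth < 32; depth++, t = t.getCause()) {
            // SocketException is the superclass of ConnectException and
            // NoRouteToHostException, so one check covers both.
            if (t instanceof SocketException || t instanceof UnknownHostException) {
                return true;
            }
        }
        return false;
    }

    /**
     * Outer (proxy-level) retry loop. Other IOExceptions are retried up to
     * maxAttempts times; connection errors are rethrown immediately because
     * this sketch assumes the inner (RPC-level) layer already retried them.
     */
    static <T> T callWithProxyRetries(Callable<T> call, int maxAttempts) throws Exception {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be >= 1");
        }
        IOException last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return call.call();
            } catch (IOException e) {
                if (isConnectionError(e)) {
                    throw e; // do not multiply the lower layer's retries
                }
                last = e;
            }
        }
        throw last;
    }
}
```

      Matching on a common superclass rather than listing individual exception types is what makes this more general than a per-exception fix. The trade-off is that it may also catch socket errors that would have been worth retrying at the proxy level.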

      Attachments

        1. YARN-4414.1.patch
          6 kB
          Chang Li
        2. YARN-4414.1.2.patch
          6 kB
          Chang Li
        3. YARN-4414.1.2.patch
          6 kB
          Chang Li
        4. YARN-4414.1.3.patch
          6 kB
          Chang Li
        5. YARN-4414.2.patch
          6 kB
          Chang Li
        6. YARN-4414.3.patch
          6 kB
          Chang Li

            People

              lichangleo Chang Li
              jlowe Jason Darrell Lowe
              Votes: 0
              Watchers: 12
