Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-3216

[HOD] Handle Torque error codes related to security / credential errors

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Won't Fix
    • 0.16.2
    • None
    • contrib/hod
    • None

    Description

      There a bunch of credential / security related errors that come from Torque server, possibly under high load. HOD already handles one of this code specially, by retrying a bunch of times and giving up. We should probably do the same for other such errors. One of the frequently occuring one is error code 159. Other ones which Rajiv identified are:

      PBSE_IVALREQ
      PBSE_TOOMANY
      PBSE_UNKREQ
      PBSE_PERM
      PBSE_SYSTEM
      PBSE_INTERNAL
      PBSE_BADSTATE
      PBSE_BADCRED
      PBSE_EXPIRED
      PBSE_BADUSER
      PBSE_QUEBUSY
      PBSE_NOCONNECTS
      PBSE_ROUTEREJ
      PBSE_RESCUNAV
      PBSE_BADGRP
      PBSE_BADACLHOST

      Attachments

        Activity

          People

            Unassigned Unassigned
            yhemanth Hemanth Yamijala
            Votes:
            0 Vote for this issue
            Watchers:
            0 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: