Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-27891

Long running spark jobs fail because of HDFS delegation token expires

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Critical
    • Resolution: Cannot Reproduce
    • 2.0.1, 2.1.0, 2.3.1, 2.4.1
    • None
    • Security
    • None

    Description

      When the spark job runs on a secured cluster for longer then time that is mentioned in the dfs.namenode.delegation.token.renew-interval property of hdfs-site.xml the spark job fails. ** 

      Following command was used to submit the spark job

      bin/spark-submit --principal acekrbuser --keytab ~/keytabs/acekrbuser.keytab --master yarn --deploy-mode cluster examples/src/main/python/wordcount.py /tmp/ff1.txt

       

      Application Logs attached

       

      Attachments

        1. spark_2.3.1_failure.log
          524 kB
          hemshankar sahu
        2. application_1559242207407_0001.log
          534 kB
          hemshankar sahu

        Issue Links

          Activity

            People

              Unassigned Unassigned
              hemshankar_sahu hemshankar sahu
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: