Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-1652 Fixes and improvements for spark-submit/configs
  3. SPARK-1921

Allow duplicate jar files among the app jar and secondary jars in yarn-cluster mode

    XMLWordPrintableJSON

Details

    • Sub-task
    • Status: Resolved
    • Minor
    • Resolution: Incomplete
    • 1.0.0
    • None
    • Deploy

    Description

      In yarn-cluster mode, jars are uploaded to a staging folder on hdfs. If there are duplicates among the app jar and secondary jars, there will be overwrites that cause inconsistent timestamps. I saw the following message:

      Application application_1400965808642_0021 failed 2 times due to AM Container for appattempt_1400965808642_0021_000002 exited with  exitCode: -1000 due to: Resource hdfs://localhost.localdomain:8020/user/cloudera/.sparkStaging/application_1400965808642_0021/app_2.10-0.1.jar changed on src filesystem (expected 1400998721965, was 1400998723123
      

      Tested on a CDH-5 quickstart VM.

      Attachments

        Activity

          People

            Unassigned Unassigned
            mengxr Xiangrui Meng
            Votes:
            4 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: