Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-3223

runAsSparkUser cannot change HDFS write permission properly in mesos cluster mode

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Critical
    • Resolution: Fixed
    • Affects Version/s: 1.0.2
    • Fix Version/s: 1.1.1, 1.2.0
    • Component/s: Input/Output, Mesos
    • Labels:
      None

      Description

      While running mesos with --no-switch_user option, HDFS account name is different from driver and executor. It makes a permission error at last stage. Executor's id is mesos' user id and driver's id is who runs spark-submit. So, moving output from _temporary/path/to/output/part-xxxx to /output/path/part-xxxx fails because of permission error. The solution for this is only setting SPARK_USER to HADOOP_USER_NAME when MesosExecutorBackend calls runAsSparkUser. HADOOP_USER_NAME is used when FileSystem get user.

        Attachments

          Activity

            People

            • Assignee:
              jongyoul Jongyoul Lee
              Reporter:
              jongyoul Jongyoul Lee
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: