Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-6869

Add pyspark archives path to PYTHONPATH

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Blocker
    • Resolution: Fixed
    • 1.0.0
    • 1.4.0
    • PySpark
    • None

    Description

      From SPARK-1920 and SPARK-1520 we know PySpark on Yarn can not work when the assembly jar are package by JDK 1.7+, so ship pyspark archives to executors by Yarn with --py-files. The pyspark archives name must contains "spark-pyspark".

      1st: zip pyspark to spark-pyspark_2.10.zip
      2nd:./bin/spark-submit --master yarn-client/yarn-cluster --py-files spark-pyspark_2.10.zip app.py args

      Attachments

        Issue Links

          Activity

            People

              lianhuiwang Lianhui Wang
              Sephiroth-Lin Weizhong
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: