Uploaded image for project: 'Livy'
  1. Livy
  2. LIVY-429

Livy skips url fragment identifier in spark.yarn.dist.archives setting

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 0.3
    • 0.5.0
    • Server
    • None

    Description

      When submitting Spark session via Livy API with the following parameters:

      {
      	"kind": "pyspark",
      	"name": "my-app"
      	"conf": {
      		"spark.yarn.appMasterEnv.PYSPARK_PYTHON": "./ENVS/custom-python/bin/python"
      	},
      	"archives": ["/path/to/custom-python.zip#ENVS"]
      }
      

      Spark session fails with:

      java.io.IOException: Cannot run program "./ENVS/custom-python/bin/python": error=2, No such file or directory
      

      Because Livy uses java.net.URI#getPath method under the hood, which just skips URI fragment identifier part and Spark is being submitted with spark.yarn.dist.archives=/path/to/custom-python.zip and does not even extracts the archive in this case.

      Attachments

        Activity

          People

            broartem Artem Plotnikov
            broartem Artem Plotnikov
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: