SPARK-25995: sparkR should ensure user args are after the argument used for the port

    Details

    • Type: Bug
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 2.3.2
    • Fix Version/s: None
    • Component/s: SparkR
    • Labels:
      None

      Description

      Currently, if you run sparkR and accidentally pass an extra positional argument, it fails with an unhelpful error message. For example:

      $SPARK_HOME/bin/sparkR  --master yarn --deploy-mode client fooarg

      This gets turned into:

      Launching java with spark-submit command spark-submit   "--master" "yarn" "--deploy-mode" "client" "sparkr-shell" "fooarg" /tmp/Rtmp6XBGz2/backend_port162806ea36bca

      Notice that "fooarg" was placed before the /tmp file path, which is how R and the JVM agree on which port to connect to. SparkR eventually fails with a timeout exception after 10 seconds.
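The failure can be modeled with a small sketch. It is written in Python purely for illustration (the real launcher is R and the backend runs on the JVM), and it rests on two assumptions: that the launcher appends the port-file path after whatever arguments the user supplied, and that the backend treats the first argument following "sparkr-shell" as that path. The function names are hypothetical and do not exist in SparkR.

```python
# Hypothetical model of the launch sequence; names are illustrative, not SparkR's.
SHELL_MARKER = "sparkr-shell"


def build_command(submit_args, port_file):
    """Assumed launcher behaviour: append the port-file path after all user input."""
    return ["spark-submit", *submit_args, port_file]


def backend_port_file(command):
    """Assumed backend behaviour: the first argument after the marker is the port file."""
    marker = command.index(SHELL_MARKER)
    return command[marker + 1]


user_args = ["--master", "yarn", "--deploy-mode", "client", SHELL_MARKER, "fooarg"]
port_file = "/tmp/Rtmp6XBGz2/backend_port162806ea36bca"
command = build_command(user_args, port_file)

# Under these assumptions the backend picks up the stray argument,
# while the R side keeps waiting on the real temp file until it times out.
print(backend_port_file(command))
```

Under this model the two sides never agree on a file, which would be consistent with the timeout described above.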

       

      SparkR should either reject extra args or make sure the order is correct so that the backend_port file is always first. See https://github.com/apache/spark/blob/master/R/pkg/R/sparkR.R#L129
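Both options proposed here can be sketched in a few lines. This is a Python model rather than a patch to sparkR.R. The helper `order_backend_args` and its `allow_user_args` flag are hypothetical, and the sketch assumes that "sparkr-shell" marks the point after which the backend expects the port-file path.

```python
# Hypothetical helper illustrating the two proposed behaviours; not SparkR code.
SHELL_MARKER = "sparkr-shell"


def order_backend_args(submit_args, port_file, allow_user_args=True):
    """Place port_file directly after the marker, ahead of any extra user args.

    With allow_user_args=False, extra arguments are rejected with a clear error
    instead of being reordered.
    """
    if SHELL_MARKER not in submit_args:
        raise ValueError(f"expected {SHELL_MARKER!r} in submit arguments")
    marker = submit_args.index(SHELL_MARKER)
    options = submit_args[: marker + 1]   # spark-submit options plus the marker
    extra = submit_args[marker + 1:]      # anything the user typed after it
    if extra and not allow_user_args:
        raise ValueError(f"unexpected arguments after {SHELL_MARKER!r}: {extra}")
    return [*options, port_file, *extra]


args = ["--master", "yarn", "--deploy-mode", "client", SHELL_MARKER, "fooarg"]
port_file = "/tmp/Rtmp6XBGz2/backend_port162806ea36bca"

# Reordering variant: the port file comes first, "fooarg" follows it.
print(order_backend_args(args, port_file))
```

Calling the helper with `allow_user_args=False` raises a `ValueError` that names the offending argument, which would surface the mistake immediately rather than after a timeout.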

            People

            • Assignee: Unassigned
            • Reporter: Thomas Graves (tgraves)
            • Votes: 0
            • Watchers: 3
