Uploaded image for project: 'Zeppelin'
  1. Zeppelin
  2. ZEPPELIN-2621

Spark standalone Dockerfile downloads corrupt spark.tgz archive

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Minor
    • Resolution: Unresolved
    • 0.7.0
    • None
    • None
    • None

    Description

      trying to build a docker image as per these instructions

      generates the following error:

      > docker build -t "spark_standalone" .

      (...)
      Step 12 : RUN curl -s http://apache.mirror.cdnetworks.com/spark/spark-$SPARK_VERSION/spark-$SPARK_VERSION-bin-hadoop$HADOOP_PROFILE.tgz | tar -xz -C /usr/local/
      ---> Running in adf4bd575e37

      gzip: stdin: unexpected end of file
      tar: Unexpected EOF in archive
      tar: Unexpected EOF in archive
      tar: Error is not recoverable: exiting now
      The command '/bin/sh -c curl -s http://apache.mirror.cdnetworks.com/spark/spark-$SPARK_VERSION/spark-$SPARK_VERSION-bin-hadoop$HADOOP_PROFILE.tgz | tar -xz -C /usr/local/' returned a non-zero code: 2

      I've tracked this down to the .tgz file being downloaded from http://apache.mirror.cdnetworks.com/ being corrupted. If i substitute with a cloudfront URL, e.g. https://d3kbcqa49mib13.cloudfront.net/, the image gets built successfully.

      Attachments

        Activity

          People

            Unassigned Unassigned
            alrutten Anne Rutten
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: