Uploaded image for project: 'Zeppelin'
  1. Zeppelin
  2. ZEPPELIN-806

Create worker Instance in Apache Zeppelin running on Spark Standalone

    Details

      Description

      Hi,

      I am running apache zeppelin on a digital ocean droplet where I have created a Spark standalone cluster also. I am trying to extract data from mongoDB using mongo-spark connector by stratio which is taking around 8 min to extract a single collection of 12 million documents from the mongoDB replica sets. So for testing purposes I will like to increase my worker instance. Please help me Thank You.

      I am relatively new to spark and zeppelin.

      zeppelin-env.sh

      export ZEPPELIN_JAVA_OPTS="-Dspark.executor.memory=6g"

      export SPARK_HOME="/home/username/spark"
      export SPARK_SUBMIT_OPTIONS="--packages com.stratio.datasource:spark-mongodb_2.10:0.10.1"
      export MASTER=spark://master-ip:7077
      export ZEPPELIN_PORT=8088
      export ZEPPELIN_MEM=-Xmx8g
      export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64

      Thanks.

      Karthik

        Attachments

          Activity

            People

            • Assignee:
              luciano resende Luciano Resende
              Reporter:
              karthik.sivadas Karthik Sivadas
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: