  1. Zeppelin
  2. ZEPPELIN-457

Add documentation about Spark on EMR using Zeppelin Sandbox

Details

    • Type: Improvement
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved

    Description

      Many people now run Spark on AWS EMR clusters.
      It would therefore help users if Zeppelin provided a step-by-step guide.

      This documentation could cover the following topics.

      • How to create a cluster and install "Zeppelin-Sandbox".
      • How to establish an SSH connection to the master node.
      • How to browse the web interfaces hosted on the cluster (how to set up an SSH tunnel to the master node using local or dynamic port forwarding).
      • Information about the predefined Zeppelin-Sandbox environment (such as the locations of Zeppelin itself and of its log and notebook directories on the master node), plus the Hadoop, Spark, and Zeppelin service port numbers, etc.
      • A tutorial for beginners, like the attached image.
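
      To illustrate the SSH items in the list above, here is a minimal Python sketch that only assembles the `ssh` command lines for a plain connection, local port forwarding, and dynamic (SOCKS) port forwarding. It does not run them. The function names, the key file `mykey.pem`, the host name, the `hadoop` login user, the Zeppelin port 8890, and the SOCKS port 8157 are all assumptions made for illustration and are not taken from this issue, so the real guide should confirm them against the actual cluster.

```python
import shlex

# Assumption: "hadoop" is the login user on the EMR master node.
DEFAULT_USER = "hadoop"


def ssh_command(key_file, master_dns, user=DEFAULT_USER):
    """Plain SSH login to the master node."""
    return ["ssh", "-i", key_file, f"{user}@{master_dns}"]


def local_forward_command(key_file, master_dns, local_port, remote_port,
                          user=DEFAULT_USER):
    """Local port forwarding: localhost:local_port -> master:remote_port.

    -N means "do not run a remote command", so the session only tunnels.
    "localhost" in the -L spec is resolved on the master node.
    """
    spec = f"{local_port}:localhost:{remote_port}"
    return ["ssh", "-i", key_file, "-N", "-L", spec, f"{user}@{master_dns}"]


def dynamic_forward_command(key_file, master_dns, socks_port,
                            user=DEFAULT_USER):
    """Dynamic port forwarding: opens a local SOCKS proxy on socks_port."""
    return ["ssh", "-i", key_file, "-N", "-D", str(socks_port),
            f"{user}@{master_dns}"]


if __name__ == "__main__":
    # Hypothetical values; replace with the real key pair and master DNS name.
    key = "mykey.pem"
    master = "ec2-xx-xx-xx-xx.compute-1.amazonaws.com"
    zeppelin_port = 8890  # assumed Zeppelin port; check the cluster's setting

    for cmd in (
        ssh_command(key, master),
        local_forward_command(key, master, zeppelin_port, zeppelin_port),
        dynamic_forward_command(key, master, 8157),
    ):
        print(shlex.join(cmd))
```

      With local forwarding, a single service becomes reachable at `http://localhost:<local_port>`. With dynamic forwarding, the browser must additionally be configured to use the SOCKS proxy, but every web interface on the cluster is then reachable through one tunnel.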

      Any ideas are welcome!

          People

            Ahyoung Ryu
            Votes: 0
            Watchers: 4
