Uploaded image for project: 'Hadoop Common'
  1. Hadoop Common
  2. HADOOP-5839

fixes to ec2 scripts to allow remote job submission

VotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • None
    • 0.21.0
    • contrib/cloud
    • None
    • Reviewed

    Description

      i would very much like the option of submitting jobs from a workstation outside ec2 to a hadoop cluster in ec2. This has been explored here:

      http://www.nabble.com/public-IP-for-datanode-on-EC2-tt19336240.html

      the net result of this is that we can make this work (along with using a socks proxy) with a couple of changes in the ec2 scripts:
      a) use public 'hostname' for fs.default.name setting (instead of the private hostname being used currently)
      b) mark hadoop.rpc.socket.factory.class.default as final variable in the generated hadoop-site.xml (that applies to server side)

      #a has no downside as far as i can tell since public hostnames resolve to internal/private IP addresses within ec2 (so traffic is optimally routed).

      Attachments

        1. hadoop-5839.2.patch
          2 kB
          Joydeep Sen Sarma
        2. 5839.1.patch
          2 kB
          Joydeep Sen Sarma

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            jsensarma Joydeep Sen Sarma
            jsensarma Joydeep Sen Sarma
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment