Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-23719

Use correct hostname in non-host networking mode in hadoop 3 docker support

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 3.1.0
    • None
    • Spark Core, YARN
    • None

    Description

      Hostname (node-id's hostname field) specified by RM in allocated containers is the NM_HOST and not the hostname which will be used by the container when running in docker container executor : the actual container hostname is generated at runtime.

      Due to this spark executor's are unable to launch in non-host networking mode when leveraging docker support in hadoop 3 - due to bind failures as hostname they are trying to bind to is of the host machine and not the container.

      We can leverage YARN-7935 to fetch the container's hostname (when available) else fallback to existing mechanism - when running executors.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              mridulm80 Mridul Muralidharan
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated: