Uploaded image for project: 'Spark'
  1. Spark
  2. SPARK-6420

Driver's Block Manager does not use "spark.driver.host" in Yarn-Client mode

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Duplicate
    • None
    • None
    • Block Manager, Spark Core
    • None

    Description

      In my cluster, the yarn node does not know the client's host name.
      So I set "spark.driver.host" to the ip address of the client.
      But the driver's Block Manager does not use "spark.driver.host" but the hostname in Yarn-Client mode.

      I got the following error:

      TaskSetManager: Lost task 1.1 in stage 0.0 (TID 2, hadoop-node1538098): java.io.IOException: Failed to connect to example-hostname
      at org.apache.spark.network.client.TransportClientFactory.createClient(TransportClientFactory.java:191)
      at org.apache.spark.network.client.TransportClientFactory.createClient(TransportClientFactory.java:156)
      at org.apache.spark.network.netty.NettyBlockTransferService$$anon$1.createAndStart(NettyBlockTransferService.scala:78)
      at org.apache.spark.network.shuffle.RetryingBlockFetcher.fetchAllOutstanding(RetryingBlockFetcher.java:140)
      at org.apache.spark.network.shuffle.RetryingBlockFetcher.access$200(RetryingBlockFetcher.java:43)
      at org.apache.spark.network.shuffle.RetryingBlockFetcher$1.run(RetryingBlockFetcher.java:170)
      at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
      at java.util.concurrent.FutureTask.run(FutureTask.java:262)
      at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
      at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
      at java.lang.Thread.run(Thread.java:745)
      Caused by: java.nio.channels.UnresolvedAddressException
      at sun.nio.ch.Net.checkAddress(Net.java:127)
      at sun.nio.ch.SocketChannelImpl.connect(SocketChannelImpl.java:644)
      at io.netty.channel.socket.nio.NioSocketChannel.doConnect(NioSocketChannel.java:193)
      at io.netty.channel.nio.AbstractNioChannel$AbstractNioUnsafe.connect(AbstractNioChannel.java:200)
      at io.netty.channel.DefaultChannelPipeline$HeadContext.connect(DefaultChannelPipeline.java:1029)
      at io.netty.channel.AbstractChannelHandlerContext.invokeConnect(AbstractChannelHandlerContext.java:496)
      at io.netty.channel.AbstractChannelHandlerContext.connect(AbstractChannelHandlerContext.java:481)
      at io.netty.channel.ChannelOutboundHandlerAdapter.connect(ChannelOutboundHandlerAdapter.java:47)
      at io.netty.channel.AbstractChannelHandlerContext.invokeConnect(AbstractChannelHandlerContext.java:496)
      at io.netty.channel.AbstractChannelHandlerContext.connect(AbstractChannelHandlerContext.java:481)
      at io.netty.channel.AbstractChannelHandlerContext.connect(AbstractChannelHandlerContext.java:463)
      at io.netty.channel.DefaultChannelPipeline.connect(DefaultChannelPipeline.java:849)
      at io.netty.channel.AbstractChannel.connect(AbstractChannel.java:199)
      at io.netty.bootstrap.Bootstrap$2.run(Bootstrap.java:165)
      at io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleThreadEventExecutor.java:380)
      at io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:357)
      at io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadEventExecutor.java:116)
      ... 1 more

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              marsishandsome Liangliang Gu
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: