Spark / SPARK-18971

Netty issue may cause the shuffle client to hang

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 2.1.2, 2.1.3, 2.2.0
    • Component/s: Spark Core
    • Labels: None

    Description

      Check https://github.com/netty/netty/issues/6153 for details

      You should see a stack trace similar to the following in the executor thread dump.

      "shuffle-client-7-4" daemon prio=5 tid=97 RUNNABLE
              at io.netty.util.Recycler$Stack.scavengeSome(Recycler.java:504)
              at io.netty.util.Recycler$Stack.scavenge(Recycler.java:454)
              at io.netty.util.Recycler$Stack.pop(Recycler.java:435)
              at io.netty.util.Recycler.get(Recycler.java:144)
              at io.netty.buffer.PooledUnsafeDirectByteBuf.newInstance(PooledUnsafeDirectByteBuf.java:39)
              at io.netty.buffer.PoolArena$DirectArena.newByteBuf(PoolArena.java:727)
              at io.netty.buffer.PoolArena.allocate(PoolArena.java:140)
              at io.netty.buffer.PooledByteBufAllocator.newDirectBuffer(PooledByteBufAllocator.java:271)
              at io.netty.buffer.AbstractByteBufAllocator.directBuffer(AbstractByteBufAllocator.java:177)
              at io.netty.buffer.AbstractByteBufAllocator.directBuffer(AbstractByteBufAllocator.java:168)
              at io.netty.buffer.AbstractByteBufAllocator.ioBuffer(AbstractByteBufAllocator.java:129)
              at io.netty.channel.AdaptiveRecvByteBufAllocator$HandleImpl.allocate(AdaptiveRecvByteBufAllocator.java:104)
              at io.netty.channel.nio.AbstractNioByteChannel$NioByteUnsafe.read(AbstractNioByteChannel.java:117)
              at io.netty.channel.nio.NioEventLoop.processSelectedKey(NioEventLoop.java:652)
              at io.netty.channel.nio.NioEventLoop.processSelectedKeysOptimized(NioEventLoop.java:575)
              at io.netty.channel.nio.NioEventLoop.processSelectedKeys(NioEventLoop.java:489)
              at io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:451)
              at io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadEventExecutor.java:140)
              at io.netty.util.concurrent.DefaultThreadFactory$DefaultRunnableDecorator.run(DefaultThreadFactory.java:144)
              at java.lang.Thread.run(Thread.java:745)
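
      To check a saved executor thread dump for this pattern, something like the following sketch may help. It assumes the dump is plain text in the same layout as the trace above (a quoted thread name on a header line, followed by "at ..." frame lines). The class and method names (RecyclerHangScan, findSuspectThreads) are hypothetical and are not part of Spark or Netty. A single match is only a hint; presumably the thread would need to show the same frame across several dumps before it is treated as hung.

```java
import java.util.ArrayList;
import java.util.List;

class RecyclerHangScan {
    // Frame copied from the stack trace in this issue.
    static final String SUSPECT_FRAME = "io.netty.util.Recycler$Stack.scavengeSome";

    /**
     * Returns the names of shuffle-client threads whose stack contains the
     * suspect frame. Assumes each thread starts with a line such as
     * "shuffle-client-7-4" daemon prio=5 tid=97 RUNNABLE
     */
    static List<String> findSuspectThreads(String dump) {
        List<String> hits = new ArrayList<>();
        String current = null;    // thread whose frames are being read
        boolean reported = false; // avoid listing the same thread twice
        for (String raw : dump.split("\\R")) {
            String line = raw.trim();
            if (line.startsWith("\"")) {
                // Header line: the thread name is the text between the first two quotes.
                int end = line.indexOf('"', 1);
                current = end > 0 ? line.substring(1, end) : null;
                reported = false;
            } else if (current != null && !reported
                    && current.startsWith("shuffle-client")
                    && line.startsWith("at " + SUSPECT_FRAME + "(")) {
                hits.add(current);
                reported = true;
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        String dump = String.join("\n",
            "\"shuffle-client-7-4\" daemon prio=5 tid=97 RUNNABLE",
            "        at io.netty.util.Recycler$Stack.scavengeSome(Recycler.java:504)",
            "        at io.netty.util.Recycler$Stack.scavenge(Recycler.java:454)",
            "",
            "\"main\" prio=5 tid=1 RUNNABLE",
            "        at java.lang.Thread.run(Thread.java:745)");
        System.out.println(findSuspectThreads(dump)); // prints [shuffle-client-7-4]
    }
}
```

      Matching on the thread-name prefix and one frame keeps the check narrow; other threads that pass through the Recycler are ignored.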
      

            People

              Assignee: zsxwing Shixiong Zhu
              Reporter: zsxwing Shixiong Zhu