  1. Spark
  2. SPARK-6962

Netty BlockTransferService hangs in the middle of SQL query

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Incomplete
    • Affects Version/s: 1.2.0, 1.2.1, 1.3.0
    • Fix Version/s: None
    • Component/s: Spark Core, SQL
    • Labels:

      Description

      Spark SQL queries hang indefinitely under certain circumstances that are not yet fully understood. This looks like a Spark Core issue; I mention Spark SQL only because I'm using queries in the REPL to surface it.

      Setting spark.shuffle.blockTransferService=nio allows queries to complete normally, which seems to point to Netty as the issue. Netty became the default for the block transport layer in 1.2.0, which is when this issue started.
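
      For anyone who wants to try the workaround, here is a minimal sketch of passing the setting on the command line. It assumes the shell is launched with spark-shell and that the property is supplied through the standard --conf option; the helper name spark_conf_args is purely illustrative and not part of Spark.

```python
# Sketch: build a spark-shell command line that applies the workaround.
# Assumption: settings are passed via the launcher's --conf KEY=VALUE option.
# `spark_conf_args` is a hypothetical helper, not a Spark API.

def spark_conf_args(settings):
    """Turn a dict of Spark properties into a flat list of --conf arguments."""
    args = []
    for key, value in sorted(settings.items()):
        args += ["--conf", f"{key}={value}"]
    return args

# Property name and value are taken from the report above.
workaround = {"spark.shuffle.blockTransferService": "nio"}

cmd = ["spark-shell"] + spark_conf_args(workaround)
print(" ".join(cmd))
```

      The same property could presumably be set in spark-defaults.conf or on a SparkConf before the context is created; the command-line form is shown only because it is the easiest to toggle while testing.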

      I do not see this problem when running queries over smaller datasets (about 20 files of ~5MB each). When I increase the scope to include more data (several hundred ~5MB files), the queries get through several steps but eventually hang indefinitely.

      Here's the email chain regarding this issue, including stack traces:
      http://mail-archives.apache.org/mod_mbox/spark-user/201503.mbox/<CAE61SPFQt2y7D5vQZOMZZ2DMr-JX2c2ZgGCyky40npKjJx44hQ@mail.gmail.com>

      For context, here's the announcement regarding the block transfer service change: http://mail-archives.apache.org/mod_mbox/spark-dev/201411.mbox/<CABPQxssL04Q+rBLtP-D8W+z3aTn+g-Um6GMdGDnh-hZcvD-c_g@mail.gmail.com>

        Attachments

        1. jstacks.txt
          167 kB
          Jon Chase

            People

            • Assignee: Unassigned
            • Reporter: Jon Chase (jonchase)
            • Votes: 6
            • Watchers: 19
