Apache Drill / DRILL-5304

Queries fail intermittently when there is skew in data distribution

Details

    Description

      In a distributed environment, we've observed certain queries failing intermittently with a fragment assignment error when the underlying data is unevenly distributed across nodes.

      For example, TPC-H query 7 failed with the following error:

      java.sql.SQLException: SYSTEM ERROR: IllegalArgumentException: MinorFragmentId 105 has no read entries assigned
      ...
        (org.apache.drill.exec.work.foreman.ForemanException) Unexpected exception during fragment initialization: MinorFragmentId 105 has no read entries assigned
          org.apache.drill.exec.work.foreman.Foreman.run():281
          java.util.concurrent.ThreadPoolExecutor.runWorker():1145
          java.util.concurrent.ThreadPoolExecutor$Worker.run():615
          java.lang.Thread.run():744
        Caused By (java.lang.IllegalArgumentException) MinorFragmentId 105 has no read entries assigned
      

      A log containing the full stack trace is attached.

      For this query, the underlying TPC-H SF100 Parquet dataset was found to reside mostly on just 2-3 nodes of an 8-node DFS cluster. This skew in data distribution is most likely the trigger, since the same query on the same dataset does not fail on a different test cluster (with a possibly different data distribution).
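
      To make the suspected failure mode concrete, here is a toy model in Java. It is not Drill's actual assignment code: the class name, the `assign` and `validate` helpers, and the locality-only placement rule are all assumptions made for illustration. It only shows how, when work units live on a few nodes, a fragment scheduled on a node without data can end up with nothing to read and trip a check like the one in the error above.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch, NOT Drill's implementation: a locality-only assignment
// of work units (e.g. Parquet row groups) to minor fragments.
class SkewedAssignmentSketch {

    // fragmentNodes[i] is the node running minor fragment i;
    // unitNodes[j] is the node that stores work unit j.
    static Map<Integer, List<Integer>> assign(String[] fragmentNodes, String[] unitNodes) {
        Map<Integer, List<Integer>> assignments = new HashMap<>();
        Map<String, List<Integer>> fragmentsByNode = new HashMap<>();
        for (int f = 0; f < fragmentNodes.length; f++) {
            assignments.put(f, new ArrayList<>());
            fragmentsByNode.computeIfAbsent(fragmentNodes[f], k -> new ArrayList<>()).add(f);
        }
        // Per-node counter used to spread a node's units over its local fragments.
        Map<String, Integer> seen = new HashMap<>();
        for (int u = 0; u < unitNodes.length; u++) {
            List<Integer> local = fragmentsByNode.get(unitNodes[u]);
            if (local == null) {
                continue; // toy model: units with no local fragment are skipped
            }
            int i = seen.merge(unitNodes[u], 1, Integer::sum) - 1;
            assignments.get(local.get(i % local.size())).add(u);
        }
        return assignments;
    }

    // Mirrors the shape of the reported error: every fragment must have work.
    static void validate(Map<Integer, List<Integer>> assignments) {
        for (int f = 0; f < assignments.size(); f++) {
            if (assignments.get(f).isEmpty()) {
                throw new IllegalArgumentException(
                    "MinorFragmentId " + f + " has no read entries assigned");
            }
        }
    }

    public static void main(String[] args) {
        // Four fragments on four nodes, but all data sits on node1 and node2.
        String[] fragmentNodes = {"node1", "node2", "node3", "node4"};
        String[] unitNodes = {"node1", "node1", "node2", "node1", "node2", "node1"};
        try {
            validate(assign(fragmentNodes, unitNodes));
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

      In this sketch the fragments on node3 and node4 receive no work units, so validation fails at the first empty fragment. An assignment that falls back to non-local units for otherwise empty fragments would avoid the error in this toy model; whether that matches the real cause in Drill is not established here.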

      Another query also failed with a similar error when the slice target was set to 1.

      Failed with exception
      java.sql.SQLException: SYSTEM ERROR: IllegalArgumentException: MinorFragmentId 66 has no read entries assigned
      ...
        (org.apache.drill.exec.work.foreman.ForemanException) Unexpected exception during fragment initialization: MinorFragmentId 66 has no read entries assigned
      

      Attachments

        1. query1_drillbit.log.txt
          11 kB
          Abhishek Girish
        2. query2_drillbit.log.txt
          10 kB
          Abhishek Girish

            People

              ppenumarthy Padma Penumarthy
              agirish Abhishek Girish
              Abhishek Girish Abhishek Girish