Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-5586

Null-aware anti-join can take a long time to cancel

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Minor
    • Resolution: Fixed
    • Impala 2.10.0
    • Impala 2.10.0
    • Backend

    Description

      Queries with a null-aware anti-join joining on a large number of NULLs can take a long time to cancel if threads are stuck in PartitionedHashJoinNode::EvaluateNullProbe().

      Reproduction:

      1. Run this query against tpch_parquet:
        select l_orderkey, l_partkey, l_suppkey, l_linenumber
        from lineitem
        where if(l_orderkey % 2 = 0, NULL, l_partkey) not in (
          select p_partkey
          from part
          where p_retailprice < l_extendedprice * l_tax)
        order by 1,2,3,4
        
      2. Cancel it
      3. See Impala still consuming CPU in EvaluateNullProbe() for a long time after cancellation

      Attachments

        Activity

          People

            anujphadke Anuj Phadke
            tarmstrong Tim Armstrong
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: