Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-3261

Tpch query 4 over hive sf100 fails with an OOM error. Query runs fine against text and parquet

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    Description

      git.commit.id.abbrev=5f26b8b

      I have a 2 node cluster with the below memory allocation for the drillbits

      DRILL_MAX_DIRECT_MEMORY="32G"
      DRILL_MAX_HEAP="4G"
      

      The below query never returned

      select
        o.o_orderpriority,
        count(*) as order_count
      from
        orders o
      where
        o.o_orderdate >= date '1996-10-01'
        and o.o_orderdate < date '1996-10-01' + interval '3' month
        and
        exists (
          select
            *
          from
            lineitem l
          where
            l.l_orderkey = o.o_orderkey
            and l.l_commitdate < l.l_receiptdate
        )
      group by
        o.o_orderpriority
      order by
        o.o_orderpriority;
      

      I attached the hive DDL and the error logs. Let me know if you need anything else

      Attachments

        1. tpch100_hive.ddl
          3 kB
          Rahul Kumar Challapalli
        2. error.log
          13 kB
          Rahul Kumar Challapalli

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            ppenumarthy Padma Penumarthy
            rkins Rahul Kumar Challapalli

            Dates

              Created:
              Updated:

              Slack

                Issue deployment