Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-3900

OOM with Hive native scan enabled on TPCH-100 parquet, query 05.q

Attach filesAttach ScreenshotAdd voteVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Open
    • Major
    • Resolution: Unresolved
    • 1.2.0
    • None
    • Functions - Hive
    • None

    Description

      TPCH-100 parquet dataset. Configure Hive 1.0 pointing to the parquet files as external tables. Enable Hive native scan.

      alter system set `store.hive.optimize_scan_with_native_readers`=true;
      

      Run TPCH query 05 through Hive, drillbit runs out of memory. Same query goes through dfs completes successfully. (Disable hive native scan, drill also runs out of memory through hive.)

      We expect with hive native scan turned on, query should finish.

      Attachments

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            cchang@maprtech.com Chun Chang

            Dates

              Created:
              Updated:

              Slack

                Issue deployment