Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-4905

Push down the LIMIT to the parquet reader scan to limit the numbers of records read

VotersWatch issueWatchersLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 1.8.0
    • 1.9.0
    • Storage - Parquet
    • None

    Description

      Limit the number of records read from disk by pushing down the limit to parquet reader.

      For queries like
      select * from <table> limit N;

      where N < size of Parquet row group, we are reading 32K/64k rows or entire row group. This needs to be optimized to read only N rows.

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            ppenumarthy Padma Penumarthy
            ppenumarthy Padma Penumarthy
            Rahul Kumar Challapalli Rahul Kumar Challapalli
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment