Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-5535

Paging Problem with Querying Directories

    XMLWordPrintableJSON

    Details

    • Flags:
      Patch

      Description

      Problem comes with the following Drill query:
      "SELECT * FROM <<mySource>>
      WHERE (dir0='Test1' AND dir1='TestDataSourceID1')
      OR (dir0='Test2' AND dir1='TestDataSourceID2')
      LIMIT 2 OFFSET 0"

      If this call gets run twice it is randomly set which file will be in the result. So if a query is created which should page my result I won't be able to tell which source was used for the result.
      Due two the fact that if file1 contains the columns a, b, c and column b, c, d I also will get a problem with the result as the first results will for example contain the columns a, b, c and the second half of the results will contain a, b, c, d with a filled with null.

      As in the example on your webpage (https://drill.apache.org/docs/querying-directories/) where you query specific columns and order the result without any paging I am wondering if this problem only occurs while using the star in the query.

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              LukeP2090 Lucian Poth
            • Votes:
              1 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated: