Details
-
Bug
-
Status: Resolved
-
Critical
-
Resolution: Duplicate
-
0.8.0
-
None
Description
Running in embedded mode on my mac.
$ wc -w data.csv 50000 data.csv
Here's the query:
0: jdbc:drill:zk=local> SELECT count(*) FROM dfs.`data.csv`; +------------+ | EXPR$0 | +------------+ | 50000 | +------------+ 1 row selected (0.223 seconds) 0: jdbc:drill:zk=local> SELECT columns[0] FROM dfs.`data.csv` ORDER BY columns[0]; +------------+ | EXPR$0 | +------------+ ... | 6 | +------------+ 50,001 rows selected (0.928 seconds) 0: jdbc:drill:zk=local> SELECT tab.col, COUNT(tab.col) FROM (SELECT columns[0] col FROM dfs.`data.csv` ORDER BY columns[0]) tab GROUP BY tab.col; +------------+------------+ | col | EXPR$1 | +------------+------------+ | 2 | 10000 | | 3 | 10000 | | 4 | 10000 | | 5 | 10001 | | 6 | 10000 | +------------+------------+ 5 rows selected (0.704 seconds)
Attachments
Attachments
Issue Links
- duplicates
-
DRILL-2083 order by on large dataset returns wrong results
- Closed