Affects Version/s: None
Fix Version/s: 0.4.0
Currently, the scan operator will read all the columns from the data source, and pass all the columns to down-stream operator.
In order to improve performance of Drill query, we should instead push the project operation into scan operator. The scan operator will take as an input a list of columns required to retrieve. If the query only refers a subset of columns, the scan operator will only have to retrieve values for the column in the list.
It's expected to see performance improvement once project push-down is supported.
|Field||Original Value||New Value|
|Attachment||DRILL-491.1.patch [ 12639102 ]|
|Attachment||DRILL-491.2.patch [ 12642157 ]|
|Status||Open [ 1 ]||Resolved [ 5 ]|
|Resolution||Fixed [ 1 ]|
|Workflow||no-reopen-closed, patch-avail [ 12854497 ]||no-reopen-closed, patch-avail, testing [ 12860186 ]|
|Fix Version/s||0.4.0 [ 12324963 ]|
|Workflow||no-reopen-closed, patch-avail, testing [ 12860186 ]||Drill workflow [ 12934231 ]|