  1. Apache Drill
  2. DRILL-982

Parquet reader should return NULL for a nonexistent column in the execution phase instead of raising ExecutionSetupException

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.5.0
    • Component/s: Storage - Parquet
    • Labels:
      None

      Description

      If a query references a nonexistent column in a Parquet file, Drill currently raises an ExecutionSetupException:

      SELECT NON_EXIST_COLUMN FROM cp.`tpch/nation.parquet`;

      This causes a problem when a query reads multiple Parquet files and the first file lacks the column while the rest have it.

      It would be better to return a NULL expression during the execution stage if the column does not exist in the Parquet file.
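
      The proposed behavior can be sketched outside of Drill. The following is a minimal illustration in Python, not Drill's reader code: each Parquet file is modeled as a plain dict of column name to values, and `scan_columns` is a hypothetical helper name. A column missing from one file is filled with NULL (`None`) for that file's rows instead of aborting the scan. The sample rows are made up.

```python
from typing import Dict, Iterator, List, Tuple

# Hypothetical stand-in for a Parquet file: column name -> list of values.
ColumnarFile = Dict[str, List[object]]


def scan_columns(
    files: List[ColumnarFile], columns: List[str]
) -> Iterator[Tuple[object, ...]]:
    """Yield rows for the requested columns across all files.

    A column that a file does not contain is filled with None for every
    row of that file, rather than raising an error.
    """
    for f in files:
        # All columns in one file have the same length; use any of them.
        row_count = len(next(iter(f.values()))) if f else 0
        projected = [f.get(name, [None] * row_count) for name in columns]
        yield from zip(*projected)


# The first file lacks "n_comment"; the second file has it.
files = [
    {"n_name": ["ALGERIA", "BRAZIL"]},
    {"n_name": ["CANADA"], "n_comment": ["c1"]},
]
rows = list(scan_columns(files, ["n_name", "n_comment"]))
```

      With this approach the order of the files no longer matters: the scan succeeds whether the file without the column comes first or last.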

      Later on, if Drill adds an option to verify column existence before executing a query (as a schema-based system does), it could throw a PlanException in the planning phase, once the Parquet footer information is available to the planner.
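
      A planning-time check of this kind might look like the sketch below. It is an assumption-laden illustration in Python, not Drill's planner: `PlanError`, `validate_columns`, and the `strict` flag are hypothetical names, each footer is reduced to a set of column names, and the sketch assumes a column is rejected only when it is missing from every file.

```python
from typing import List, Set


class PlanError(Exception):
    """Hypothetical stand-in for a planning-phase exception."""


def validate_columns(
    footer_schemas: List[Set[str]], columns: List[str], strict: bool
) -> None:
    """Reject a query at plan time if a column exists in no file footer.

    With strict=False nothing is checked, and missing columns are left
    to be returned as NULL during execution.
    """
    if not strict:
        return
    for name in columns:
        # Assumption: a column present in at least one file is acceptable.
        if not any(name in schema for schema in footer_schemas):
            raise PlanError(f"column not found in any file: {name}")


footers = [{"n_name"}, {"n_name", "n_comment"}]
validate_columns(footers, ["n_name", "n_comment"], strict=True)  # passes
```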

            People

            • Assignee: DrillCommitter
            • Reporter: Jinfeng Ni (jni)
            • Votes: 0
            • Watchers: 4
