  1. Apache Drill
  2. DRILL-7225

Merging of columnTypeInfo for file with different schema throws NullPointerException during refresh metadata

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.16.0
    • Fix Version/s: 1.17.0
    • Component/s: Metadata
    • Labels:

      Description

      Merging of columnTypeInfo from two files with different schemas throws a NullPointerException. For example, suppose a directory Orders contains two files:

      • orders.parquet (with columns order_id, order_name, order_date)
      • orders_with_address.parquet (with columns order_id, order_name, address)

      When refresh table metadata is triggered, metadata such as total_null_count for the columns in both files is aggregated and updated in the ColumnTypeInfo. The ColumnTypeInfo is initialized from the first file's ColumnTypeInfo (i.e., order_id, order_name, order_date). During aggregation, each column of the second file is looked up in the existing ColumnTypeInfo; because some of them (here, address) are not present in it, a NullPointerException is thrown. This can be fixed by initializing the ColumnTypeInfo for columns that are not yet present.
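
      The fix described above can be illustrated with a minimal, self-contained sketch. It is not Drill's actual code: the class ColumnTypeInfoMergeSketch, the nested ColumnInfo type, the merge method, and the null counts are all hypothetical and serve only to show the "create the entry if it is missing" pattern, here using java.util.Map.computeIfAbsent.

```java
import java.util.HashMap;
import java.util.Map;

public class ColumnTypeInfoMergeSketch {

  // Hypothetical stand-in for the aggregated per-column metadata (not Drill's class).
  static final class ColumnInfo {
    long totalNullCount;
  }

  // Adds one file's per-column null counts to the running aggregate.
  static void merge(Map<String, ColumnInfo> aggregate, Map<String, Long> fileNullCounts) {
    for (Map.Entry<String, Long> entry : fileNullCounts.entrySet()) {
      // A plain aggregate.get(name) returns null for a column that no earlier
      // file contained; dereferencing that null is what raises the exception.
      // computeIfAbsent creates the missing entry before it is updated.
      ColumnInfo info = aggregate.computeIfAbsent(entry.getKey(), k -> new ColumnInfo());
      info.totalNullCount += entry.getValue();
    }
  }

  public static void main(String[] args) {
    Map<String, ColumnInfo> aggregate = new HashMap<>();
    // Illustrative null counts for orders.parquet (assumed values).
    merge(aggregate, Map.of("order_id", 0L, "order_name", 1L, "order_date", 2L));
    // Illustrative null counts for orders_with_address.parquet (assumed values).
    merge(aggregate, Map.of("order_id", 0L, "order_name", 3L, "address", 5L));
    System.out.println(aggregate.get("order_name").totalNullCount); // prints 4
    System.out.println(aggregate.get("address").totalNullCount);    // prints 5
  }
}
```

      In this sketch, a column that appears only in the second file (address) gets a fresh entry instead of triggering the exception, while columns shared by both files accumulate their counts.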

       

              People

              • Assignee:
                vdonapati Venkata Jyothsna Donapati
                Reporter:
                vdonapati Venkata Jyothsna Donapati
                Reviewer:
                Aman Sinha
              • Votes:
                0
                Watchers:
                4

                Dates

                • Created:
                  Updated:
                  Resolved: