  1. Apache Drill
  2. DRILL-7225

Merging of ColumnTypeInfo for files with different schemas throws NullPointerException during refresh metadata


Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 1.16.0
    • Fix Version/s: 1.17.0
    • Component/s: Metadata

    Description

      Merging ColumnTypeInfo from two files with different schemas throws a NullPointerException. For example, suppose a directory Orders contains two files:

      • orders.parquet (with columns order_id, order_name, order_date)
      • orders_with_address.parquet (with columns order_id, order_name, address)

      When REFRESH TABLE METADATA is triggered, per-column metadata such as total_null_count is aggregated across both files and stored in the ColumnTypeInfo. The ColumnTypeInfo is initialized from the first file's columns (i.e., order_id, order_name, order_date). During aggregation, each column of the second file is looked up in the existing ColumnTypeInfo, and since some of them (here, address) do not exist in it, a NullPointerException is thrown. This can be fixed by initializing a ColumnTypeInfo entry for any column that is not yet present.
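The failure mode and the proposed fix can be illustrated with a minimal, self-contained sketch. This is not Drill's actual code: the class ColumnTypeInfoMergeSketch, the ColumnInfo holder, the methods mergeUnsafe and mergeSafe, and the null counts are all hypothetical and chosen only to mirror the Orders example above. It assumes the aggregated metadata is kept in a map keyed by column name.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class ColumnTypeInfoMergeSketch {

    /** Hypothetical stand-in for the aggregated per-column metadata. */
    static final class ColumnInfo {
        long totalNullCount;
    }

    /**
     * Mirrors the reported behavior: seed from the first file only, then
     * look up every column of the later files without a null check.
     * Each file is modeled as a map of column name -> null count (made-up data).
     */
    static Map<String, ColumnInfo> mergeUnsafe(List<Map<String, Long>> files) {
        Map<String, ColumnInfo> merged = new LinkedHashMap<>();
        for (Map.Entry<String, Long> e : files.get(0).entrySet()) {
            ColumnInfo info = new ColumnInfo();
            info.totalNullCount = e.getValue();
            merged.put(e.getKey(), info);
        }
        for (Map<String, Long> file : files.subList(1, files.size())) {
            for (Map.Entry<String, Long> e : file.entrySet()) {
                // get() returns null for a column the first file did not have,
                // so this field access throws NullPointerException.
                merged.get(e.getKey()).totalNullCount += e.getValue();
            }
        }
        return merged;
    }

    /** Sketch of the fix: create the entry when the column is not yet present. */
    static Map<String, ColumnInfo> mergeSafe(List<Map<String, Long>> files) {
        Map<String, ColumnInfo> merged = new LinkedHashMap<>();
        for (Map<String, Long> file : files) {
            for (Map.Entry<String, Long> e : file.entrySet()) {
                merged.computeIfAbsent(e.getKey(), k -> new ColumnInfo())
                      .totalNullCount += e.getValue();
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        // Illustrative null counts only; the issue does not report real values.
        Map<String, Long> orders =
            Map.of("order_id", 0L, "order_name", 1L, "order_date", 2L);
        Map<String, Long> ordersWithAddress =
            Map.of("order_id", 0L, "order_name", 3L, "address", 5L);
        List<Map<String, Long>> files = List.of(orders, ordersWithAddress);

        try {
            mergeUnsafe(files);
        } catch (NullPointerException e) {
            System.out.println("unsafe merge failed on a column missing from the first file");
        }

        Map<String, ColumnInfo> merged = mergeSafe(files);
        System.out.println("columns after safe merge: " + merged.keySet());
    }
}
```

With computeIfAbsent, the merged result covers the union of the columns of both files, and columns that appear in only one file keep the counts from that file.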

       


            People

              Assignee: Venkata Jyothsna Donapati (vdonapati)
              Reporter: Venkata Jyothsna Donapati (vdonapati)
              Reviewer: Aman Sinha
              Votes: 0
              Watchers: 4

              Dates

                Created:
                Updated:
                Resolved: