Some systems (e.g. Spark) write Parquet files whose integral columns carry logical type annotations. Impala fails to handle these logical types when constructing a table from an existing Parquet file. However, reading data from such files works fine.
For example, consider a file with the following Parquet schema:
A CREATE TABLE ... LIKE PARQUET statement fails with something like the following:
This functionality is handled by the convertLogicalParquetType method in the com.cloudera.impala.analysis.CreateTableLikeFileStmt class, which currently does not handle integer logical types.
See https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#numeric-types for information about the mapping between logical types and encodings.
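As a rough illustration of the mapping described in that document, the sketch below resolves signed integer annotations to Impala column types. It is not Impala's actual code: the class IntLogicalTypeSketch, its enums, and the toImpalaType method are hypothetical names made up for this example, and the real fix would work with the Parquet schema classes inside convertLogicalParquetType. Unsigned annotations are left out because Impala has no unsigned integer types.

```java
// Hypothetical, self-contained sketch; none of these names exist in Impala.
final class IntLogicalTypeSketch {
  // Physical (storage) types that the signed integer annotations can decorate.
  enum PhysicalType { INT32, INT64 }

  // Signed integer annotations from the Parquet LogicalTypes document.
  enum IntAnnotation { INT_8, INT_16, INT_32, INT_64 }

  // Returns the Impala type name for a column. A null annotation means the
  // column is a plain int32/int64 with no logical type attached.
  static String toImpalaType(PhysicalType physical, IntAnnotation annotation) {
    if (annotation == null) {
      return physical == PhysicalType.INT32 ? "INT" : "BIGINT";
    }
    switch (annotation) {
      case INT_8:
        requirePhysical(physical, PhysicalType.INT32, annotation);
        return "TINYINT";
      case INT_16:
        requirePhysical(physical, PhysicalType.INT32, annotation);
        return "SMALLINT";
      case INT_32:
        requirePhysical(physical, PhysicalType.INT32, annotation);
        return "INT";
      case INT_64:
        requirePhysical(physical, PhysicalType.INT64, annotation);
        return "BIGINT";
      default:
        throw new IllegalArgumentException("Unsupported annotation: " + annotation);
    }
  }

  // The spec ties INT_8/INT_16/INT_32 to int32 and INT_64 to int64; reject
  // any other combination instead of guessing a column type.
  private static void requirePhysical(
      PhysicalType actual, PhysicalType expected, IntAnnotation annotation) {
    if (actual != expected) {
      throw new IllegalArgumentException(
          annotation + " must annotate " + expected + ", found " + actual);
    }
  }
}
```

With a mapping of this shape, an int32 column annotated INT_8 would become a TINYINT column instead of causing the statement to fail, and the inverse mapping on the write path would allow tinyint and smallint to round-trip.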
We should implement read and write support for this metadata, i.e. allow correct round-tripping of tinyint and smallint types.