Details
-
New Feature
-
Status: Resolved
-
Minor
-
Resolution: Fixed
-
Impala 2.2
Description
I'm trying to DDL a table backed by a parquet file with the following schema:
message org.bdgenomics.formats.avro.Variant_flat { optional int32 variantErrorProbability; optional binary contig__contigName (UTF8); optional int64 contig__contigLength; optional binary contig__contigMD5 (UTF8); optional binary contig__referenceURL (UTF8); optional binary contig__assembly (UTF8); optional binary contig__species (UTF8); optional int32 contig__referenceIndex; optional int64 start; optional int64 end; optional binary referenceAllele (UTF8); optional binary alternateAllele (UTF8); optional binary svAllele__type (ENUM); optional binary svAllele__assembly (UTF8); optional boolean svAllele__precise; optional int32 svAllele__startWindow; optional int32 svAllele__endWindow; optional boolean isSomatic; }
I get the following error:
Query: create EXTERNAL TABLE prepartition LIKE PARQUET '/user/ec2-user/dbsnp/adam_flat_variants/part-r-00000.gz.parquet' STORED AS PARQUET LOCATION 'hdfs:///user/ec2-user/dbsnp/adam_flat_variants' ERROR: AnalysisException: Unsupported logical parquet type ENUM (primitive type is BINARY) for field svAllele__type