Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-2525

Impala CREATE TABLE LIKE PARQUET breaks on ENUM logical type

    Details

      Description

      I'm trying to DDL a table backed by a parquet file with the following schema:

      message org.bdgenomics.formats.avro.Variant_flat {
        optional int32 variantErrorProbability;
        optional binary contig__contigName (UTF8);
        optional int64 contig__contigLength;
        optional binary contig__contigMD5 (UTF8);
        optional binary contig__referenceURL (UTF8);
        optional binary contig__assembly (UTF8);
        optional binary contig__species (UTF8);
        optional int32 contig__referenceIndex;
        optional int64 start;
        optional int64 end;
        optional binary referenceAllele (UTF8);
        optional binary alternateAllele (UTF8);
        optional binary svAllele__type (ENUM);
        optional binary svAllele__assembly (UTF8);
        optional boolean svAllele__precise;
        optional int32 svAllele__startWindow;
        optional int32 svAllele__endWindow;
        optional boolean isSomatic;
      }
      

      I get the following error:

      Query: create EXTERNAL TABLE prepartition LIKE PARQUET '/user/ec2-user/dbsnp/adam_flat_variants/part-r-00000.gz.parquet' STORED AS PARQUET LOCATION 'hdfs:///user/ec2-user/dbsnp/adam_flat_variants'
      ERROR: AnalysisException: Unsupported logical parquet type ENUM (primitive type is BINARY) for field svAllele__type
      

        Attachments

          Activity

            People

            • Assignee:
              jkukul Jakub Kukul
              Reporter:
              laserson Uri Laserson
            • Votes:
              1 Vote for this issue
              Watchers:
              10 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: