Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-2525

Impala CREATE TABLE LIKE PARQUET breaks on ENUM logical type

    XMLWordPrintableJSON

Details

    Description

      I'm trying to DDL a table backed by a parquet file with the following schema:

      message org.bdgenomics.formats.avro.Variant_flat {
        optional int32 variantErrorProbability;
        optional binary contig__contigName (UTF8);
        optional int64 contig__contigLength;
        optional binary contig__contigMD5 (UTF8);
        optional binary contig__referenceURL (UTF8);
        optional binary contig__assembly (UTF8);
        optional binary contig__species (UTF8);
        optional int32 contig__referenceIndex;
        optional int64 start;
        optional int64 end;
        optional binary referenceAllele (UTF8);
        optional binary alternateAllele (UTF8);
        optional binary svAllele__type (ENUM);
        optional binary svAllele__assembly (UTF8);
        optional boolean svAllele__precise;
        optional int32 svAllele__startWindow;
        optional int32 svAllele__endWindow;
        optional boolean isSomatic;
      }
      

      I get the following error:

      Query: create EXTERNAL TABLE prepartition LIKE PARQUET '/user/ec2-user/dbsnp/adam_flat_variants/part-r-00000.gz.parquet' STORED AS PARQUET LOCATION 'hdfs:///user/ec2-user/dbsnp/adam_flat_variants'
      ERROR: AnalysisException: Unsupported logical parquet type ENUM (primitive type is BINARY) for field svAllele__type
      

      Attachments

        Activity

          People

            jkukul Jakub Kukul
            laserson Uri Laserson
            Votes:
            1 Vote for this issue
            Watchers:
            10 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: