Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-4156

Parquet group converter cannot cope with ENUM original type values

    XMLWordPrintableJSON

Details

    • Patch

    Description

      When importing parquet data written using a Thrift schema, Drill cannot cope with ENUMs. This was fixed elsewhere in DRILL-1775 but not in the DrillParquetGroupConverter class (see mailing list thread).

      It appears that the ParquetToDrillTypeConverter class was updated to use a varbinary as it's default behaviour, but the DrillParquetGroupConverter class was not updated to do the same.

      Caused by: java.lang.UnsupportedOperationException: Unsupported type ENUM
      	at org.apache.drill.exec.store.parquet2.DrillParquetGroupConverter.getConverterForType(DrillParquetGroupConverter.java:249)
      	at org.apache.drill.exec.store.parquet2.DrillParquetGroupConverter.<init>(DrillParquetGroupConverter.java:154)
      	at org.apache.drill.exec.store.parquet2.DrillParquetGroupConverter.<init>(DrillParquetGroupConverter.java:147)
      	at org.apache.drill.exec.store.parquet2.DrillParquetGroupConverter.<init>(DrillParquetGroupConverter.java:147)
      	at org.apache.drill.exec.store.parquet2.DrillParquetGroupConverter.<init>(DrillParquetGroupConverter.java:147)
      	at org.apache.drill.exec.store.parquet2.DrillParquetRecordMaterializer.<init>(DrillParquetRecordMaterializer.java:40)
      	at org.apache.drill.exec.store.parquet2.DrillParquetReader.setup(DrillParquetReader.java:267)
      	... 14 more
      

      Attachments

        1. parquet_enum_v2.patch
          2 kB
          Chris Jansen
        2. parquet_enum.patch
          2 kB
          Chris Jansen

        Issue Links

          Activity

            People

              Unassigned Unassigned
              janstenpickle Chris Jansen
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: