Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-10021

Rewrite ds_kll_* functions to use Binary

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Open
    • Major
    • Resolution: Unresolved
    • None
    • None
    • Backend, Frontend
    • None

    Description

      Once Impala supports BINARY type the ds_kll_* could use it instead of STRING to receive/return serialized KLL sketches.

      To follow the BINARY support implementation - https://issues.apache.org/jira/browse/IMPALA-9579

      Currently when sketches are written by Hive as BINARY to ORC table and this table is loaded to Impala where the sketch columns are STRINGs then we get an error

      ERROR: Type mismatch: table column STRING is map to column binary in ORC file
      

      Interestingly the works with Parquet format. Once we have binary support make sure to add coverage for ORC table where the table is created and populated by Hive and read for estimating by Impala.

      Attachments

        Activity

          People

            Unassigned Unassigned
            gaborkaszab Gabor Kaszab
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated: