HIVE-25420: Ignore time type column in Iceberg testing for vectorized runs

Details

    Description

      Time is a valid type in Iceberg but not in Hive. Hive represents it as a string column, whereas the data files store it as a long (at least when ORC is the underlying file format).

      Reading such a column therefore requires two translations: long@ORC -> LocalTime@Iceberg -> toString()@Hive. This works well for non-vectorized reads, but when vectorization is turned on, the read fails with:

      org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector cannot be cast to org.apache.hadoop.hive.ql.exec.vector.LongColumnVector 

      Thus, for now, the time type is not supported with vectorization, and the relevant test cases should be ignored in vectorized test configurations.
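
      The two-step translation and the proposed test exclusion can be sketched roughly as follows. This is a minimal, self-contained illustration and not the actual Hive or Iceberg code: it assumes the long stored in ORC is microseconds since midnight and that Iceberg surfaces the value as a java.time.LocalTime. The class TimeColumnSketch and the helper shouldSkip are hypothetical names introduced only for this example.

```java
import java.time.LocalTime;

// Hypothetical sketch; none of these names exist in Hive or Iceberg.
class TimeColumnSketch {

    // Step 1 (assumption): the ORC file holds the time as microseconds since midnight,
    // which is turned into a LocalTime on the Iceberg side.
    public static LocalTime microsToLocalTime(long microsOfDay) {
        return LocalTime.ofNanoOfDay(microsOfDay * 1_000L);
    }

    // Step 2: Hive has no time type, so the value is exposed as its string form.
    public static String toHiveString(long microsOfDay) {
        return microsToLocalTime(microsOfDay).toString();
    }

    // Hypothetical guard a test could consult: skip time columns when vectorization is on.
    public static boolean shouldSkip(boolean vectorized, String icebergTypeName) {
        return vectorized && "time".equals(icebergTypeName);
    }

    public static void main(String[] args) {
        long micros = (12L * 3600 + 34 * 60 + 56) * 1_000_000L; // 12:34:56 as micros of day
        System.out.println(toHiveString(micros));     // prints "12:34:56"
        System.out.println(shouldSkip(true, "time"));  // prints "true"
        System.out.println(shouldSkip(false, "time")); // prints "false"
    }
}
```

      In a real test suite the guard would more likely be expressed through the test framework's own skip mechanism; the boolean helper here only shows the condition being checked.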

            People

              szita Ádám Szita
              Votes: 0
              Watchers: 2

                Time Tracking

                  Estimated: Not Specified
                  Remaining: 0h
                  Logged: 0.5h