Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-5495

convert_from function on top of int96 data results in ArrayIndexOutOfBoundsException

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • 1.10.0
    • 1.14.0
    • None

    Description

      git.commit.id.abbrev=1e0a14c

      The data set used is generated from spark and contains a timestamp stored as int96

      [root@qa-node190 framework]# /home/parquet-tools-1.5.1-SNAPSHOT/parquet-meta /home/framework/framework/resources/Datasources/parquet_date/spark_generated/d4/part-r-00000-08c5c621-62ea-4fee-b690-11576eddc39c.snappy.parquet 
      creator:     parquet-mr (build 32c46643845ea8a705c35d4ec8fc654cc8ff816d) 
      extra:       org.apache.spark.sql.parquet.row.metadata = {"type":"struct","fields":[{"name":"a","type":"integer","nullable":true,"metadata":{}},{"name":"b","type":"strin [more]...
      
      file schema: spark_schema 
      -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      a:           OPTIONAL INT32 R:0 D:1
      b:           OPTIONAL BINARY O:UTF8 R:0 D:1
      c:           OPTIONAL INT32 O:DATE R:0 D:1
      d:           OPTIONAL INT96 R:0 D:1
      
      row group 1: RC:10000 TS:8661 
      -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
      a:            INT32 SNAPPY DO:0 FPO:4 SZ:2367/2571/1.09 VC:10000 ENC:RLE,PLAIN,BIT_PACKED
      b:            BINARY SNAPPY DO:0 FPO:2371 SZ:2329/2843/1.22 VC:10000 ENC:RLE,PLAIN_DICTIONARY,BIT_PACKED
      c:            INT32 SNAPPY DO:0 FPO:4700 SZ:1374/1507/1.10 VC:10000 ENC:RLE,PLAIN,BIT_PACKED
      d:            INT96 SNAPPY DO:0 FPO:6074 SZ:1597/1740/1.09 VC:10000 ENC:RLE,PLAIN_DICTIONARY,BIT_PACKED
      

      The below query fails with an ArrayIndexOutOfBoundsException

      select convert_from(d, 'TIMESTAMP_IMPALA') from dfs.`/drill/testdata/resource-manager/d4`;
      
      Fails with below error after displaying a bunch of records
      Error: SYSTEM ERROR: ArrayIndexOutOfBoundsException: 0
      
      Fragment 1:0
      
      [Error Id: f963f6c0-3306-49a6-9d98-a193c5e7cfee on qa-node190.qa.lab:31010] (state=,code=0)
      

      Attached the logs, profiles and data files

      Attachments

        1. 26edf56f-6bc6-1e1f-5aa4-d98aec858a4a.sys.drill
          3 kB
          Rahul Kumar Challapalli
        2. d4.tar.gz
          56 kB
          Rahul Kumar Challapalli
        3. drillbit.log
          330 kB
          Rahul Kumar Challapalli

        Issue Links

          Activity

            People

              vitalii Vitalii Diravka
              rkins Rahul Kumar Challapalli
              Arina Ielchiieva Arina Ielchiieva
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: