Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-817

Parquet file created from impala using data from an impala table causes drill to hang

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None
    • 0.4.0
    • Storage - Parquet
    • None

    Description

      A normal table was created with impala:

      [localhost.localdomain:21000] > create table integer_table2 (keycolumn string, column1 int) row format delimited fields terminated by ',';
      Query: create table integer_table2 (keycolumn string, column1 int) row format delimited fields terminated by ','
      
      Returned 0 row(s) in 0.10s
      [localhost.localdomain:21000] > show tables;
      Query: show tables
      +-----------------------+
      | name                  |
      +-----------------------+
      | integer_table2        |
      +-----------------------+
      Returned 1 row(s) in 0.01s
      

      Data from a csv is loaded into the table

      46:44 PDT 2014)
      [localhost.localdomain:21000] > load data inpath '/user/hive/warehouse/Integer_Table.csv' overwrite into table integer_table2;
      Query: load data inpath '/user/hive/warehouse/Integer_Table.csv' overwrite into table integer_table2
      +----------------------------------------------------------+
      | summary                                                  |
      +----------------------------------------------------------+
      | Loaded 1 file(s). Total files in destination location: 1 |
      +----------------------------------------------------------+
      Returned 1 row(s) in 0.23s
      

      A parquet table was then created in impala and data loaded in from the first table:

      [localhost.localdomain:21000] > create table integer_table (keycolumn string, column1 int) row format delimited fields terminated by ',' stored as parquet;
      Query: create table integer_table (keycolumn string, column1 int) row format delimited fields terminated by ',' stored as parquet
      
      Returned 0 row(s) in 0.10s
      [localhost.localdomain:21000] > insert overwrite table integer_table select * from integer_table2;
      Query: insert overwrite table integer_table select * from integer_table2
      Inserted 18 rows in 0.20s
      

      The parquet file (in the format <long alphanumeric string>_data.0 and renamed) was extracted from hdfs and moved to drill's dfs. The query through sqlline hangs:

      0: jdbc:drill:schema=hivestg> select * from `dfs`.`/opt/drill/integer.parquet`;
      <hang>
      

      Attachments

        1. Integer_Table.csv
          0.3 kB
          Norris Lee

        Activity

          People

            DrillCommitter DrillCommitter
            norrisl Norris Lee
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: