Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-11576

query_test.test_iceberg.test_multiple_storage_locations fails on S3

    XMLWordPrintableJSON

Details

    Description

      The test seems to fail on a badly constructed file name.
      Stack trace:

      query_test.test_iceberg.TestIcebergTable.test_multiple_storage_locations[protocol: beeswax | exec_option: {'test_replan': 1, 'batch_size': 0, 'num_nodes': 0, 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0} | table_format: parquet/none]
      
      query_test/test_iceberg.py:785: in test_multiple_storage_locations
          vector, unique_database)
      common/impala_test_suite.py:706: in run_test_case
          result = exec_fn(query, user=test_section.get('USER', '').strip() or None)
      common/impala_test_suite.py:644: in __exec_in_impala
          result = self.__execute_query(target_impalad_client, query, user=user)
      common/impala_test_suite.py:980: in __execute_query
          return impalad_client.execute(query, user=user)
      common/impala_connection.py:212: in execute
          return self.__beeswax_client.execute(sql_stmt, user=user)
      beeswax/impala_beeswax.py:189: in execute
          handle = self.__execute_query(query_string.strip(), user=user)
      beeswax/impala_beeswax.py:367: in __execute_query
          self.wait_for_finished(handle)
      beeswax/impala_beeswax.py:388: in wait_for_finished
          raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
      E   ImpalaBeeswaxException: ImpalaBeeswaxException:
      E    Query aborted:Disk I/O error on impala-ec2-centos79-m6i-4xlarge-ondemand-1313.vpc.cloudera.com:27001: Failed to open HDFS file s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations/s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations_data02/col_int=2/00001-1-26bc91ef-b403-4b65-a6b0-566396b8d097-00001.parquet
      E   Error(2): No such file or directory
      E   Root cause: FileNotFoundException: No such file or directory: s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations/s3a:/impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations_data02/col_int=2/00001-1-26bc91ef-b403-4b65-a6b0-566396b8d097-00001.parquet

      Here the file name (this is a single, contuguous string despite the apparent line breaks!)
      s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations/s3a://impala-test-uswest2-2/test-warehouse/iceberg_test/hadoop_catalog/ice/iceberg_multiple_storage_locations_data02/col_int=2/00001-1-26bc91ef-b403-4b65-a6b0-566396b8d097-00001.parquet
      contains the s3a: protocol specifier in the middle of the string, which seems to be a result of a badly set up concatenation.

      Attachments

        Issue Links

          Activity

            People

              gfurnstahl Gergely Fürnstáhl
              laszlog Laszlo Gaal
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: