Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-2468

5.5.x full dataload crashed while querying lineitem_multiblock

    XMLWordPrintableJSON

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Blocker
    • Resolution: Fixed
    • Affects Version/s: Impala 2.3.0
    • Fix Version/s: Impala 2.3.0
    • Component/s: None
    • Labels:
      None

      Description

      The full dataload on 5.5.x looks like it crashed last night while executing queries against lineitem_multiblock.

      04:04:53.434 -- executing against localhost:21000
      04:04:53.434 SELECT count(*) from lineitem_multiblock;
      04:04:53.434 
      04:04:53.434 -- executing against localhost:21000
      04:04:53.434 SELECT count(l_comment), min(l_partkey), max(l_linenumber) from lineitem_multiblock;
      04:04:53.437 
      04:04:53.437  TestTpchMemLimitError.test_low_mem_limit_q20[mem_limit: 180 | exec_option: {'disable_codegen': False, 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0, 'batch_size': 0, 'num_nodes': 0} | table_format: parquet/none] 
      04:04:53.438 [gw2] linux2 -- Python 2.6.6 /data/2/jenkins/workspace/impala-cdh5.5.x-nightly-data-load/repos/Impala/bin/../infra/python/env/bin/python
      04:04:53.439 query_test/test_mem_usage_scaling.py:184: in test_low_mem_limit_q20
      04:04:53.439     self.low_memory_limit_test(vector, 'tpch-q20', self.MIN_MEM_FOR_TPCH['Q20']);
      04:04:53.439 query_test/test_mem_usage_scaling.py:96: in low_memory_limit_test
      04:04:53.439     assert TestLowMemoryLimits.EXPECTED_ERROR_MSG in str(e)
      04:04:53.439 E   assert 'Memory limit exceeded' in "ImpalaBeeswaxException:\n INNER EXCEPTION: <class 'socket.error'>\n MESSAGE: [Errno 104] Connection reset by peer"
      04:04:53.439 E    +  where 'Memory limit exceeded' = TestLowMemoryLimits.EXPECTED_ERROR_MSG
      04:04:53.439 E    +  and   "ImpalaBeeswaxException:\n INNER EXCEPTION: <class 'socket.error'>\n MESSAGE: [Errno 104] Connection reset by peer" = str(ImpalaBeeswaxException())
      04:04:53.439 ----------------------------- Captured stderr call -----------------------------
      

      There was also a failure like this:
      http://sandbox.jenkins.cloudera.com/view/Impala/view/Builds%20-%202.3.0%20Release/job/impala-cdh5.5.x-nightly-data-load/32/testReport/junit/query_test.test_scanners/TestParquet/test_parquet_exec_option____disable_codegen___False___abort_on_error___1___exec_single_node_rows_threshold___0___batch_size___0___num_nodes___0____table_format__parquet_none_/

      E   ImpalaBeeswaxException: ImpalaBeeswaxException:
      E    Query aborted:
      E   Column chunk should not contain two dictionary pages.
      E   
      E   
      E   
      E   Column chunk should not contain two dictionary pages.
      

      Sailesh Mukil, please take a look and see how quickly you think the fix will be. If it will take a while to resolve, we may need to back out the parquet multiblock change.

        Attachments

          Activity

            People

            • Assignee:
              tarmstrong Tim Armstrong
              Reporter:
              dhecht Dan Hecht
            • Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: