IMPALA-947

Impala caused datanode storm

Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: Impala 1.2.4
    • Fix Version/s: Product Backlog
    • None
    • Environment: CDH 4.6, Impala 1.2.4, all from parcels

    Description

      Hi, this morning we hit a "datanode storm".
      Impala suddenly started reading blocks from many datanodes, even though we had no running queries. The storm stopped once we restarted the whole Impala service.
      Please see the screenshot for the network activity at the time the storm started.
      The affected datanodes had these log entries:

      Host	Log Level	Time	Source	Message
      prod-node0101.kyc.megafon.ru	INFO	15 April 2014 10:21	clienttrace	src: /10.66.49.226:50010, dest: /10.66.49.225:38969, bytes: 132096, op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_722537315_1, offset: 0, srvID: DS-1895691478-10.66.49.226-50010-1392796222466, blockid: BP-2086241135-10.66.49.155-1363767213272:blk_8166683644010943415_23956636, duration: 1006760
      prod-node0101.kyc.megafon.ru	INFO	15 April 2014 10:21	clienttrace	src: 127.0.0.1, dest: 127.0.0.1, op: REQUEST_SHORT_CIRCUIT_FDS, blockid: 132388256993022462, srvID: DS-1895691478-10.66.49.226-50010-1392796222466, success: true
      prod-node0107.kyc.megafon.ru	INFO	15 April 2014 10:21	clienttrace	src: 127.0.0.1, dest: 127.0.0.1, op: REQUEST_SHORT_CIRCUIT_FDS, blockid: -6297200201966372620, srvID: DS-1792190177-10.66.49.232-50010-1392796222893, success: true
      prod-node0118.kyc.megafon.ru	INFO	15 April 2014 10:21	clienttrace	src: /10.66.49.243:50010, dest: /10.66.49.232:54808, bytes: 132096, op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_453101187_1, offset: 0, srvID: DS-1075945273-10.66.49.243-50010-1392796222416, blockid: BP-2086241135-10.66.49.155-1363767213272:blk_-5476067019874567999_23956435, duration: 1184010
      prod-node0118.kyc.megafon.ru	INFO	15 April 2014 10:21	clienttrace	src: /10.66.49.243:50010, dest: /10.66.49.244:36042, bytes: 132096, op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_-1114160638_1, offset: 0, srvID: DS-1075945273-10.66.49.243-50010-1392796222416, blockid: BP-2086241135-10.66.49.155-1363767213272:blk_1086074988829525783_23956481, duration: 1181419
      prod-node0118.kyc.megafon.ru	INFO	15 April 2014 10:21	clienttrace	src: 127.0.0.1, dest: 127.0.0.1, op: REQUEST_SHORT_CIRCUIT_FDS, blockid: -7620517135406691819, srvID: DS-1075945273-10.66.49.243-50010-1392796222416, success: true
      prod-node0120.kyc.megafon.ru	INFO	15 April 2014 10:21	clienttrace	src: /10.66.49.245:50010, dest: /10.66.49.224:43178, bytes: 132096, op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_-1991936378_1, offset: 0, srvID: DS-1789713258-10.66.49.245-50010-1392796222375, blockid: BP-2086241135-10.66.49.155-1363767213272:blk_1107624047476372504_23956836, duration: 1029509
      prod-node0120.kyc.megafon.ru	INFO	15 April 2014 10:21	clienttrace	src: /10.66.49.245:50010, dest: /10.66.62.89:42025, bytes: 132096, op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_-1092616849_1, offset: 0, srvID: DS-1789713258-10.66.49.245-50010-1392796222375, blockid: BP-2086241135-10.66.49.155-1363767213272:blk_8764079704706642093_23956571, duration: 1583802
      prod-node028.kyc.megafon.ru	INFO	15 April 2014 10:21	clienttrace	src: /10.66.49.183:50010, dest: /10.66.62.59:43322, bytes: 132096, op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_-70190780_1, offset: 0, srvID: DS-1141421786-10.66.49.183-50010-1363767221252, blockid: BP-2086241135-10.66.49.155-1363767213272:blk_6336672533820089443_23956530, duration: 1421004
      

      View Log File:

      10:21:26.601	INFO	org.apache.hadoop.hdfs.server.datanode.DataNode.clienttrace	
      src: /10.66.49.226:50010, dest: /10.66.49.225:38969, bytes: 132096, op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_722537315_1, offset: 0, srvID: DS-1895691478-10.66.49.226-50010-1392796222466, blockid: BP-2086241135-10.66.49.155-1363767213272:blk_8166683644010943415_23956636, duration: 1006760
      10:21:26.606	INFO	org.apache.hadoop.hdfs.server.datanode.DataNode.clienttrace	
      src: 127.0.0.1, dest: 127.0.0.1, op: REQUEST_SHORT_CIRCUIT_FDS, blockid: 132388256993022462, srvID: DS-1895691478-10.66.49.226-50010-1392796222466, success: true
      10:21:26.607	INFO	org.apache.hadoop.hdfs.server.datanode.DataNode.clienttrace	
      src: /10.66.49.226:50010, dest: /10.66.49.241:33859, bytes: 132096, op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_1845184176_1, offset: 0, srvID: DS-1895691478-10.66.49.226-50010-1392796222466, blockid: BP-2086241135-10.66.49.155-1363767213272:blk_2940094362365288428_23956714, duration: 1005634
      

      The "dest" address on the other host belongs to the Impala process (checked with ps -ef and netstat -alnpt).
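
      For anyone triaging a similar storm, the clienttrace messages quoted in this report can be tallied by operation and by destination host to see which clients are pulling data. The following is only a minimal sketch in Python: the function names (parse_clienttrace, host_of, summarize) are hypothetical helpers, not part of Hadoop or Impala, and the key/value layout is assumed to match the messages shown in this report.

```python
import re
from collections import Counter

# Matches "key: value" pairs; values run up to the next comma.
# Assumption: the layout matches the clienttrace messages in this report.
FIELD_RE = re.compile(r"(\w+): ([^,]+)")


def parse_clienttrace(message):
    """Return the key/value fields of one clienttrace message as a dict."""
    return {key: value.strip() for key, value in FIELD_RE.findall(message)}


def host_of(endpoint):
    """Strip the leading '/' and the ':port' suffix from an endpoint."""
    return endpoint.lstrip("/").rsplit(":", 1)[0]


def summarize(messages):
    """Count operations, and sum HDFS_READ bytes per destination host."""
    ops = Counter()
    read_bytes = Counter()
    for message in messages:
        fields = parse_clienttrace(message)
        ops[fields.get("op", "UNKNOWN")] += 1
        if fields.get("op") == "HDFS_READ":
            read_bytes[host_of(fields["dest"])] += int(fields["bytes"])
    return ops, read_bytes


# Three messages copied from the log entries in this report.
SAMPLE = [
    "src: /10.66.49.226:50010, dest: /10.66.49.225:38969, bytes: 132096, "
    "op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_722537315_1, offset: 0, "
    "srvID: DS-1895691478-10.66.49.226-50010-1392796222466, "
    "blockid: BP-2086241135-10.66.49.155-1363767213272:"
    "blk_8166683644010943415_23956636, duration: 1006760",
    "src: 127.0.0.1, dest: 127.0.0.1, op: REQUEST_SHORT_CIRCUIT_FDS, "
    "blockid: 132388256993022462, "
    "srvID: DS-1895691478-10.66.49.226-50010-1392796222466, success: true",
    "src: /10.66.49.243:50010, dest: /10.66.49.232:54808, bytes: 132096, "
    "op: HDFS_READ, cliID: DFSClient_NONMAPREDUCE_453101187_1, offset: 0, "
    "srvID: DS-1075945273-10.66.49.243-50010-1392796222416, "
    "blockid: BP-2086241135-10.66.49.155-1363767213272:"
    "blk_-5476067019874567999_23956435, duration: 1184010",
]

if __name__ == "__main__":
    ops, read_bytes = summarize(SAMPLE)
    for op, count in ops.most_common():
        print(op, count)
    for host, total in read_bytes.most_common():
        print(host, total)
```

      Run over a full datanode log, the per-destination totals would point at the hosts whose processes are issuing the reads, which can then be matched to a process with ps and netstat as described above.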

          People

            Assignee: Matthew Jacobs (mjacobs)
            Reporter: Sergey (serega_sheypak)